Terence Tao (@tao@mathstodon.xyz)
mathstodon.xyz
external-link
I have played a little bit with OpenAI's new iteration of GPT, GPT-o1, which performs an initial reasoning step before running the LLM. It is certainly a more capable tool than previous iterations, though still struggling with the most advanced research mathematical tasks. Here are some concrete experiments (with a prototype version of the model that I was granted access to). In https://chatgpt.com/share/2ecd7b73-3607-46b3-b855-b29003333b87 I repeated an experiment from https://mathstodon.xyz/@tao/109948249160170335 in which I asked GPT to answer a vaguely worded mathematical query which could be solved by identifying a suitable theorem (Cramer's theorem) from the literature. Previously, GPT was able to mention some relevant concepts but the details were hallucinated nonsense. This time around, Cramer's theorem was identified and a perfectly satisfactory answer was given. (1/3)

The experience seemed roughly on par with trying to advise a mediocre, but not completely incompetent, graduate student. However, this was an improvement over previous models, whose capability was closer to an actually incompetent graduate student. It may only take one or two further iterations of improved capability (and integration with other tools, such as computer algebra packages and proof assistants) until the level of “competent graduate student” is reached, at which point I could see this tool being of significant use in research level tasks.

@qooqie@lemmy.world
link
fedilink
English
143M

Using GPT without appearing like an idiot takes a competent grad student

@jsomae@lemmy.ml
creator
link
fedilink
English
33M

This I can believe tbh. It’s a very useful tool in the hands of an expert. Otherwise it’s like giving a chimp a gun.

Maybe this is why I am surprised at people’s hatred of ChatGPT. It’s borne of misuse of a tool for experts, like newcomers struggling with a C++ compiler error.

dinckel
link
fedilink
English
263M

I genuinely hate this statement. A competent grad student can solve problems. GPT cannot solve anything, as all it does is put together the shit it stole from somewhere before

O1 is (apparently) different according to some videos I watched, as it pulls apart the question and does some reasoning steps.

@aodhsishaj@lemmy.world
link
fedilink
English
93M

I’d love to see one of those videos

@jsomae@lemmy.ml
creator
link
fedilink
English
13M

like, a video of Tao giving a demonstration?

@aodhsishaj@lemmy.world
link
fedilink
English
13M

@NegentropicBoy English20•

O1 is (apparently) different according to some videos I watched, as it pulls apart the question …

Yes

Create a post

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


  • 1 user online
  • 179 users / day
  • 403 users / week
  • 1.13K users / month
  • 3.98K users / 6 months
  • 1 subscriber
  • 7.77K Posts
  • 87.7K Comments
  • Modlog