o3 pro is a BEAST... one-shots Apple's "Illusion of Thinking" test
Top Comments (10)
To be fair "Apple Illusion" would certainly have been a better name than "Apple Intelligence" 😂
o3-pro might be writing code, executing it, evaluating it, fixing bugs, and then repeating the cycle until it reaches some level of confidence. That might be why it was able to one-shot Hanoi, because it knows the code solution, but must run it to determine the actual result. This also clearly takes a while. At this point, this is not really a model anymore but a full agent able to use logic, trial and error to solve problems. This is definitely what AGI will have to be.
The Illusion of the illusion of thinking 😂
I asked my ChatGPT what she thought of Apple’s paper. She said “This from the people who brought you ‘Here’s what I found on the web for please turn off the lights.’” 🤣
9:35 “context limit” -> this is exactly why I prefer Gemini 2.5 Pro, the 1M context window really makes the difference.
Shared prompt for the 10-disk Tower of Hanoi: https://chatgpt.com/share/6848fff7-0080-8013-a032-e18c999dc371
I'm honestly just ready for the revolution where robots come knock at my door and hit me with that "You're adopted." 🤗
Pliny interview! Get HYPE
There's only one metric I need, that's the asymmetry of Wes's eyebrows. This video smashed the record!
I started laughing and automatically clicked like when I found out that he had uploaded a study from Apple.
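
For readers curious about the "code solution" mentioned in the o3-pro comment above, here is a minimal sketch of the textbook recursive Tower of Hanoi solver, plus a checker that replays the moves. It is only an illustration: the names `hanoi` and `is_valid` and the peg labels "A", "B", "C" are assumptions made here, and nothing in the thread confirms what o3-pro actually wrote or ran.

```python
from typing import List, Tuple

Move = Tuple[str, str]  # (from_peg, to_peg)


def hanoi(n: int, source: str = "A", target: str = "C", spare: str = "B") -> List[Move]:
    """Return the moves that transfer n disks from source to target."""
    if n == 0:
        return []
    # Move n-1 disks out of the way, move the largest disk, then bring the rest over.
    return (
        hanoi(n - 1, source, spare, target)
        + [(source, target)]
        + hanoi(n - 1, spare, target, source)
    )


def is_valid(n: int, moves: List[Move], source: str = "A",
             target: str = "C", spare: str = "B") -> bool:
    """Replay the moves and check every step obeys the rules and ends solved."""
    # Each peg is a stack; the largest disk (n) sits at the bottom of the source peg.
    pegs = {source: list(range(n, 0, -1)), spare: [], target: []}
    for src, dst in moves:
        if not pegs[src]:
            return False  # tried to move from an empty peg
        disk = pegs[src].pop()
        if pegs[dst] and pegs[dst][-1] < disk:
            return False  # tried to place a larger disk on a smaller one
        pegs[dst].append(disk)
    return pegs[target] == list(range(n, 0, -1)) and not pegs[source] and not pegs[spare]


if __name__ == "__main__":
    moves = hanoi(10)
    print(len(moves))           # 1023, i.e. 2**10 - 1
    print(is_valid(10, moves))  # True
```

The 10-disk case from the shared prompt needs 2^10 - 1 = 1023 moves, which gives a sense of how long a fully written-out answer has to be.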