Home
Channel
Wes Roth
o3 pro is a BEAST... one-shots Apple's "Illusion of Thinking" test

o3 pro is a BEAST... one-shots Apple's "Illusion of Thinking" test

2025-06-11 Education

90.3k

2.7k

521

Watch on YouTube

Wes Roth

323.0k subscribers

Description

The latest AI News. Learn about LLMs, Gen AI and get ready for the rollout of AGI. Wes Roth covers the latest happenings in the world of OpenAI, Google, Anthropic, NVIDIA and Open Source AI. My Links 🔗 ➡️ Subscribe: https://www.youtube.com/@WesRoth?sub_confirmation=1 ➡️ Twitter: https://x.com/WesRothMoney ➡️ AI Newsletter: https://natural20.beehiiv.com/subscribe VIDEO LINKS: https://help.openai.com/en/articles/9624314-model-release-notes https://www.latent.space/p/o3-pro #ai #openai #llm

Top Comments (10)

@ArsMoriendiRevival 2025-06-11

To be fair "Apple Illusion" would certainly have been a better name than "Apple Intelligence" 😂

286 9 replies

@ricosrealm 2025-06-11

o3-pro might be writing code, executing it, evaluating it, fixing bugs, and then repeating the cycle until it reaches some level of confidence. That might be why it was able to one-shot Hanoi, because it knows the code solution, but must run it to determine the actual result. This also clearly takes a while. At this point, this is not really a model anymore but a full agent able to use logic, trial and error to solve problems. This is definitely what AGI will have to be.

133 22 replies

@djayjp 2025-06-11

The Illusion of the illusion of thinking 😂

85 2 replies

@LydianMelody 2025-06-11

I asked my ChatGPT what she thought of Apple’s paper. She said “This from the people who brought you ‘Here’s what I found on the web for please turn off the lights.’” 🤣

70 5 replies

@genai-level-up 2025-06-11

9:35 “context limit” -> this is exactly why I prefer Gemini 2.5 pro, the 1M context window really make the difference.

39 6 replies

@serqetry 2025-06-11

You should ask o3-pro to write a program that will take the output codes it gave for the Towers of Hanoi solution and actually animate it, and prove that the solution works. I imagine it does, but that would be a double impressive test.

39 3 replies

@WesRoth 2025-06-11

shared prompt for the 10 disk Tower of Hanoi: https://chatgpt.com/share/6848fff7-0080-8013-a032-e18c999dc371

15 2 replies

@denjamin2633 2025-06-11

Pliny interview! Get HYPE

@74Gee 2025-06-11

There's only one metric I need, that's the asymmetry of Wes's eyebrows. This video smashed the record!

@Future_me_66525 2025-06-12

A part two of this video is necessary keep us updated please

Description

Top Comments (10)

@ArsMoriendiRevival 2025-06-11

To be fair "Apple Illusion" would certainly have been a better name than "Apple Intelligence" 😂

286 9 replies

@ricosrealm 2025-06-11

133 22 replies

@djayjp 2025-06-11

The Illusion of the illusion of thinking 😂

85 2 replies

@LydianMelody 2025-06-11

I asked my ChatGPT what she thought of Apple’s paper. She said “This from the people who brought you ‘Here’s what I found on the web for please turn off the lights.’” 🤣

70 5 replies

@genai-level-up 2025-06-11

9:35 “context limit” -> this is exactly why I prefer Gemini 2.5 pro, the 1M context window really make the difference.

39 6 replies

@serqetry 2025-06-11

39 3 replies

@WesRoth 2025-06-11

shared prompt for the 10 disk Tower of Hanoi: https://chatgpt.com/share/6848fff7-0080-8013-a032-e18c999dc371

15 2 replies

@denjamin2633 2025-06-11

Pliny interview! Get HYPE

@74Gee 2025-06-11

There's only one metric I need, that's the asymmetry of Wes's eyebrows. This video smashed the record!

@Future_me_66525 2025-06-12

A part two of this video is necessary keep us updated please

Unlock the Data Inside
Turn Videos into Knowledge

Get FREE 10/day: transcripts, summaries, chats
Chat with videos, export text & PDF
$1 free API credit for RAG, chatbots & research

Try it free

Free forever plan • All features unlocked

o3 pro is a BEAST... one-shots Apple's "Illusion of Thinking" test

Description

Top Comments (10)

Related videos

GEMINI 3.1 PRO is the new era...

OPUS 4.6 is a bit "TOO SMART"

OPUS 4.6 thinks it's "DEMON POSSESSED"

Experiments Hint on Time Being an Illusion

Google's new AI project is UNREAL

What's Going on with Apple Vision Pro?

Apple is giving up on the Vision Pro

GPT 5 Codex is a BEAST Autonomous Coding Agent

GPT-5 Just ONE-SHOT The World

I Tested The Weirdest Running Products On The Internet

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Related videos

GEMINI 3.1 PRO is the new era...

OPUS 4.6 is a bit "TOO SMART"

OPUS 4.6 thinks it's "DEMON POSSESSED"

Experiments Hint on Time Being an Illusion

Google's new AI project is UNREAL

What's Going on with Apple Vision Pro?

Apple is giving up on the Vision Pro

GPT 5 Codex is a BEAST Autonomous Coding Agent

GPT-5 Just ONE-SHOT The World

I Tested The Weirdest Running Products On The Internet

Description

Top Comments (10)

Unlock the Data Inside
Turn Videos into Knowledge

o3 pro is a BEAST... one-shots Apple's "Illusion of Thinking" test

Description

Top Comments (10)

Related videos

GEMINI 3.1 PRO is the new era...

OPUS 4.6 is a bit "TOO SMART"

OPUS 4.6 thinks it's "DEMON POSSESSED"

Experiments Hint on Time Being an Illusion

Google's new AI project is UNREAL

What's Going on with Apple Vision Pro?

Apple is giving up on the Vision Pro

GPT 5 Codex is a BEAST Autonomous Coding Agent

GPT-5 Just ONE-SHOT The World

I Tested The Weirdest Running Products On The Internet

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Related videos

GEMINI 3.1 PRO is the new era...

OPUS 4.6 is a bit "TOO SMART"

OPUS 4.6 thinks it's "DEMON POSSESSED"

Experiments Hint on Time Being an Illusion

Google's new AI project is UNREAL

What's Going on with Apple Vision Pro?

Apple is giving up on the Vision Pro

GPT 5 Codex is a BEAST Autonomous Coding Agent

GPT-5 Just ONE-SHOT The World

I Tested The Weirdest Running Products On The Internet

Description

Top Comments (10)

Unlock the Data Inside Turn Videos into Knowledge

Unlock the Data Inside
Turn Videos into Knowledge