Navigate Select ESC Close

OpenAI just solved math

2025-07-19 Education
81.3k
2.4k
710
Wes Roth
Wes Roth
320.0k subscribers

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

Today OpenAI announced that an experimental large‑language model (LLM) achieved a gold‑medal score on the IMO 2025. This result represents the first time an AI system operating purely in natural language has reached gold‑medal performance on the IMO, a long‑standing “grand challenge” benchmark for mathematical reasoning. Reddit post with details: https://www.reddit.com/r/AIGuild/comments/1m48bwp/openai_achieved_imo_gold_with_experimental/ My Blog Post About it: https://natural20.com/openai-imo-gold-medal-2025/ Noam Brown https://x.com/polynoamial/status/1946478249187377206 Alexander Wei https://x.com/alexwei_/status/1946477758605103286 GitHub - The Solved Problems https://github.com/aw31/openai-imo-2025-proofs/blob/main/problem_5.txt Nat McAleese https://x.com/__nmca__/status/1946507122369335734 IMO challenge bet with Eliezer https://www.lesswrong.com/posts/sWLLdG6DWJEy3CH7n/imo-challenge-bet-with-eliezer Emad is this the final eval 🤓 🫴🦋 https://x.com/EMostaque/status/1946591753819312302 Gary Marcus https://x.com/GaryMarcus/status/1946615636203057413 AI wins IMO gold medal in 2025? https://polymarket.com/event/ai-wins-math-olympiad-in-2025 Measuring AI Ability to Complete Long Tasks https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/ AlphaEvolve https://deepmind.google/discover/blog/alphaevolve-a-gemini-powered-coding-agent-for-designing-advanced-algorithms/ AI achieves silver-medal standard solving International Mathematical Olympiad problems https://deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/ Detecting misbehavior in frontier reasoning models https://openai.com/index/chain-of-thought-monitoring/ ______________________________________________ My Links 🔗 ➡️ Twitter: https://x.com/WesRothMoney ➡️ AI Newsletter: https://natural20.beehiiv.com/subscribe ______________________________________________ AI TOOLS: (these are tools I use and recommend, some of these are affiliate links) ElevenLabs for AI Voices https://try.elevenlabs.io/ggjim0jxr70r ______________________________________________ Playlists: My Interviews With AI Experts: https://www.youtube.com/playlist?list=PLb1th0f6y4XSKLYenSVDUXFjSHsZTTfhk Self-Improving AI: https://www.youtube.com/playlist?list=PLb1th0f6y4XSMXWaslDCmxxeDLyp_uK8n ______________________________________________ #ai #openai #llm

Top Comments (10)

@patruff 2025-07-20

AI last year: "There are 3 Rs in strawberry" AI this year: "I just solved math bro"

217 27 replies
@Nightfallとばり 2025-07-19

Can't imagine what all of these models will look like in 2-3 years from now...

96 23 replies
@KosmoKnot7 2025-07-20

The next S-Curve is quality of automated self-improvement.

66 5 replies
@simonmurray-girard6209 2025-07-20

Neuro plasticity is the next massive gain, when a model can learn and improve its own weights I think.

64 6 replies
@redleader7988 2025-07-20

Holy cow. This is a diffusion model. That's exactly what it is. I've seen this exact language style in diffusion language models before. I called it here first.

61 12 replies
@user-jf9rz6pp5l 2025-07-20

I'm not sure how excited we should be about this—last year, OpenAI introduced o3-preview, which almost "solved" competitive coding, but it didn't hold up as well with real-world coding tasks.

42 3 replies
@zzzaaayyynnn 2025-07-19

This isn't about math. It’s about who or what can think and how fast that power is evolving. “Many details hard. Fudge possible. Need handle cases. Done.”

22 2 replies
@kyatt_ 2025-07-19

Next S-curve might be energy-based transformers, look into that Wes :)

10
@AAjax 2025-07-20

the lack of caps is largely a texting culture thing, or in my case, a unix admin thing.

4
@aGoshawk 2025-07-19

Cant wait to use this for my theory.

1

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot