Navigate Select ESC Close

How DeepSeek Built The Current "Best" Math Prover AI

2025-06-02 Science & Technology
36.9k
1.8k
130
bycloud
bycloud
225.0k subscribers

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

Get started now with privacy focused VPN by Proton! https://proton.me/pass/bycloudai My Newsletter https://mail.bycloud.ai/ my project: find, discover & explain AI research semantically https://findmypapers.ai/ My Patreon https://www.patreon.com/c/bycloud DeepSeek-Prover-V2 [Paper] https://arxiv.org/abs/2504.21801 [V3 report] https://arxiv.org/abs/2412.19437 Kimina Prover [Paper] https://arxiv.org/abs/2504.11354 Putnam Bench [Question ex] https://prase.cz/kalva/putnam.html USAMO & AIME bench [Math Arena Leaderboard] https://matharena.ai/ Try out my new fav place to learn how to code https://scrimba.com/?via=bycloudAI This video is supported by the kind Patrons & YouTube Members: 🙏 Chris LeDoux, Ben Shaener, DX Research Group, Poof N' Inu, Deagan, Robert Zawiasa, Tobe2d, Louis Muk, Akkusativ, Kevin Tai, Mark Buckler, NO U, Tony Jimenez, Ângelo Fonseca, jiye, Anushka, Asad Dhamani, Binnie Yiu, Calvin Yan, Clayton Ford, Diego Silva, Etrotta, Gonzalo Fidalgo, Handenon, Hector, Jake Disco very, Michael Brenner, Nilly K, OlegWock, Daddy Wen, Shuhong Chen, Sid_Cipher, Stefan Lorenz, Sup, tantan assawade, Thipok Tham, Thomas Di Martino, Thomas Lin, Richárd Nagyfi, Paperboy, mika, Leo, Berhane-Meskel, Kadhai Pesalam, mayssam, Bill Mangrum, nyaa [Discord] https://discord.gg/NhJZGtH [Twitter] https://twitter.com/bycloudai [Patreon] https://www.patreon.com/bycloud [Business Inquiries] [email protected] [Profile & Banner Art] https://twitter.com/pygm7 [Video Editor] @Booga04 [Ko-fi] https://ko-fi.com/bycloudai

Top Comments (10)

@augustday9483 2025-06-02

Seems like this method of recursively breaking down problems into smaller sub-problems until triviality is achieved could be applied to many problem domains, not just math.

249 8 replies
@ainet8415 2025-06-02

hey cloud can you cover every deepseek paper because these people are brilliant

112
@wpelfeta 2025-06-02

I love seeing open source winning.

110 9 replies
@thecasualpolemic7572 2025-06-03

3:58 the “there exists delta” and “for all epsilon” need to switch places

14 1 replies
@simonthehedgehog928 2025-06-07

There's a significant mistake in the video (though admittedly not just in the video, which stems from the original paper). As discussed in the Lean Zulip channel, the 7b model simply picked up on an issue in the Lean proof assistant v4.9 when it solved these 13 Putnam problems the bigger model could not solve. Basically, some 'apply?' leads to a proof silently being sorried which is not shown by Lean in the final result. This was fixed in later Lean versions. So the 7b model is simply reward hacking these 13 problems. 49 is the final result, NOT 62.

12
@darian.rosebrook 2025-06-02

Now we need the same thing with assumed facts in generated text responses. How do you verify a statement? How do you train it to say “I’m not sure, let me check” instead of stating everything full chested as fact

4
@Unitedpenguin0920 2025-06-03

Rare sponsorship W

4
@Terenfear 2025-06-03

I fucking love that small clip you always use at the beginning on "before we dive into it", no sarcasm.

2
@Daveooooooooooo0 2025-06-25

I love the ending Patreon names

0
@lukephillips7239 2025-06-04

I think it is safe to say that the DeepSeek team consists of the smartest researches in the world. These guys just hit it out of the park every single time.

0

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot