Navigate Select ESC Close

How did a 27M Model even beat ChatGPT?

2025-12-04 Science & Technology
193.9k
9.7k
650
bycloud
bycloud
225.0k subscribers

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

Check out HubSpot's AI Decoded Guide: https://clickhubspot.com/c7a843 A tiny 27M parameter “recursive” model is suddenly rivaling frontier LLMs on the ARC AGI benchmark, challenging & revealing how iterative thought, not size, is also capable of solving hard logical tasks. Papers [HRM] https://arxiv.org/abs/2506.21734 [TRM] https://arxiv.org/abs/2510.04871 My Newsletter https://mail.bycloud.ai/ my project: find, discover & explain AI research semantically https://findmypapers.ai/ My Patreon https://www.patreon.com/c/bycloud Try out my new fav place to learn how to code https://scrimba.com/?via=bycloudAI This video is supported by the kind Patrons & YouTube Members: 🙏Spam Maj, Alex, Chris LeDoux, DX Research Group, Poof N' Inu, Deagan, Robert Zawiasa, Ryszard Warzocha, Tobe2d, Louis Muk, Akkusativ, Kevin Tai, Mark Buckler, NO U, Tony Jimenez, Ângelo Fonseca, jiye, Anushka, Asad Dhamani, Binnie Yiu, Calvin Yan, Clayton Ford, Diego Silva, Etrotta, Gonzalo Fidalgo, Handenon, Hector, Jake Disco very, Michael Brenner, Nilly K, OlegWock, Daddy Wen, Shuhong Chen, Sid_Cipher, Stefan Lorenz, Sup, tantan assawade, Thipok Tham, Thomas Di Martino, Thomas Lin, Richárd Nagyfi, Paperboy, mika, Leo, Berhane-Meskel, Kadhai Pesalam, mayssam, Bill Mangrum, nyaa, Toru Mon [Discord] https://discord.gg/NhJZGtH [Twitter] https://twitter.com/bycloudai [Patreon] https://www.patreon.com/bycloud [Business Inquiries] [email protected] [Profile & Banner Art] https://twitter.com/pygm7 [Video Editor] Abhay [Bitcoin (BTC)] 3JFMJQVGXNA2HJE5V9qCwLiqy6wHY9Vhdx [Ethereum (ETH)] 0x3d784F55E0bE5f35c1566B2E014598C0f354f190 [Litecoin (LTC)] MGHnqALjyU2W6NuJSSW9fTWV4dcHfwHZd7 [Bitcoin Cash (BCH)] 1LkyGfzHxnSfqMF8tN7ZGDwUTyBB6vcii9 [Solana (SOL)] 6XyMCEdVhtxJQRjMKgUJaySL8cGoBPzzA2NPDMPfVkKN [Ko-fi] https://ko-fi.com/bycloudai

Top Comments (10)

@RadioactiveGuy_69 2025-12-04

Basically we rediscover narrow Ai in search of AGI

2.4k 50 replies
@ShaneHornMusic 2025-12-04

This is what Moores law should be doing with AI, make them smaller and more efficient instead of throwing money at the problem and pretending that expanding is the way to go

1.2k 41 replies
@tehZevo_ 2025-12-04

LSTM -> transformer -> transformers with thinking (recursion in output tokens) -> HRM (recursion in internal state) we've almost come full circle

1.1k 18 replies
@kdjshfihekls 2025-12-04

Actually it turned out that there was nothing novel about the HRM. Ablation studies revealed that the hierarchical part makes no difference. Small models trained for specific tasks tend to outperform general purpose models. This is not a new discovery.

600 13 replies
@appa609 2025-12-06

it turns out knowing ~100 physics equations is more efficient than fitting reality to a 100B parameter polynomial

513 3 replies
@ristekostadinov2820 2025-12-04

I really hope that we get sub 100M params models that are good in 1 programming language. Essentially being able to toggle them like languages in IDEs while running on 20-40 TOPS NPU on laptop processors.

478 11 replies
@dcdenton6859 2025-12-05

You won't believe it, but the 37-parameters MLP beats the GPT-5.1 in sin(X) calculations.

366 2 replies
@existenceisillusion6528 2025-12-04

As hinted at in the video, TRMs don't seem to scale. So instead of making the TRM larger, it would be better to create a MoE with many TRMs as the experts. 1000 TRMs would be a 7b MoE, which changes the problem to one of how to decompose tasks completely (there was a recent paper about that, but I forgot the title).

324 43 replies
@klohger 2025-12-04

oh no he got kidnapped at the end

72
@bycloudAI 2025-12-04

Check out HubSpot's AI Decoded Guide: https://clickhubspot.com/c7a843

42 3 replies

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot