How did a 27M Model even beat ChatGPT?

2025-12-04 Science & Technology

193.9k

9.7k

650

bycloud

229.0k subscribers

Description

Check out HubSpot's AI Decoded Guide: https://clickhubspot.com/c7a843

A tiny 27M parameter “recursive” model is suddenly rivaling frontier LLMs on the ARC AGI benchmark, challenging & revealing how iterative thought, not size, is also capable of solving hard logical tasks.

Papers
[HRM] https://arxiv.org/abs/2506.21734
[TRM] https://arxiv.org/abs/2510.04871

My Newsletter https://mail.bycloud.ai/
my project: find, discover & explain AI research semantically https://findmypapers.ai/
My Patreon https://www.patreon.com/c/bycloud
Try out my new fav place to learn how to code https://scrimba.com/?via=bycloudAI

This video is supported by the kind Patrons & YouTube Members: 🙏Spam Maj, Alex, Chris LeDoux, DX Research Group, Poof N' Inu, Deagan, Robert Zawiasa, Ryszard Warzocha, Tobe2d, Louis Muk, Akkusativ, Kevin Tai, Mark Buckler, NO U, Tony Jimenez, Ângelo Fonseca, jiye, Anushka, Asad Dhamani, Binnie Yiu, Calvin Yan, Clayton Ford, Diego Silva, Etrotta, Gonzalo Fidalgo, Handenon, Hector, Jake Disco very, Michael Brenner, Nilly K, OlegWock, Daddy Wen, Shuhong Chen, Sid_Cipher, Stefan Lorenz, Sup, tantan assawade, Thipok Tham, Thomas Di Martino, Thomas Lin, Richárd Nagyfi, Paperboy, mika, Leo, Berhane-Meskel, Kadhai Pesalam, mayssam, Bill Mangrum, nyaa, Toru Mon

[Discord] https://discord.gg/NhJZGtH
[Twitter] https://twitter.com/bycloudai
[Patreon] https://www.patreon.com/bycloud
[Business Inquiries] [email protected]
[Profile & Banner Art] https://twitter.com/pygm7
[Video Editor] Abhay
[Bitcoin (BTC)] 3JFMJQVGXNA2HJE5V9qCwLiqy6wHY9Vhdx
[Ethereum (ETH)] 0x3d784F55E0bE5f35c1566B2E014598C0f354f190
[Litecoin (LTC)] MGHnqALjyU2W6NuJSSW9fTWV4dcHfwHZd7
[Bitcoin Cash (BCH)] 1LkyGfzHxnSfqMF8tN7ZGDwUTyBB6vcii9
[Solana (SOL)] 6XyMCEdVhtxJQRjMKgUJaySL8cGoBPzzA2NPDMPfVkKN
[Ko-fi] https://ko-fi.com/bycloudai

#bycloud #bycloudai #TRM #HRM #what is HRM #what is TRM #recursive model #hierarchical reasoning model

Top Comments (10)

@RadioactiveGuy_69 2025-12-04

Basically we rediscover narrow AI in search of AGI

2.4k 51 replies

@ShaneHornMusic 2025-12-04

This is what Moore's law should be doing with AI: make them smaller and more efficient instead of throwing money at the problem and pretending that expanding is the way to go

1.2k 43 replies

@tehZevo_ 2025-12-04

LSTM -> transformer -> transformers with thinking (recursion in output tokens) -> HRM (recursion in internal state) we've almost come full circle

1.1k 18 replies

@kdjshfihekls 2025-12-04

Actually it turned out that there was nothing novel about the HRM. Ablation studies revealed that the hierarchical part makes no difference. Small models trained for specific tasks tend to outperform general purpose models. This is not a new discovery.

613 14 replies

@appa609 2025-12-06

it turns out knowing ~100 physics equations is more efficient than fitting reality to a 100B parameter polynomial

606 5 replies

@ristekostadinov2820 2025-12-04

I really hope that we get sub 100M params models that are good in 1 programming language. Essentially being able to toggle them like languages in IDEs while running on 20-40 TOPS NPU on laptop processors.

484 11 replies

@dcdenton6859 2025-12-05

You won't believe it, but the 37-parameters MLP beats the GPT-5.1 in sin(X) calculations.

435 2 replies

@existenceisillusion6528 2025-12-04

As hinted at in the video, TRMs don't seem to scale. So instead of making the TRM larger, it would be better to create a MoE with many TRMs as the experts. 1000 TRMs would be a 7b MoE, which changes the problem to one of how to decompose tasks completely (there was a recent paper about that, but I forgot the title).

332 44 replies

@KLohger 2025-12-04

oh no he got kidnapped at the end

@bycloudAI 2025-12-04

Check out HubSpot's AI Decoded Guide: https://clickhubspot.com/c7a843

44 3 replies
