Navigate Select ESC Close

OpenAI o1's New Paradigm: Test-Time Compute Explained

2024-10-14 Science & Technology
50.7k
2.3k
145
bycloud
bycloud
225.0k subscribers

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

What is the latest hype about Test-Time Compute and why it's mid Check out NVIDIA's suite of Training and Certification here: [NVIDIA Certification] https://nvda.ws/3XxkFyj [AI Learning Essential] https://nvda.ws/4gvD474 [Gen AI/LLM Learning Path] https://nvda.ws/4enwYE7 You can use the code “BYCLOUD” at checkout for 10% off! check out my newsletter: https://mail.bycloud.ai Test Time Compute by DeepMind [Paper] https://arxiv.org/abs/2408.03314 To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning [Paper] https://arxiv.org/abs/2409.12183 Language Models Learn to Mislead Humans via RLHF [Paper] https://arxiv.org/abs/2409.12822 Chain-of-Thought Reasoning Without Prompting [Paper] https://arxiv.org/abs/2402.10200 Larger and more instructable language models become less reliable [Paper] https://www.nature.com/articles/s41586-024-07930-y Let's Think Dot by Dot: Hidden Computation in Transformer Language Models [Paper] https://arxiv.org/abs/2404.15758 This video is supported by the kind Patrons & YouTube Members: 🙏Andrew Lescelius, alex j, Chris LeDoux, Ben Shaener, Alex Maurice, Miguilim, Deagan, FiFaŁ, Robert Zawiasa, Owen Ingraham, Daddy Wen, Tony Jimenez, Panther Modern, Jake Disco, Demilson Quintao, Penumbraa, Shuhong Chen, Hongbo Men, happi nyuu nyaa, Carol Lo, Mose Sakashita, Miguel, Bandera, Gennaro Schiano, gunwoo, Ravid Freedman, Mert Seftali, Mrityunjay, Richárd Nagyfi, Timo Steiner, Henrik G Sundt, projectAnthony, Brigham Hall, Kyle Hudson, Kalila, Jef Come, Jvari Williams, Tien Tien, BIll Mangrum, owned, Janne Kytölä, SO, Richárd Nagyfi, Hector, Drexon, Claxvii 177th, Inferencer, Michael Brenner, Akkusativ, Oleg Wock, FantomBloth, Thipok Tham, Clayton Ford, Theo, Handenon, Diego Silva, mayssam, Kadhai Pesalam, Tim Schulz [Discord] https://discord.gg/NhJZGtH [Twitter] https://twitter.com/bycloudai [Patreon] https://www.patreon.com/bycloud [Music] massobeats - floral [Profile & Banner Art] https://twitter.com/pygm7 [Video Editor] @Askejm [Thumbnail Idea] https://x.com/DrJimFan/status/1834279865933332752

Top Comments (10)

@Guedez1 2024-10-14

One of the chain of thoughts felt like doing an A* search on all possible answers

106 3 replies
@rawallon 2024-10-14

Your channel is like twitter but only the good part, I love it

103 2 replies
@lbgstzockt8493 2024-10-15

OpenAI went from extremely secretive closed-source for profit to even more secretive closed-source for profit. Truly revolutionary change.

103
@XetXetable 2024-10-14

I don't understand why you're so insistent that using RL to learn reasoning can't cause new knowledge to be gained. You're implicitly assuming that if the model knows A and that A implies B then the model must already know B. But that's not true. The model knows the rules of chess, and these rules imply whatever the optimal strategy is, but it definitely doesn't know this optimal strategy. It may come to learn it (or of approximations of it) through RL, though, as alpha zero and similar did.

56 3 replies
@bycloudAI 2024-10-14

Let me know if you guys want a dive into the methodologies of TTC, there's a lot of new papers/implementations coming out every day lol (entropix is a cool one) Check out NVIDIA's suite of Training and Certification here: [NVIDIA Certification] https://nvda.ws/3XxkFyj [AI Learning Essential] https://nvda.ws/4gvD474 [Gen AI/LLM Learning Path] https://nvda.ws/4enwYE7 You can use the code “BYCLOUD” at checkout for 10% off!

35 4 replies
@Terenfear 2024-10-14

Glad to see the original editing approach back.

26 1 replies
@springdotgay 2024-10-14

Fun fact: I have spent 3-4 days trying to fix a single SQLite bug while I was debugging with AI

14 5 replies
@John_YT 2024-10-14

"Bart say the line!" *Sigh* "The bitter lesson strikes again"

9
@GIRcode 2024-10-14

kinda reminds me of how chess bots like stockfish are able to view multiple potential outcomes to find the best move possible

8
@Originalimoc 2024-10-14

Okay this explains why higher temp and top_p give better results sometime😮

4

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot