OpenAI o1's New Paradigm: Test-Time Compute Explained

2024-10-14 Science & Technology

50.7k

2.3k

145

bycloud

228.0k subscribers

Description

What is the latest hype about Test-Time Compute and why it's mid

Check out NVIDIA's suite of Training and Certification here:
[NVIDIA Certification] https://nvda.ws/3XxkFyj
[AI Learning Essential] https://nvda.ws/4gvD474
[Gen AI/LLM Learning Path] https://nvda.ws/4enwYE7
You can use the code “BYCLOUD” at checkout for 10% off!

check out my newsletter: https://mail.bycloud.ai

Test Time Compute by DeepMind
[Paper] https://arxiv.org/abs/2408.03314

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
[Paper] https://arxiv.org/abs/2409.12183

Language Models Learn to Mislead Humans via RLHF
[Paper] https://arxiv.org/abs/2409.12822

Chain-of-Thought Reasoning Without Prompting
[Paper] https://arxiv.org/abs/2402.10200

Larger and more instructable language models become less reliable
[Paper] https://www.nature.com/articles/s41586-024-07930-y

Let's Think Dot by Dot: Hidden Computation in Transformer Language Models
[Paper] https://arxiv.org/abs/2404.15758

This video is supported by the kind Patrons & YouTube Members:
🙏Andrew Lescelius, alex j, Chris LeDoux, Ben Shaener, Alex Maurice, Miguilim, Deagan, FiFaŁ, Robert Zawiasa, Owen Ingraham, Daddy Wen, Tony Jimenez, Panther Modern, Jake Disco, Demilson Quintao, Penumbraa, Shuhong Chen, Hongbo Men, happi nyuu nyaa, Carol Lo, Mose Sakashita, Miguel, Bandera, Gennaro Schiano, gunwoo, Ravid Freedman, Mert Seftali, Mrityunjay, Richárd Nagyfi, Timo Steiner, Henrik G Sundt, projectAnthony, Brigham Hall, Kyle Hudson, Kalila, Jef Come, Jvari Williams, Tien Tien, BIll Mangrum, owned, Janne Kytölä, SO, Richárd Nagyfi, Hector, Drexon, Claxvii 177th, Inferencer, Michael Brenner, Akkusativ, Oleg Wock, FantomBloth, Thipok Tham, Clayton Ford, Theo, Handenon, Diego Silva, mayssam, Kadhai Pesalam, Tim Schulz

[Discord] https://discord.gg/NhJZGtH
[Twitter] https://twitter.com/bycloudai
[Patreon] https://www.patreon.com/bycloud

[Music] massobeats - floral
[Profile & Banner Art] https://twitter.com/pygm7
[Video Editor] @Askejm
[Thumbnail Idea] https://x.com/DrJimFan/status/1834279865933332752

#bycloud #bycloudai #test time compute #chain of thought #test time computing #open ai #openai o1 #openai o1 test time compute

Top Comments (10)

@Guedez1 2024-10-14

One of the chain of thoughts felt like doing an A* search on all possible answers

106 3 replies

@rawallon 2024-10-14

Your channel is like twitter but only the good part, I love it

103 2 replies

@lbgstzockt8493 2024-10-15

OpenAI went from extremely secretive closed-source for profit to even more secretive closed-source for profit. Truly revolutionary change.

103

@bycloudAI 2024-10-14

Let me know if you guys want a dive into the methodologies of TTC, there's a lot of new papers/implementations coming out every day lol (entropix is a cool one) Check out NVIDIA's suite of Training and Certification here: [NVIDIA Certification] https://nvda.ws/3XxkFyj [AI Learning Essential] https://nvda.ws/4gvD474 [Gen AI/LLM Learning Path] https://nvda.ws/4enwYE7 You can use the code “BYCLOUD” at checkout for 10% off!

35 4 replies

@Terenfear 2024-10-14

Glad to see the original editing approach back.

26 1 replies

@shApYT 2024-10-14

RLHF or in other words LGTM ship it to prod.

@John_YT 2024-10-14

"Bart say the line!" *Sigh* "The bitter lesson strikes again"

@GIRcode 2024-10-14

kinda reminds me of how chess bots like stockfish are able to view multiple potential outcomes to find the best move possible

@Originalimoc 2024-10-14

Okay this explains why higher temp and top_p give better results sometime😮

@vincent_hall 2024-12-26

Thank you for giving us a healthy level of scepticism in the current AI models.
