How GPT-5, Claude, and Gemini are actually trained and served – Reiner Pope

2026-04-29 Science & Technology
55.4k
2.6k
181
Dwarkesh Patel
1.3m subscribers

Description

Did a very different format with Reiner Pope – a blackboard lecture where he walks through how frontier LLMs are trained and served. It's shocking how much you can deduce about what the labs are doing from a handful of equations, public API prices, and some chalk. It’s a bit technical, but I encourage you to hang in there - it’s really worth it. There are less than a handful of people who understand the full stack of AI, from chip design to model architecture, as well as Reiner. It was a real delight to learn from him.

Reiner is CEO of MatX, a new chip startup (full disclosure - I’m an angel investor). He was previously at Google, where he worked on software efficiency, compilers, and TPU architecture.

Wrote up some flashcards and practice problems to help myself retain what Reiner taught. Hope it's helpful to you too! https://reiner-flashcards.vercel.app/

Download markdown of transcript here to chat with an LLM: https://gist.github.com/dwarkeshsp/79100f0fdeed69d76241903bb0604dbe

0:00:00 – How batch size affects token cost and speed
0:31:59 – How MoE models are laid out across GPU racks
0:47:02 – How pipeline parallelism spreads model layers across racks
1:03:27 – Why Ilya said, “As we now know, pipelining is not wise.”
1:18:49 – Because of RL, models may be 100x over-trained beyond Chinchilla-optimal
1:32:52 – Deducing long context memory costs from API pricing
2:03:52 – Convergent evolution between neural nets and cryptography

Top Comments (10)

@BLAISEDAHL96 2026-04-29

Yep, this definitely needs to be the format moving forward with guests that care to instruct something. This is a great public service.

439 3 replies
@WillzMaster85 2026-04-29

this new format will be a complete gamechanger; a simple yet genius move dwarkesh, good work!

205
@fernandodutra3788 2026-04-29

So nice to see a mainstream podcast (1.3M subs) spend 2+h discussing technical state of the art. Not just “let’s explain what 1+1 is so the audience can follow”, but actually just going with the flow. Appreciated!

106
@DwarkeshPatel 2026-04-29

Wrote up some flashcards and practice problems to help myself retain what Reiner taught. Hope it's helpful to you too! https://reiner-flashcards.vercel.app

74 4 replies
@manushsanchela1109 2026-04-29

This is the best format I have seen on YT. Please keep going

63
@darkflow_null 2026-04-29

Absolutely love the new format

45
@FedericoLebrón 2026-04-29

Reiner and a blackboard? I've never clicked on anything faster in my life. What a treat!!!!

25
@BLAISEDAHL96 2026-04-29

This looks phenomenal! One note would be to get some more fill lighting so you guys aren’t completely shaded out.

19
@avnotes 2026-04-30

Petition to bring Sir Karpathy in this setup!

14 1 reply
@NimTheHuman 2026-04-30

This is making me realize that most recordings of lectures are seriously deprived of the production quality they deserve (e.g., high quality mics, thoughtful lighting, aesthetic room setups, intentional camera angle shifts like the one at 7:18). If more university-style lectures adopted this format and quality, humans would be a lot more knowledgeable. :) Looking forward to more videos using this format.

5
