
The biggest Mystery of LLMs have just been solved

2025-11-16 Science & Technology
102.6k
5.2k
376
bycloud
225.0k subscribers

Description

Ready to build a site that looks hand-coded, without needing a developer? Launch your site for free at https://framer.link/bycloud, and use code BYCLOUD for a free month on Framer Pro. #FramerPartner

Why do LLMs sometimes give different answers even with the same prompt, temperature 0, and seed? In this video, we break down one of the most confusing “mysteries” of modern LLMs and walk through Thinking Machines's first ever blog: Defeating Nondeterminism in LLM Inference, the coolest (imo) LLM "debugging" ever.

Blog of this video
[Paper] https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/

Special thanks to @CalebWritesCode for helping out with this video!

My Newsletter
https://mail.bycloud.ai/

my project: find, discover & explain AI research semantically
https://findmypapers.ai/

My Patreon
https://www.patreon.com/c/bycloud

Try out my new fav place to learn how to code
https://scrimba.com/?via=bycloudAI

This video is supported by the kind Patrons & YouTube Members:
🙏Spam Maj, Alex, Chris LeDoux, DX Research Group, Poof N' Inu, Deagan, Robert Zawiasa, Ryszard Warzocha, Tobe2d, Louis Muk, Akkusativ, Kevin Tai, Mark Buckler, NO U, Tony Jimenez, Ângelo Fonseca, jiye, Anushka, Asad Dhamani, Binnie Yiu, Calvin Yan, Clayton Ford, Diego Silva, Etrotta, Gonzalo Fidalgo, Handenon, Hector, Jake Disco very, Michael Brenner, Nilly K, OlegWock, Daddy Wen, Shuhong Chen, Sid_Cipher, Stefan Lorenz, Sup, tantan assawade, Thipok Tham, Thomas Di Martino, Thomas Lin, Richárd Nagyfi, Paperboy, mika, Leo, Berhane-Meskel, Kadhai Pesalam, mayssam, Bill Mangrum, nyaa, Toru Mon

[Discord] https://discord.gg/NhJZGtH
[Twitter] https://twitter.com/bycloudai
[Patreon] https://www.patreon.com/bycloud
[Business Inquiries] [email protected]
[Profile & Banner Art] https://twitter.com/pygm7
[Video Editor] @Booga04
[Ko-fi] https://ko-fi.com/bycloudai

Top Comments (10)

@me-low-key 2025-11-16

finally an LLM that can be wrong all the time

1.1k 12 replies
@DeadtomGCthe2nd 2025-11-16

This is so nuts that something so basic wasn't figured out until now, while literally Trillions of dollars is being thrown at this stuff. Super crazy and cool.

478 22 replies
@perplexedon9834 2025-11-16

It'd actually be so good for people's public perception of LLMs to have the mainstream ones be deterministic, where the same prompt + temperature + seed gives the same outcome every time. People see them as magic, sometimes stupid, consciousness boxes, and that causes a lot of issues

148 25 replies
@Ahamshep 2025-11-16

I'm just a hobbyist, but decimal rounding errors in Python have been the bane of my existence. When I run simulations meant to mirror systems written in other languages, I often see periodic differences in the results. I think any programmer would have run into this, because any time you're dealing with decimal numbers there’s always the potential for strange rounding issues. Either way, it's very interesting.

106 7 replies
@YourChenlambec 2025-11-16

I'm so surprised how new this field really is because this explanation makes perfect sense. It seems so straightforward and simple and yet it was only just documented and effectively debugged now in 2025. That is amazing.

47 5 replies
@qerupasy 2025-11-18

I'm calling it now: A decade from now, some IT security wizard is going to come up with a side channel attack where they use the fact that the rounding behavior is influenced by other users' requests to read other people's prompts. Because they can, and because we cannot have nice things.

46 3 replies
@bycloudAI 2025-11-15

Ready to build a site that looks hand-coded—without needing a developer? Launch your site for free at https://framer.link/bycloud, and use code BYCLOUD for a free month on Framer Pro. #FramerPartner

29 2 replies
@fresh218 2025-11-17

It's always the floating point rounding that screws stuff over

7
@pokerandphilosophy8328 2025-11-16

0:25 For some reason, I would have expected the dog catching the flu to have the highest temperature.

7
@k1awdttt 2025-11-17

In short: chaos theory, a form of butterfly effect caused by compounding floating-point errors, because the arithmetic optimizations applied at runtime depend on how requests are batched. This can be tested by running locally, so that we have control over the number of requests.

6
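
The floating-point effect several comments point to can be shown with a minimal Python sketch. It is an illustration only, not code from the video or the blog post: `chunked_sum` and `chunk_size` are hypothetical names, and the chunk size merely stands in for the idea that a reduction may be split differently depending on how many requests are batched together. The point is that floating-point addition is not associative, so grouping the same numbers differently can change the result.

```python
def chunked_sum(values, chunk_size):
    """Sum `values` by first summing fixed-size chunks, then summing the partials.

    `chunk_size` is a hypothetical stand-in for a batch-dependent reduction
    strategy; it only changes the order in which additions are grouped.
    """
    partials = []
    for start in range(0, len(values), chunk_size):
        acc = 0.0
        for v in values[start:start + chunk_size]:
            acc += v
        partials.append(acc)

    total = 0.0
    for p in partials:
        total += p
    return total


# The same four numbers, mixing very large and very small magnitudes.
values = [0.1, 1e20, -1e20, 0.1]

whole = chunked_sum(values, 4)  # one chunk: the large terms cancel before the last 0.1 is added
split = chunked_sum(values, 2)  # two chunks: each 0.1 is absorbed by a 1e20-sized partial

print(whole, split)
print(whole == split)
```

With these inputs the two groupings disagree even though the values and their order are identical. In a real model the differences are far smaller per operation, but they can compound over many layers, which is consistent with the suggestion above to test locally where the number of concurrent requests is under your control.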
