Navigate Select ESC Close

10x Faster Than Standard LLM!? DiffusionLM Explained

2025-07-28 Science & Technology
63.7k
2.8k
202
bycloud
bycloud
225.0k subscribers

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

Try out Warp 2.0 now, the current rank #1 AI on Terminal Bench, outperforming Claude Code: https://go.warp.dev/bycloud You can also use code "BYCLOUD" to get Warp Pro for 1 month free. (limited for 1,000 redemptions) My Newsletter https://mail.bycloud.ai/ my project: find, discover & explain AI research semantically https://findmypapers.ai/ My Patreon https://www.patreon.com/c/bycloud Video Sauces: Inception Labs [Website] https://inceptionlabs.ai/ Gemini Diffusion [Blog] https://deepmind.google/models/gemini-diffusion Large Language Diffusion Models [Paper] https://arxiv.org/abs/2502.09992 MMaDA [Paper] https://arxiv.org/abs/2505.15809 LaViDa [Paper] https://arxiv.org/abs/2505.16839 Diffusion Beats Autoregressive in Data-Constrained Settings [Paper] https://www.arxiv.org/abs/2507.15857 Try out my new fav place to learn how to code https://scrimba.com/?via=bycloudAI This video is supported by the kind Patrons & YouTube Members: 🙏Nous Research, Chris LeDoux, Ben Shaener, DX Research Group, Poof N' Inu, Andrew Lescelius, Deagan, Robert Zawiasa, Ryszard Warzocha, Tobe2d, Louis Muk, Akkusativ, Kevin Tai, Mark Buckler, NO U, Tony Jimenez, Ângelo Fonseca, jiye, Anushka, Asad Dhamani, Binnie Yiu, Calvin Yan, Clayton Ford, Diego Silva, Etrotta, Gonzalo Fidalgo, Handenon, Hector, Jake Disco very, Michael Brenner, Nilly K, OlegWock, Daddy Wen, Shuhong Chen, Sid_Cipher, Stefan Lorenz, Sup, tantan assawade, Thipok Tham, Thomas Di Martino, Thomas Lin, Richárd Nagyfi, Paperboy, mika, Leo, Berhane-Meskel, Kadhai Pesalam, mayssam, Bill Mangrum, nyaa, Toru Mon [Discord] https://discord.gg/NhJZGtH [Twitter] https://twitter.com/bycloudai [Patreon] https://www.patreon.com/bycloud [Business Inquiries] [email protected] [Profile & Banner Art] https://twitter.com/pygm7 [Video Editor] @Booga04 [Ko-fi] https://ko-fi.com/bycloudai

Top Comments (10)

@megavoltampere1000 2025-07-28

2017: Attention is all you need 2025: diffusion is all you need

380 7 replies
@zgaly 2025-07-28

BYCLOUD SAMAAA I NEED A VIDEO RIGHT NOW ON HOW DIFFUSION LM WORKS

140 5 replies
@scifi2sci 2025-07-28

Please make the technical Deep dive into diffusion LMs

84 4 replies
@yanntal954 2025-07-28

Oh, finally an update on this wonderful idea!

74
@michaellin4553 2025-07-28

AI is going in reverse, we're switching image gen to autoregression and text to diffusion

64 6 replies
@ln2deep 2025-07-28

Note that diffusion might make it easier to solve certain types of problems. There are many problems where it is better to create a global solution that is refined over time.

35
@bycloudAI 2025-07-28

Try out Warp 2.0 now, the current rank #1 AI on Terminal Bench, outperforming Claude Code: https://go.warp.dev/bycloud You can also use code "BYCLOUD" to get Warp Pro for 1 month free. (limited for 1,000 redemptions) correction 0:22 it's not really an architecture, it's an objective, my bad.

26 2 replies
@DisturbedNeo 2025-07-28

There are some wild techniques being that have been shown off recently. We got Diffusion, we got HRMs, we got Mamba, we got Self-Adapting Continuous Learning. AI researchers be cooking.

17
@simonson6498 2025-07-28

RIP next token prediction... you've been too great so far.

15
@Slash27015 2025-07-28

2000's teachers: better pay attention, you won't always have a calculator with you 2025's teachers: a locally ran language model in your pocket (plus it does math)

14 1 replies

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot