Navigate Select ESC Close

Kimi K2 Technical Breakdown: How It Challenged AI’s 7-Year Status Quo

2025-08-26 Science & Technology
36.6k
2.2k
82
bycloud
bycloud
225.0k subscribers

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

Master AI agents now using HubSpot's FREE resource! https://clickhubspot.com/aa716f Might be very late to the "Muon" party, but hope this at least have some great insights for you. In this video, we will take a look at Muon, which is a new optimizer that Moonshot AI successfully scaled up. My Newsletter https://mail.bycloud.ai/ my project: find, discover & explain AI research semantically https://findmypapers.ai/ My Patreon https://www.patreon.com/c/bycloud Kimi K2 [Blog] https://moonshotai.github.io/Kimi-K2/ [GitHub] https://github.com/MoonshotAI/Kimi-K2 Muon Is Scalable For Pre-training [Paper] https://arxiv.org/abs/2502.16982 Try out my new fav place to learn how to code https://scrimba.com/?via=bycloudAI This video is supported by the kind Patrons & YouTube Members: 🙏Nous Research, Chris LeDoux, Ben Shaener, DX Research Group, Poof N' Inu, Andrew Lescelius, Deagan, Robert Zawiasa, Ryszard Warzocha, Tobe2d, Louis Muk, Akkusativ, Kevin Tai, Mark Buckler, NO U, Tony Jimenez, Ângelo Fonseca, jiye, Anushka, Asad Dhamani, Binnie Yiu, Calvin Yan, Clayton Ford, Diego Silva, Etrotta, Gonzalo Fidalgo, Handenon, Hector, Jake Disco very, Michael Brenner, Nilly K, OlegWock, Daddy Wen, Shuhong Chen, Sid_Cipher, Stefan Lorenz, Sup, tantan assawade, Thipok Tham, Thomas Di Martino, Thomas Lin, Richárd Nagyfi, Paperboy, mika, Leo, Berhane-Meskel, Kadhai Pesalam, mayssam, Bill Mangrum, nyaa, Toru Mon [Discord] https://discord.gg/NhJZGtH [Twitter] https://twitter.com/bycloudai [Patreon] https://www.patreon.com/bycloud [Business Inquiries] [email protected] [Profile & Banner Art] https://twitter.com/pygm7 [Video Editor] @Booga04 [Ko-fi] https://ko-fi.com/bycloudai

Top Comments (10)

@adrieltorresola9580 2025-08-26

You got some comments in your bots 💀. Anyways, I’ve been using Kimi for a bit through t3 and it’s been relatively decent! Especially for such a low cost model (Super great explanations with the Minecraft reference. What a great way to explain local optimization!)

147 2 replies
@RedOneM 2025-08-26

I checked your channel earlier today and was like: Huh, guess you won’t be able to upload for a while. Glad I‘m wrong.

64 3 replies
@prasantkarn9282 2025-08-26

Another great video! btw hope mandatory military training is treating you well

39
@knicklichtjedi 2025-08-26

Why tf did Youtube auto-translate the title — and also simply wrong? All the places call it Muon, while Youtube "translated" it to Myon for my language. Why Youtube... release me from all this translation suffering without the option to turn it off!

36 7 replies
@danielhenderson7050 2025-08-26

When they say ‘one expert per GPU’, my understanding after looking into it is that it's about the training, not inference. In serving, multiple experts live on each GPU, so you don’t actually need 384 GPUs to run a single instance. After watching again I realized you also said that "...to train", but I thought it was an interesting thing to highlight anyway. I was not aware of the huge difference in requirements between training and inference before. Hope you are enjoying your military duty :D

33 1 replies
@no-observe 2025-08-27

Kimi k2 passed my vibe check. Many good MoEs give it unprecedented depth I haven't seen before.

10
@bycloudAI 2025-08-26

Master AI agents now using HubSpot's FREE resource! https://clickhubspot.com/aa716f

7
@KissatenYoba 2025-08-27

6:35 law and chaos

3
@juhotuho10 2025-08-28

Oh wow, way more excited about the new muon optimizer than anything LLM related

2
@first-thoughtgiver-of-will2456 2025-08-31

Orthogonalization is soooo hot in 2025

2

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot