Navigate Select ESC Close

POV: Chinese AI Lab Teaching Everyone How To Save Millions of Dollars

2025-07-19 Science & Technology
65.1k
2.9k
150
bycloud
bycloud
225.0k subscribers

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

Check out Runpod's Hub and Serverless to make deploying AI models even easier! https://runpod.io?ref=h9oj1vbp ByteDance Seed Proposed PMA which is a model merging technique for pre-training models to project your annealed performance without the need to go through annealing. This can save up to millions in big model training runs. My Newsletter https://mail.bycloud.ai/ my project: find, discover & explain AI research semantically https://findmypapers.ai/ My Patreon https://www.patreon.com/c/bycloud Model Merging in Pre-training of Large Language Models [Paper] https://alphaxiv.org/abs/2505.12082 Other "model merging" techniques I mentioned (but are used in completely different scenarios) https://alphaxiv.org/abs/2410.03617 https://alphaxiv.org/abs/2410.15661 https://alphaxiv.org/abs/2403.07816 Try out my new fav place to learn how to code https://scrimba.com/?via=bycloudAI This video is supported by the kind Patrons & YouTube Members: 🙏Nous Research, Chris LeDoux, Ben Shaener, DX Research Group, Poof N' Inu, Andrew Lescelius, Deagan, Robert Zawiasa, Ryszard Warzocha, Tobe2d, Louis Muk, Akkusativ, Kevin Tai, Mark Buckler, NO U, Tony Jimenez, Ângelo Fonseca, jiye, Anushka, Asad Dhamani, Binnie Yiu, Calvin Yan, Clayton Ford, Diego Silva, Etrotta, Gonzalo Fidalgo, Handenon, Hector, Jake Disco very, Michael Brenner, Nilly K, OlegWock, Daddy Wen, Shuhong Chen, Sid_Cipher, Stefan Lorenz, Sup, tantan assawade, Thipok Tham, Thomas Di Martino, Thomas Lin, Richárd Nagyfi, Paperboy, mika, Leo, Berhane-Meskel, Kadhai Pesalam, mayssam, Bill Mangrum, nyaa, Toru Mon, Constantinos Charilaou, Abay Bektursun [Discord] https://discord.gg/NhJZGtH [Twitter] https://twitter.com/bycloudai [Patreon] https://www.patreon.com/bycloud [Business Inquiries] [email protected] [Profile & Banner Art] https://twitter.com/pygm7 [Video Editor] Abhay [Ko-fi] https://ko-fi.com/bycloudai

Top Comments (10)

@mateidumitrescu238 2025-07-19

God bless open source research

560 21 replies
@johnk7025 2025-07-19

Chinese know very well how to lower the price.

333 19 replies
@simeonnnnn 2025-07-19

NGL, I thought this was a Kimi k2 video.

105 1 replies
@Aurora12488 2025-07-19

I swear your accent went ultra Chinese for the intro 😂

54 1 replies
@reiskoryphae 2025-07-20

I really appreciate the use of scientific papers and that you show these sources in the description.

20 1 replies
@bycloudAI 2025-07-19

Check out Runpod's Hub and Serverless to make deploying AI models even easier! https://runpod.io?ref=h9oj1vbp

15
@RodrigoBenitez-gr4bb 2025-07-19

THE SPONSOR IS EXACTLY WHAT I NEEDED THANKS!

11
@nicholascanada3123 2025-07-19

I’ve been big-time bullish on merging models for a long time now

7
@grxceli 2025-07-20

it's honeslty ridiculous how well these open-source models perform - I've been benchmarking model performance on design, and they occupy 50% of the top 10 spots

3
@w-wolf-f 2025-07-20

The last point you referenced regarding distributed training reminds me of "SPARTA - Distributed Training with Sparse Parameter Averaging"

3

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot