Meta's new Llama-4 is Full of Controversies...

2025-04-10 Science & Technology
39.4k
1.6k
177
bycloud
225.0k subscribers

Description

Visualize your AI content creation workflow now with poppy AI https://start.getpoppy.ai/bycloud and use the coupon code “BYCLOUD” for $25 off any plan!

Meta just dropped the Llama-4 series, and it looks really good... on paper. People have been finding weird problems left and right. So in this video, I'll give you an overview of Llama-4, and point out all the inconsistencies people found and why the community is slightly disappointed about this release.

My Newsletter https://mail.bycloud.ai/
My Patreon https://www.patreon.com/c/bycloud

Llama-4
[Blog] https://ai.meta.com/blog/llama-4-multimodal-intelligence/
[Tweet] https://x.com/AIatMeta/status/1908598456144531660
[Download] https://huggingface.co/collections/meta-llama/llama-4-67f0c30d9fe03840bc9d0164

Other sources
[Artificial Analysis] https://artificialanalysis.ai/
[lmArena] https://lmarena.ai/
[Chinese Rumor Thread] https://www.1point3acres.com/bbs/thread-1122600-1-1.html
[Fiction Live Bench] https://fiction.live/stories/Fiction-liveBench-April-6-2025/oQdzQvKHw8JyXbN87
[ARC-AGI] https://arcprize.org/leaderboard
[Math Perturb] https://math-perturb.github.io/
[VISTA] https://scale.com/leaderboard/visual_language_understanding

This video is supported by the kind Patrons & YouTube Members:
🙏 Andrew Lescelius, Ben Shaener, Kainan, Chris LeDoux, Miguilim, Deagan, FiFaŁ, Robert Zawiasa, Marcelo Ferreira, Owen Ingraham, Daddy Wen, Tony Jimenez, Panther Modern, Jake Disco, Demilson Quintao, Penumbraa, Shuhong Chen, Hongbo Men, happi nyuu nyaa, Carol Lo, Mose Sakashita, Miguel, Bandera, Gennaro Schiano, gunwoo, Ravid Freedman, Mert Seftali, Mrityunjay, Richárd Nagyfi, Timo Steiner, Henrik G Sundt, projectAnthony, Brigham Hall, Kyle Hudson, Kalila, Jef Come, Jvari Williams, Tien Tien, BIll Mangrum, owned, Janne Kytölä, SO, Richárd Nagyfi, Hector, Drexon, Claxvii 177th, Inferencer, Michael Brenner, Akkusativ, Oleg Wock, FantomBloth, Thipok Tham, Clayton Ford, Theo, Handenon, Diego Silva, mayssam, Kadhai Pesalam, Tim Schulz, jiye, Anushka, Henrik Sundt, Julian Aßmann, Thomas Lin, Sid_Cypher, Mark Buckler, Kevin Tai, NO U, Gonzalo Fidalgo, Igor Alvarez, Alon Pluda, Clément Veyssière, Sander Zwaenepoel, etrotta, Binnie Yiu, Matej Macak, c zhou, Berhane-Meskel, sai sandeep mandava, Leo, Asad Dhamani, Charlie C, tantan assawade, Ângelo Fonseca, Stefan Lorenz, Paperboy, mika, Leo, Utsav Soi, Calvin Yan, Thomas Di martino

[Discord] https://discord.gg/NhJZGtH
[Twitter] https://twitter.com/bycloudai
[Patreon] https://www.patreon.com/bycloud
[Business Inquiries] [email protected]
[Profile & Banner Art] https://twitter.com/pygm7
[Video Editor] @Booga04 and me
[Bitcoin (BTC)] 3JFMJQVGXNA2HJE5V9qCwLiqy6wHY9Vhdx
[Ethereum (ETH)] 0x3d784F55E0bE5f35c1566B2E014598C0f354f190
[Litecoin (LTC)] MGHnqALjyU2W6NuJSSW9fTWV4dcHfwHZd7
[Bitcoin Cash (BCH)] 1LkyGfzHxnSfqMF8tN7ZGDwUTyBB6vcii9
[Solana (SOL)] 6XyMCEdVhtxJQRjMKgUJaySL8cGoBPzzA2NPDMPfVkKN
[Ko-fi] https://ko-fi.com/bycloudai

Top Comments (10)

@nathanb011 2025-04-10

"Since it's open source everyone can fix it" > illegal to use in the EU

131 13 replies
@OperationDarkside 2025-04-10

It feels like there's a secret sauce to make models better at coding. I have a small piece of JavaScript (originally created by an older LLM, and faulty) that I use to measure LLM bug-fix performance. There have been a lot of open-weight models since around the beginning of last year that manage to create the fixed code just fine, but they're usually 8B or above and not fully open source. I even tested some 12B-14B reasoning models and they mostly failed spectacularly after thousands of tokens. Either these models have never seen a piece of JavaScript, or they are overtrained on Python, or there is a secret training set the open source community doesn't have.

26 7 replies
@samehedi 2025-04-10

thank you for briefly explaining the difference between dense and MoE as well as active parameters. from all these ai channels, i think you might be the best one. love your content and explanations!

23
@GigaSimp 2025-04-10

Llama-4 is already asking for vacation days?

22
@okolenmi 2025-04-10

1.5B model outperforming 109B is funny

15
@bycloudAI 2025-04-10

Visualize your AI content creation workflow now with poppy AI https://start.getpoppy.ai/bycloud and use the coupon code “BYCLOUD” for $25 off any plan!

14 1 reply
@coleabbott3432 2025-04-10

I agree it is odd. And where are the smaller models? What's going on?

5
@forgiq 2025-04-10

Maybe we've reached the peak use cases of transformers? Maybe newer, improved architectures like Mamba??

5
@pyotrgrowpotkin 2025-04-10

I've found all of the Llama models/generations to be generally really underwhelming tbf, so not surprised by this.

3
@URB4NR3CON 2025-04-10

You've been the best in AI news and overviews for years now and with great humor. glad I came across you

3