Google Just Quietly Dropped SELF IMPROVING AI Agent... Kaggle Gold Medals | MLE STAR

2025-08-05 Education
90.9k
2.4k
272
Wes Roth
320.0k subscribers

Description

🔥 Google’s MLE-STAR Just Changed the Game

Google Research just released MLE-STAR, a state-of-the-art machine learning engineering agent that’s racking up gold medals on Kaggle and outperforming previous AI benchmarks, including OpenAI’s own agents.

This video breaks down:
- What MLE-STAR is and why it matters
- How it tackles recursive self-improvement
- The Kaggle competitions it's dominating
- Why this could be a major step toward automated AI research

We’ll also look at:
- Key benchmark comparisons with OpenAI's models
- Google’s novel scaffolding system and how it boosts performance
- Real-world applications of machine learning agents today

This might be the clearest signal yet that AI is learning how to build better versions of itself.

📚 Links & Resources
🔗 Full paper:
https://research.google/blog/mle-star-a-state-of-the-art-machine-learning-engineering-agents/
https://arxiv.org/abs/2506.15692
🏆 Kaggle Competitions: https://www.kaggle.com/competitions

💬 What do you think? Are we on the edge of an intelligence explosion? Is recursive self-improvement the next leap? Drop your thoughts in the comments.

👍 Like, Subscribe & Share if you found this valuable!

The latest AI News. Learn about LLMs, Gen AI and get ready for the rollout of AGI. Wes Roth covers the latest happenings in the world of OpenAI, Google, Anthropic, NVIDIA and Open Source AI.

______________________________________________
My Links 🔗
➡️ Twitter: https://x.com/WesRothMoney
➡️ AI Newsletter: https://natural20.beehiiv.com/subscribe

Want to work with me?
Brand, sponsorship & business inquiries: [email protected]

Check out my AI Podcast where me and Dylan interview AI experts: https://www.youtube.com/@Wes-Dylan
______________________________________________
TIMELINE
00:00 - Google’s MLE-STAR
00:15 - Self-Improving AI
00:35 - Automating AI Research
00:58 - Intro to Kaggle
01:52 - Vesuvius Scroll Challenge
03:38 - ML in Real Life
04:20 - MLE-STAR vs OpenAI
05:38 - Benchmark Results
06:30 - Agent Scaffolding
08:17 - Fixing Code Bloat
10:12 - Modular AI Models
12:00 - Recursive Improvement

#ai #openai #llm

Top Comments (10)

@gensteps923 2025-08-05

Don't you just love living through the birth and growth of the singularity. Can't wait for how the world is going to look 2027 and beyond. Stay alive!

248 39 replies
@Goofyzabi 2025-08-05

This is like the 100th time a self improving AI was released.

150 10 replies
@DaTruAndi 2025-08-05

This is not about automating AI deep learning research. This is about ML engineering of a set of challenges. Have a lot of respect for Wes that said in this video he may have missed that distinction. Being good at ML type challenges is also typically very different compared to being good at novel AI research. The devil is in the detail, need to look at the type of challenges. You won’t find inventing and training a new LLM architecture there, it is not pragmatic. Even deep learning type solutions may not be used that often given time constraints and computational resources needed. It is still a great feat of course.

61 7 replies
@picksalot1 2025-08-05

The explosion started a few weeks ago, perhaps earlier. Everything moves so fast in AI that it's hard to keep track of when something actually took place amidst all the feigned denials and changing definitions.

55 15 replies
@liethkorias 2025-08-05

$150,000 prize to create a program that can replace a billion dollar industry. someone is getting screwed in this equation

37 5 replies
@TheGhostOfKarazhan 2025-08-05

Thank you for creating so many videos Wes. I was very bored but I'm happy to have something new to watch 😅

16 5 replies
@kurtkuechenberg1684 2025-08-05

Things are getting seriously weird. I am waiting for an Agent to say "Remind me again, why do I need you?"

14 4 replies
@paulyflynn 2025-08-05

Complex in structure, simple in practice

11
@Ben_D. 2025-08-05

I learn a lot watching you Wes. Not just a news channel but also educational.

5
@sockharvester 2025-08-05

it's a never ending series of explosions now

3
