AI isn't gonna keep improving
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Related videos
This is bad...
Theo - t3․gg
104.3k views
AI Is Hacking Everything Now...
Theo - t3․gg
82.0k views
I’m done.
Theo - t3․gg
185.8k views
Github is Falling Apart
Theo - t3․gg
96.5k views
OpenAI is lying
Theo - t3․gg
81.9k views
Open source is dying
Theo - t3․gg
97.8k views
Are you f**king kidding?
Theo - t3․gg
201.9k views
Anthropic is lying to us.
Theo - t3․gg
122.7k views
I was wrong
Theo - t3․gg
69.3k views
AI mistakes you're probably making
Theo - t3․gg
135.1k views
Top Comments (10)
Gordon Moore was not a “Dev”. He was the co-founder of Fairchild Semiconductor and Intel (and former CEO of the Intel).
Calling Gordon Moore a "dev and hardware enthusiast" is hilarious. Dude literally founded Intel
The problem as always is when you have a 99% reliable system and you want a 99.9% reliable model. The .9% difference is 10x more than anything else
"Moore - a dev and hardware enthusiast" -- Theo That is... technically correct
As we reach higher benchmark scores, you have to flip your understanding of them. If accuracy goes from 80% to 90%, that feels like a 10% improvement, but in reality the error rate has gone down by half, which is basically a 100% improvement.
Calling Gordon Moore a dev/hardware enthusiast would’ve been funny, if it was intended as a joke
small factual correction: "One of the crazy things apple invented, was the idea of having different cores with different roles"... No they didn't, it was actually ARM. Hetereogeneous Computing strategies and Big/Little architectures were not invented by apple :)
It feels less like LLMs have plateaued and more like the benchmarks are all being gamed and optimized for. Claude 3.5 sonnet, for example, is a cut above all other models.
I no longer believe what I said in this video. Updated take here: https://youtu.be/Kzf-tL8zyfo?si=IU-0BajX1F6FbBHz
This happened with re-enforcement learning too. The models had so many nodes that back propagation had virtually no effect, meaning they became too big to train any more and even got worse with more training.
Unlock the Data Inside
Turn Videos into Knowledge
- Get FREE 10/day: transcripts, summaries, chats
- Chat with videos, export text & PDF
- $1 free API credit for RAG, chatbots & research
Free forever plan • All features unlocked
Top Comments (10)
Gordon Moore was not a “Dev”. He was the co-founder of Fairchild Semiconductor and Intel (and former CEO of the Intel).
Calling Gordon Moore a "dev and hardware enthusiast" is hilarious. Dude literally founded Intel
The problem as always is when you have a 99% reliable system and you want a 99.9% reliable model. The .9% difference is 10x more than anything else
"Moore - a dev and hardware enthusiast" -- Theo That is... technically correct
As we reach higher benchmark scores, you have to flip your understanding of them. If accuracy goes from 80% to 90%, that feels like a 10% improvement, but in reality the error rate has gone down by half, which is basically a 100% improvement.
Calling Gordon Moore a dev/hardware enthusiast would’ve been funny, if it was intended as a joke
small factual correction: "One of the crazy things apple invented, was the idea of having different cores with different roles"... No they didn't, it was actually ARM. Hetereogeneous Computing strategies and Big/Little architectures were not invented by apple :)
It feels less like LLMs have plateaued and more like the benchmarks are all being gamed and optimized for. Claude 3.5 sonnet, for example, is a cut above all other models.
I no longer believe what I said in this video. Updated take here: https://youtu.be/Kzf-tL8zyfo?si=IU-0BajX1F6FbBHz
This happened with re-enforcement learning too. The models had so many nodes that back propagation had virtually no effect, meaning they became too big to train any more and even got worse with more training.