LLM’s Billion Dollar Problem
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Related videos
MY trillion $Dollar Project is finally OUT!
PewDiePie
793.7k views
Trump’s “Epstein” Problem RESURFACES in NEW Billion Dollar SLUSH FUND?!?!
Legal AF
34.4k views
MAGA Felon CAUGHT Behind Trump’s BILLION DOLLAR SCHEME?!
MeidasTouch
61.9k views
The RL Irony in LLMs
bycloud
23.0k views
Building a Billion Dollar Brand with Anastasia Soare and Oprah
Oprah
5.1m views
Behind Italy’s Billion Dollar Luxury Brands - PBD & Crew Travel to Tuscany
Valuetainment
125.8k views
Bitcoin Prognose: Warum der Kurs auf 1 Million Dollar steigt
Marc Friedrich
107.0k views
Inside Nashville's Broadway bar problem | NewsNation Reports
NewsNation
59.4k views
POV: Chinese AI Lab Teaching Everyone How To Save Millions of Dollars
bycloud
65.1k views
Big Beautiful Bill, Elon/Trump, Dollar Down Big, Harvard's Money Problems, Figma IPO
All-In Podcast
541.7k views
Top Comments (10)
Google was only offering that long context window for the first few weeks after Gemini 3 launch. They quietly rugpulled once the hype was past peak and the exodus from GPT was well under way. Now you only get 32K tokens even with Pro and it's just god awful. Reasoning performance tanked too after Google cut the precision of parameter datatypes to save more money. So, I don't think Google solved anything at all. Their secret sauce was being the biggest and most profitable player in the game, which allowed them to burn who knows how much cash to fool everyone into thinking they'd solved it, and switch over to their inferior product on 6-12 month plans or whatever, leading nVidia to abandon the 100B investment OpenAI was depending on to stay solvent. Honestly it looks like this "innovation" is innovative in the same way that Altman's ploy to buy up all the RAM wafer supplies to engineer the shortage, without actually using those wafers, was. Basically, a scam. So sick of this anti-consumer, anti-competitive, big business bullshit.
video idea: LLM's billion watt problem
I took a shot for every 'linear' in this video. Hello from spirit realm.
The best change I made to my AI assisted coding workflow was to limit the project size. This is an known software practice of working in "modules", rather than a monolithic code base. Modules communicate with one another as separate apps. This limits your context needs dramatically.
16:40 i think your conclusion about google having solved it may be wrong, don't forget they have their own custom hardware made specificaly for inference.
We're back to LSTM 😭😭
Context usage on frontier models is ridiculous these days. I like Gemini 3 pro but it’s genuinely incapable of basic tasks after maybe 10 minutes of conversation, it’s almost funny sometimes
Check out Inngest and let your AI agents wear a harness now https://innge.st/yt-bycl-1
"Google has solved it, guys!" Google: *Nervously tugs collar while issuing 100 year bonds*
Oh thats why flash 3 was trained as an olympic regabaiter. Its such a good model, that is sooo annoying
Unlock the Data Inside
Turn Videos into Knowledge
- Get FREE 10/day: transcripts, summaries, chats
- Chat with videos, export text & PDF
- $1 free API credit for RAG, chatbots & research
Free forever plan • All features unlocked
Top Comments (10)
Google was only offering that long context window for the first few weeks after Gemini 3 launch. They quietly rugpulled once the hype was past peak and the exodus from GPT was well under way. Now you only get 32K tokens even with Pro and it's just god awful. Reasoning performance tanked too after Google cut the precision of parameter datatypes to save more money. So, I don't think Google solved anything at all. Their secret sauce was being the biggest and most profitable player in the game, which allowed them to burn who knows how much cash to fool everyone into thinking they'd solved it, and switch over to their inferior product on 6-12 month plans or whatever, leading nVidia to abandon the 100B investment OpenAI was depending on to stay solvent. Honestly it looks like this "innovation" is innovative in the same way that Altman's ploy to buy up all the RAM wafer supplies to engineer the shortage, without actually using those wafers, was. Basically, a scam. So sick of this anti-consumer, anti-competitive, big business bullshit.
video idea: LLM's billion watt problem
I took a shot for every 'linear' in this video. Hello from spirit realm.
The best change I made to my AI assisted coding workflow was to limit the project size. This is an known software practice of working in "modules", rather than a monolithic code base. Modules communicate with one another as separate apps. This limits your context needs dramatically.
16:40 i think your conclusion about google having solved it may be wrong, don't forget they have their own custom hardware made specificaly for inference.
We're back to LSTM 😭😭
Context usage on frontier models is ridiculous these days. I like Gemini 3 pro but it’s genuinely incapable of basic tasks after maybe 10 minutes of conversation, it’s almost funny sometimes
Check out Inngest and let your AI agents wear a harness now https://innge.st/yt-bycl-1
"Google has solved it, guys!" Google: *Nervously tugs collar while issuing 100 year bonds*
Oh thats why flash 3 was trained as an olympic regabaiter. Its such a good model, that is sooo annoying