Gemini 3.5 FLASH: BAD to OUTSTANDING
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Related videos
this is going to get bad
Timcast
120.4k views
Discovery of an Invisible Flat Structure Shaping Our Galaxy
Anton Petrov
73.8k views
Gemini Flash 3 is my new favorite model (yes really)
Theo - t3․gg
66.7k views
Deepseek V3.2 Beats GPT-5 and Gemini 3 Pro - Chinese AI Destroying US Tech
Eli the Computer Guy
24.6k views
Discovery Hearings on Trooper Proctor Texts. What's getting turned over.
Emily D. Baker
134.9k views
Trump has DISASTER LANDING in DC over VERY BAD NEWS
MeidasTouch
308.1k views
r/AITA for Giving $2,500,000 to a 12-year-old?
rSlash
135.8k views
Read Prosecutor Adam Lally Testifies at Discovery Hearing
Emily D. Baker
215.4k views
The Truth about AI is Devastating: Proof by MIT, Harvard
Discover AI
65.2k views
r/AITA For Divorcing over Mr. Beast?
rSlash
181.2k views
Top Comments (10)
Thanks! This content is very useful.
curious if the trace shows an actual causal model or just iterative patching until something sticks
Impressive! Why keep the web search enabled though? It doesn't really matter for your custom prompt and it didn't use the web according to the trace, but still.
Regarding the discussion about grounding: would it not be clear within the reasoning trace if it were referencing Google search or particular tool calls that would have access to YouTube transcripts? I think it’s very exciting that this model happened upon the answer so quickly.
It would be interesting to know in 3 independent attempts how it would do.
You can also use it in flex mode and that cuts the price in 1/2
Grounding with Google Search should be disabled. It could be using YouTube previous videos.
I do love the way when RL became as clear as the semantic nature of the answers. Whatever the LLM results to your prompt. Have you ever try any gradual approach ? Easy ways, i think you will manage 😊 NB: about CoT... you know it has nothing to do with the real "thinking" . You reveal the trick, now explain the magic ?
Thank you for jumping on this model so quickly.I have to agree with some of the other commenters -- grounding should have been turned off, especially given the jump to 8 steps -- perhaps it's no coincidence that the flash model got exactly the best answer that happened to be only reached by its sibling model. It would be very interesting to see this test run again with grounding turned off. Indeed I asked the flash model and it said it should be turned off "otherwise it allows the model to look up the exact test question, find existing answer keys or leaked solutions online and simply copy the results."
it's basically "not giving up" strategy, interesting...
Unlock the Data Inside
Turn Videos into Knowledge
- Get FREE 10/day: transcripts, summaries, chats
- Chat with videos, export text & PDF
- $1 free API credit for RAG, chatbots & research
Free forever plan • All features unlocked
Top Comments (10)
Thanks! This content is very useful.
curious if the trace shows an actual causal model or just iterative patching until something sticks
Impressive! Why keep the web search enabled though? It doesn't really matter for your custom prompt and it didn't use the web according to the trace, but still.
Regarding the discussion about grounding: would it not be clear within the reasoning trace if it were referencing Google search or particular tool calls that would have access to YouTube transcripts? I think it’s very exciting that this model happened upon the answer so quickly.
It would be interesting to know in 3 independent attempts how it would do.
You can also use it in flex mode and that cuts the price in 1/2
Grounding with Google Search should be disabled. It could be using YouTube previous videos.
I do love the way when RL became as clear as the semantic nature of the answers. Whatever the LLM results to your prompt. Have you ever try any gradual approach ? Easy ways, i think you will manage 😊 NB: about CoT... you know it has nothing to do with the real "thinking" . You reveal the trick, now explain the magic ?
Thank you for jumping on this model so quickly.I have to agree with some of the other commenters -- grounding should have been turned off, especially given the jump to 8 steps -- perhaps it's no coincidence that the flash model got exactly the best answer that happened to be only reached by its sibling model. It would be very interesting to see this test run again with grounding turned off. Indeed I asked the flash model and it said it should be turned off "otherwise it allows the model to look up the exact test question, find existing answer keys or leaked solutions online and simply copy the results."
it's basically "not giving up" strategy, interesting...