There's a new best OSS model and it's...weird

2025-03-13 Science & Technology

55.2k

1.5k

155

Watch on YouTube

Theo - t3․gg

552.0k subscribers

Description

There's another open source reasoning model: Qwen's QwQ. It's benchmarks are nuts, but I don't actually think it's all that good... Thank you Sevalla for sponsoring! Check them out at: https://soydev.link/sevalla Also check out t3 chat? https://soydev.link/chat SOURCE https://qwenlm.github.io/blog/qwq-32b/ Want to sponsor a video? Learn more here: https://soydev.link/sponsor-me Check out my Twitch, Twitter, Discord more at https://t3.gg S/O Ph4se0n3 for the awesome edit 🙏

#web development #full stack #typescript #javascript #react #programming #programmer #theo

Top Comments (10)

@emeraldbonsai 2025-03-13

i swear i saw this model like 2 weeks ago

245 6 replies

@vizvamitraa 2025-03-13

WAIT -- "Wait's Average Index by Theo"

@MaxPicAxe 2025-03-13

19:50 I love how AI predicted a human taking over the chat and complaining that the code is wrong.

@emiliog07 2025-03-13

I played a drinking game, where every time Theo says interesting I take a drink. I ended up in the hospital.

25 1 replies

@gamergod9182 2025-03-13

so, it's essentially that Punisher meme where he goes "wait!wait!wait! WAITWAITWAITWAITWAIT!!"

@johndavidson8096 2025-03-13

Video seems a bit late but I thoroughly enjoyed your breakdown regardless. I hope you continue making all these kinds of videos, you’re one of my go-to-sources for programming news lol

@justindressler5992 2025-03-15

Its tricky to setup I've found. The biggest tricks are to set repetition penalty to 1 because it is a reasoning model it can repeat things while thinking if the penalty is applied the reasoning fails. Second put the <think> tag with new line in the system prompt or instruct template. Tells the model to only start thinking after the context.

3 3 replies

@VeaceslavBARBARII 2025-03-14

I'm not hyped about LLMs, but your videos are very helpful. You're saving us time. Otherwise, we'd have to go through every other LLM ourselves and see what it offers. Thank you!

@Dev_Vaghela25 2025-03-14

We missed rotating hexagon with red ball in it benchmark

@Oglokoog 2025-03-14

The "hmm/wait" benchmark is amazing.

Description

Top Comments (10)

@emeraldbonsai 2025-03-13

i swear i saw this model like 2 weeks ago

245 6 replies

@vizvamitraa 2025-03-13

WAIT -- "Wait's Average Index by Theo"

@MaxPicAxe 2025-03-13

19:50 I love how AI predicted a human taking over the chat and complaining that the code is wrong.

@emiliog07 2025-03-13

I played a drinking game, where every time Theo says interesting I take a drink. I ended up in the hospital.

25 1 replies

@gamergod9182 2025-03-13

so, it's essentially that Punisher meme where he goes "wait!wait!wait! WAITWAITWAITWAITWAIT!!"

@johndavidson8096 2025-03-13

Video seems a bit late but I thoroughly enjoyed your breakdown regardless. I hope you continue making all these kinds of videos, you’re one of my go-to-sources for programming news lol

@justindressler5992 2025-03-15

3 3 replies

@VeaceslavBARBARII 2025-03-14

I'm not hyped about LLMs, but your videos are very helpful. You're saving us time. Otherwise, we'd have to go through every other LLM ourselves and see what it offers. Thank you!

@Dev_Vaghela25 2025-03-14

We missed rotating hexagon with red ball in it benchmark

@Oglokoog 2025-03-14

The "hmm/wait" benchmark is amazing.

Unlock the Data Inside
Turn Videos into Knowledge

Get FREE 10/day: transcripts, summaries, chats
Chat with videos, export text & PDF
$1 free API credit for RAG, chatbots & research

Try it free

Free forever plan • All features unlocked

There's a new best OSS model and it's...weird

Description

Top Comments (10)

Related videos

Kimi K3 is the best model ever made (sometimes)

Oh no (the new Grok model is good)

FABLE IS BACK! (And Sonnet 5 is here too)

We all fell for it…

This model is kind of a disaster.

It's finally here.

Opus 4.6 Is The Best Coding Model Ever Made*

The Best Model For Frontend Design Is...

Kimi K2.5 might be my new favorite model...

GPT-5.2 is the best model ever made*

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Related videos

Kimi K3 is the best model ever made (sometimes)

Oh no (the new Grok model is good)

FABLE IS BACK! (And Sonnet 5 is here too)

We all fell for it…

This model is kind of a disaster.

It's finally here.

Opus 4.6 Is The Best Coding Model Ever Made*

The Best Model For Frontend Design Is...

Kimi K2.5 might be my new favorite model...

GPT-5.2 is the best model ever made*

Description

Top Comments (10)

Unlock the Data Inside
Turn Videos into Knowledge

There's a new best OSS model and it's...weird

Description

Top Comments (10)

Related videos

Kimi K3 is the best model ever made (sometimes)

Oh no (the new Grok model is good)

FABLE IS BACK! (And Sonnet 5 is here too)

We all fell for it…

This model is kind of a disaster.

It's finally here.

Opus 4.6 Is The Best Coding Model Ever Made*

The Best Model For Frontend Design Is...

Kimi K2.5 might be my new favorite model...

GPT-5.2 is the best model ever made*

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Related videos

Kimi K3 is the best model ever made (sometimes)

Oh no (the new Grok model is good)

FABLE IS BACK! (And Sonnet 5 is here too)

We all fell for it…

This model is kind of a disaster.

It's finally here.

Opus 4.6 Is The Best Coding Model Ever Made*

The Best Model For Frontend Design Is...

Kimi K2.5 might be my new favorite model...

GPT-5.2 is the best model ever made*

Description

Top Comments (10)

Unlock the Data Inside Turn Videos into Knowledge

Unlock the Data Inside
Turn Videos into Knowledge