Navigate Select ESC Close

There's a new best OSS model and it's...weird

2025-03-13 Science & Technology
55.2k
1.5k
155
Theo - t3․gg
Theo - t3․gg
539.0k subscribers

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

There's another open source reasoning model: Qwen's QwQ. It's benchmarks are nuts, but I don't actually think it's all that good... Thank you Sevalla for sponsoring! Check them out at: https://soydev.link/sevalla Also check out t3 chat? https://soydev.link/chat SOURCE https://qwenlm.github.io/blog/qwq-32b/ Want to sponsor a video? Learn more here: https://soydev.link/sponsor-me Check out my Twitch, Twitter, Discord more at https://t3.gg S/O Ph4se0n3 for the awesome edit 🙏

Top Comments (10)

@vizvamitraa 2025-03-13

WAIT -- "Wait's Average Index by Theo"

89
@gamergod9182 2025-03-13

so, it's essentially that Punisher meme where he goes "wait!wait!wait! WAITWAITWAITWAITWAIT!!"

25
@emeraldbonsai 2025-03-13

i swear i saw this model like 2 weeks ago

245 6 replies
@VeaceslavBARBARII 2025-03-14

I'm not hyped about LLMs, but your videos are very helpful. You're saving us time. Otherwise, we'd have to go through every other LLM ourselves and see what it offers. Thank you!

3
@johndavidson8096 2025-03-13

Video seems a bit late but I thoroughly enjoyed your breakdown regardless. I hope you continue making all these kinds of videos, you’re one of my go-to-sources for programming news lol

7
@emiliog07 2025-03-13

I played a drinking game, where every time Theo says interesting I take a drink. I ended up in the hospital.

25 1 replies
@MaxPicAxe 2025-03-13

19:50 I love how AI predicted a human taking over the chat and complaining that the code is wrong.

31
@Dev_Vaghela25 2025-03-14

We missed rotating hexagon with red ball in it benchmark

1
@Oglokoog 2025-03-14

The "hmm/wait" benchmark is amazing.

0
@justindressler5992 2025-03-15

Its tricky to setup I've found. The biggest tricks are to set repetition penalty to 1 because it is a reasoning model it can repeat things while thinking if the penalty is applied the reasoning fails. Second put the <think> tag with new line in the system prompt or instruct template. Tells the model to only start thinking after the context.

3 3 replies

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot