Navigate Select ESC Close

DeepSeek Just Killed Visual Reasoning (And It's 10× Cheaper)

2026-05-02 Science & Technology
3.7k
172
17
Prompt Engineering
Prompt Engineering
241.0k subscribers

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

Deepseek new paper "Thinking with Visual Primitives". They dropped the paper in their github repo and then removed it. Here is the paper: https://github.com/ailuntx/Thinking-with-Visual-Primitives/blob/main/Thinking_with_Visual_Primitives.pdf My voice to text App: whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/rag Signup for Newsletter, localgpt: https://tally.so/r/3y9bb0 Let's Connect: 🦾 Discord: https://discord.com/invite/t4eYQRUcXB ☕ Buy me a Coffee: https://ko-fi.com/promptengineering |🔴 Patreon: https://www.patreon.com/PromptEngineering 💼Consulting: https://calendly.com/engineerprompt/consulting-call 📧 Business Contact: [email protected] Become Member: http://tinyurl.com/y5h28s6h 💻 Pre-configured localGPT VM: https://bit.ly/localGPT (use Code: PromptEngineering for 50% off). Signup for Newsletter, localgpt: https://tally.so/r/3y9bb0

Top Comments (10)

@svuvich 2026-05-02

I was a bit bummed that v4 didn't come out with vision initially. Even without vision this model is a beast. I believe DS v4 Flash is the first actually economically feasible model for an average person to use without needing to subsidize them with venture capitalist money

20 1 replies
@synargproductions6862 2026-05-02

Deepseek v4 is insane in the QA Panda VSCode extension 🎉🎉🎉🎉🎉🎉🎉

9
@zickzack3858 2026-05-03

end what did happen to ENGRAM ?

3
@TLCMEDIA1 2026-05-03

How did you edit/Create your video? What did you use ?

1 2 replies
@sunshinesun121 2026-05-05

Wow!!! The implications of Visual Processing By DEEPSEEK V4 is significant. Autonomous driving will need less Powerful Chips and Less memory will have significant savings. 😮😮😮

1
@miker3298 2026-05-04

Brilliant analysis

1
@RickySupriyadi 2026-05-06

7:44 hm... isnt gemini vision agent able do that too? and i can imagine what other labs going to do to their researchers... "dont go home we will refractor all VLM" 😭

0 2 replies
@coffeewithmilk563 2026-05-07

Thanks for speaking clearly, I can tell you're trying

0
@davidtitanium22 2026-05-05

the vision model is at least the second half of the year, but i can't wait for an affordable vision workhorse

0
@ajaytaneja111 2026-05-10

Are we saying the bounding box coordinates are part of input embeddings and hence the Attention process as per DeepSeek paper?

0

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot