Navigate Select ESC Close

RAG is Dead. Again. (Claude Agent SDK + Memory)

2026-05-14 Science & Technology
2.1k
69
8
Prompt Engineering
Prompt Engineering
241.0k subscribers

Unlock all features

FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.

Description

Checkout Milvus (44k stars on GitHub): https://github.com/milvus-io/milvus Claude can do a lot more than coding. In this video, I show you how to build a dual memory system using the Claude Agent SDK that combines vector search with file system tools to give your agents persistent, multi-layered retrieval over complex documents. LINKS: Project Github Repo: https://github.com/PromtEngineer/ParseRAG Signup Zilliz Cloud (managed Milvus) to claim $100 credits with business emails: https://tinyurl.com/zilliz-prompt-engineer My Dictation App: www.whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/rag Signup for Newsletter, localgpt: https://tally.so/r/3y9bb0 Let's Connect: 🦾 Discord: https://discord.com/invite/t4eYQRUcXB ☕ Buy me a Coffee: https://ko-fi.com/promptengineering |🔴 Patreon: https://www.patreon.com/PromptEngineering 💼Consulting: https://calendly.com/engineerprompt/consulting-call 📧 Business Contact: [email protected] Become Member: http://tinyurl.com/y5h28s6h 💻 Pre-configured localGPT VM: https://bit.ly/localGPT (use Code: PromptEngineering for 50% off). Signup for Newsletter, localgpt: https://tally.so/r/3y9bb0

Top Comments (9)

@topgunlee7198 2026-05-14

Agentic File Search is a very interesting idea for RAG, and from my own experience using it, I found the retrieval accuracy to be quite impressive. However, I think the biggest challenges are latency and token usage. In domains where the volume of documents is massive and documents are continuously updated or removed, operating this kind of pipeline in a stable and scalable way is not easy. I believe Graph RAG has similar limitations as well. In environments where documents exist at large scale and change continuously, the cost of maintaining and synchronizing the graph becomes significant, which makes it difficult to operate reliably at a production level. Personally, I’m curious whether you have any ideas for reducing latency in these kinds of systems.

3 1 replies
@bpachat 2026-05-14

GraphRAG already solves the issues with classic RAG.

1
@kabronell 2026-05-14

Nice. Would be cool to choose the agent.

0 2 replies
@justdavebz 2026-05-15

If it’s already read the files, why does it need semantic search?

0 1 replies
@simonkaralyus 2026-05-15

Will Anthropic block this as well soon?

0 1 replies
@radeksparowski7174 2026-05-14

is it open source and free and uncensored, why is it not integrated in a jarvis like linux distro, download and install onto a usb key or external usb drive, boot from it, checks the hw and moves in with audio in and out by default, permanent memory rag and ability to work with webcams and do tasks in browser...from than on it learns on its own

0
@federalwardogs 2026-05-14

🥱

0
@PhunkyBob 2026-05-17

IMHO, your chuncking part is too simple and could be way more efficient.

0
@alexisdamnit9012 2026-05-14

First "RAG is dead" video in 2026 👏

0

Unlock the Data Inside
Turn Videos into Knowledge

  • Get FREE 10/day: transcripts, summaries, chats
  • Chat with videos, export text & PDF
  • $1 free API credit for RAG, chatbots & research

Free forever plan • All features unlocked

App screenshot