Home
Channel
Prompt Engineering
RAG is Dead. Again. (Claude Agent SDK + Memory)

RAG is Dead. Again. (Claude Agent SDK + Memory)

2026-05-14 Science & Technology

2.1k

241.0k subscribers

Description

Checkout Milvus (44k stars on GitHub): https://github.com/milvus-io/milvus Claude can do a lot more than coding. In this video, I show you how to build a dual memory system using the Claude Agent SDK that combines vector search with file system tools to give your agents persistent, multi-layered retrieval over complex documents. LINKS: Project Github Repo: https://github.com/PromtEngineer/ParseRAG Signup Zilliz Cloud (managed Milvus) to claim $100 credits with business emails: https://tinyurl.com/zilliz-prompt-engineer My Dictation App: www.whryte.com Website: https://engineerprompt.ai/ RAG Beyond Basics Course: https://prompt-s-site.thinkific.com/courses/rag Signup for Newsletter, localgpt: https://tally.so/r/3y9bb0 Let's Connect: 🦾 Discord: https://discord.com/invite/t4eYQRUcXB ☕ Buy me a Coffee: https://ko-fi.com/promptengineering |🔴 Patreon: https://www.patreon.com/PromptEngineering 💼Consulting: https://calendly.com/engineerprompt/consulting-call 📧 Business Contact: [email protected] Become Member: http://tinyurl.com/y5h28s6h 💻 Pre-configured localGPT VM: https://bit.ly/localGPT (use Code: PromptEngineering for 50% off). Signup for Newsletter, localgpt: https://tally.so/r/3y9bb0

#prompt engineering #Prompt Engineer #LLMs #AI #artificial Intelligence #Llama #GPT-4 #fine-tuning LLMs

Top Comments (9)

@topgunlee7198 2026-05-14

Agentic File Search is a very interesting idea for RAG, and from my own experience using it, I found the retrieval accuracy to be quite impressive. However, I think the biggest challenges are latency and token usage. In domains where the volume of documents is massive and documents are continuously updated or removed, operating this kind of pipeline in a stable and scalable way is not easy. I believe Graph RAG has similar limitations as well. In environments where documents exist at large scale and change continuously, the cost of maintaining and synchronizing the graph becomes significant, which makes it difficult to operate reliably at a production level. Personally, I’m curious whether you have any ideas for reducing latency in these kinds of systems.

3 1 replies

@bpachat 2026-05-14

GraphRAG already solves the issues with classic RAG.

@kabronell 2026-05-14

Nice. Would be cool to choose the agent.

0 2 replies

@justdavebz 2026-05-15

If it’s already read the files, why does it need semantic search?

0 1 replies

@simonkaralyus 2026-05-15

Will Anthropic block this as well soon?

0 1 replies

@radeksparowski7174 2026-05-14

is it open source and free and uncensored, why is it not integrated in a jarvis like linux distro, download and install onto a usb key or external usb drive, boot from it, checks the hw and moves in with audio in and out by default, permanent memory rag and ability to work with webcams and do tasks in browser...from than on it learns on its own

@federalwardogs 2026-05-14

🥱

@PhunkyBob 2026-05-17

IMHO, your chuncking part is too simple and could be way more efficient.

@alexisdamnit9012 2026-05-14

First "RAG is dead" video in 2026 👏

Description

Top Comments (9)

@topgunlee7198 2026-05-14

3 1 replies

@bpachat 2026-05-14

GraphRAG already solves the issues with classic RAG.

@kabronell 2026-05-14

Nice. Would be cool to choose the agent.

0 2 replies

@justdavebz 2026-05-15

If it’s already read the files, why does it need semantic search?

0 1 replies

@simonkaralyus 2026-05-15

Will Anthropic block this as well soon?

0 1 replies

@radeksparowski7174 2026-05-14

@federalwardogs 2026-05-14

🥱

@PhunkyBob 2026-05-17

IMHO, your chuncking part is too simple and could be way more efficient.

@alexisdamnit9012 2026-05-14

First "RAG is dead" video in 2026 👏

Unlock the Data Inside
Turn Videos into Knowledge

Get FREE 10/day: transcripts, summaries, chats
Chat with videos, export text & PDF
$1 free API credit for RAG, chatbots & research

Try it free

Free forever plan • All features unlocked

RAG is Dead. Again. (Claude Agent SDK + Memory)

Description

Top Comments (9)

Related videos

Software engineering is dead now

Anthropic confirms software engineering is NOT dead

Sonnet 4.5 Is Here—And It’s a Beast at Coding

GPT-OSS Jailbreak with this Simple Trick

What is AI Engineering

Context Engineering is All You NEED!

The Only Embedding Model You Need for RAG

Context Engineering is the future of AI Agents - here’s why

Gemini CLI — Google’s Free Open-Source Coding Agent

Anthropic’s Blueprint for Building Lean, Powerful AI Agents

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Related videos

Software engineering is dead now

Anthropic confirms software engineering is NOT dead

Sonnet 4.5 Is Here—And It’s a Beast at Coding

GPT-OSS Jailbreak with this Simple Trick

What is AI Engineering

Context Engineering is All You NEED!

The Only Embedding Model You Need for RAG

Context Engineering is the future of AI Agents - here’s why

Gemini CLI — Google’s Free Open-Source Coding Agent

Anthropic’s Blueprint for Building Lean, Powerful AI Agents

Description

Top Comments (9)

Unlock the Data Inside
Turn Videos into Knowledge

RAG is Dead. Again. (Claude Agent SDK + Memory)

Description

Top Comments (9)

Related videos

Software engineering is dead now

Anthropic confirms software engineering is NOT dead

Sonnet 4.5 Is Here—And It’s a Beast at Coding

GPT-OSS Jailbreak with this Simple Trick

What is AI Engineering

Context Engineering is All You NEED!

The Only Embedding Model You Need for RAG

Context Engineering is the future of AI Agents - here’s why

Gemini CLI — Google’s Free Open-Source Coding Agent

Anthropic’s Blueprint for Building Lean, Powerful AI Agents

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Unlock all features

Related videos

Software engineering is dead now

Anthropic confirms software engineering is NOT dead

Sonnet 4.5 Is Here—And It’s a Beast at Coding

GPT-OSS Jailbreak with this Simple Trick

What is AI Engineering

Context Engineering is All You NEED!

The Only Embedding Model You Need for RAG

Context Engineering is the future of AI Agents - here’s why

Gemini CLI — Google’s Free Open-Source Coding Agent

Anthropic’s Blueprint for Building Lean, Powerful AI Agents

Description

Top Comments (9)

Unlock the Data Inside Turn Videos into Knowledge

Unlock the Data Inside
Turn Videos into Knowledge