RAG is Dead. Again. (Claude Agent SDK + Memory)
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Related videos
Software engineering is dead now
Theo - t3․gg
165.4k views
Anthropic confirms software engineering is NOT dead
ThePrimeTime
341.6k views
Sonnet 4.5 Is Here—And It’s a Beast at Coding
Prompt Engineering
52.0k views
GPT-OSS Jailbreak with this Simple Trick
Prompt Engineering
54.4k views
What is AI Engineering
Telusko
67.6k views
Context Engineering is All You NEED!
Prompt Engineering
38.7k views
The Only Embedding Model You Need for RAG
Prompt Engineering
35.2k views
Context Engineering is the future of AI Agents - here’s why
David Ondrej
44.4k views
Gemini CLI — Google’s Free Open-Source Coding Agent
Prompt Engineering
56.6k views
Anthropic’s Blueprint for Building Lean, Powerful AI Agents
Prompt Engineering
45.0k views
Top Comments (9)
Agentic File Search is a very interesting idea for RAG, and from my own experience using it, I found the retrieval accuracy to be quite impressive. However, I think the biggest challenges are latency and token usage. In domains where the volume of documents is massive and documents are continuously updated or removed, operating this kind of pipeline in a stable and scalable way is not easy. I believe Graph RAG has similar limitations as well. In environments where documents exist at large scale and change continuously, the cost of maintaining and synchronizing the graph becomes significant, which makes it difficult to operate reliably at a production level. Personally, I’m curious whether you have any ideas for reducing latency in these kinds of systems.
GraphRAG already solves the issues with classic RAG.
Nice. Would be cool to choose the agent.
If it’s already read the files, why does it need semantic search?
Will Anthropic block this as well soon?
is it open source and free and uncensored, why is it not integrated in a jarvis like linux distro, download and install onto a usb key or external usb drive, boot from it, checks the hw and moves in with audio in and out by default, permanent memory rag and ability to work with webcams and do tasks in browser...from than on it learns on its own
🥱
IMHO, your chuncking part is too simple and could be way more efficient.
First "RAG is dead" video in 2026 👏
Unlock the Data Inside
Turn Videos into Knowledge
- Get FREE 10/day: transcripts, summaries, chats
- Chat with videos, export text & PDF
- $1 free API credit for RAG, chatbots & research
Free forever plan • All features unlocked
Top Comments (9)
Agentic File Search is a very interesting idea for RAG, and from my own experience using it, I found the retrieval accuracy to be quite impressive. However, I think the biggest challenges are latency and token usage. In domains where the volume of documents is massive and documents are continuously updated or removed, operating this kind of pipeline in a stable and scalable way is not easy. I believe Graph RAG has similar limitations as well. In environments where documents exist at large scale and change continuously, the cost of maintaining and synchronizing the graph becomes significant, which makes it difficult to operate reliably at a production level. Personally, I’m curious whether you have any ideas for reducing latency in these kinds of systems.
GraphRAG already solves the issues with classic RAG.
Nice. Would be cool to choose the agent.
If it’s already read the files, why does it need semantic search?
Will Anthropic block this as well soon?
is it open source and free and uncensored, why is it not integrated in a jarvis like linux distro, download and install onto a usb key or external usb drive, boot from it, checks the hw and moves in with audio in and out by default, permanent memory rag and ability to work with webcams and do tasks in browser...from than on it learns on its own
🥱
IMHO, your chuncking part is too simple and could be way more efficient.
First "RAG is dead" video in 2026 👏