Phase Transitions in Agent Memory: Recurrent Memory
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Unlock all features
FREE: Get instant access to 10 AI summaries, chats, or transcripts per day.
Related videos
Trump PANICS over HIS DEPOSITION in EMERGENCY MOTION!!!
MeidasTouch
19.5k views
Discoveries That Rewrite the Mystery | The Curse of Oak Island
HISTORY
38.9k views
Wagner Moura - Preserving Historical Memory in “The Secret Agent” | The Daily Show
The Daily Show
45.4k views
Trump's Agent FACE RECKONING in COURT over VIOLATIONS
Legal AF
43.7k views
The Media Is In A Mass PANIC Over The White House Renovations
Matt Walsh
71.6k views
Trump ACCIDENTALLY OPENS Discovery INTO HIMSELF in CASE
MeidasTouch
662.6k views
GAZA Phase One Agreement
The Duran
38.6k views
Trans-Dimensional Mysteries Revealed | Ancient Aliens
HISTORY
62.5k views
Trump ACCIDENTALLY Opens FULL DISCOVERY into HIS DARK PAST
MeidasTouch
310.4k views
Strange Dark Matter Discoveries That Can't Be Explained With Current Models
Anton Petrov
97.8k views
Top Comments (9)
A date-index overview. In the date overview each date lists different keywords that will be used for the index search. In the index overview, beneth these keywords are other dates listed that can be used to pull "memory" into a context.
Thanks as always for guiding us through these interesting papers. As I understand this one, the compute involved in creating the three containers of memory, evaluating this phase transition, and the notion that recurrence equals salience, all of this is problematic. And once you've massaged all this data you still only have whatever attention makes possible, you haven't somehow magically increased what's the model can remember, "subconsciously"or otherwise, is still limited to whatever fits in the context window. It's optimized ICL, but at the cost of quite a few tokens. Once you've got it all set up I'm sure you have a nice efficient process at inference, but you have to include the compute cost of preparing everything for this memory approach to yield a bottom line cost benefit.
I wonder if those "memories" could be compressed in theme blocks. Using a "relationship-factors"(?) to decribe how close different memories are connected. A kind of similar how LLM work, just simpler and for "Memories"/Textblocks.
Shouldn't the compression apply to memories that aren't often used just as well, though somewhat different in LLM prompting/approach and an analogous hyperparameter search?
is the density threshold learned or hardcoded? because 87% savings means nothing if consolidation drops the one fact the agent actually needs at runtime
I think we need recursive tokenization methods (recursive residual quantization) with these types of methods, so that the manifolds can be interchanged without breaking - but I think the gap is still effective RL harnesses - temporal functions only get you so far - maybe these state spaces need environmental/domain/modality grounding.
THX
How do I connect with you to assist with a project I am working on?
14:45 Finally. An accredited scientist backing up allergies as weakness.
Unlock the Data Inside
Turn Videos into Knowledge
- Get FREE 10/day: transcripts, summaries, chats
- Chat with videos, export text & PDF
- $1 free API credit for RAG, chatbots & research
Free forever plan • All features unlocked
Top Comments (9)
A date-index overview. In the date overview each date lists different keywords that will be used for the index search. In the index overview, beneth these keywords are other dates listed that can be used to pull "memory" into a context.
Thanks as always for guiding us through these interesting papers. As I understand this one, the compute involved in creating the three containers of memory, evaluating this phase transition, and the notion that recurrence equals salience, all of this is problematic. And once you've massaged all this data you still only have whatever attention makes possible, you haven't somehow magically increased what's the model can remember, "subconsciously"or otherwise, is still limited to whatever fits in the context window. It's optimized ICL, but at the cost of quite a few tokens. Once you've got it all set up I'm sure you have a nice efficient process at inference, but you have to include the compute cost of preparing everything for this memory approach to yield a bottom line cost benefit.
I wonder if those "memories" could be compressed in theme blocks. Using a "relationship-factors"(?) to decribe how close different memories are connected. A kind of similar how LLM work, just simpler and for "Memories"/Textblocks.
Shouldn't the compression apply to memories that aren't often used just as well, though somewhat different in LLM prompting/approach and an analogous hyperparameter search?
is the density threshold learned or hardcoded? because 87% savings means nothing if consolidation drops the one fact the agent actually needs at runtime
I think we need recursive tokenization methods (recursive residual quantization) with these types of methods, so that the manifolds can be interchanged without breaking - but I think the gap is still effective RL harnesses - temporal functions only get you so far - maybe these state spaces need environmental/domain/modality grounding.
THX
How do I connect with you to assist with a project I am working on?
14:45 Finally. An accredited scientist backing up allergies as weakness.