This paper teaches long-horizon AI agents to remember everything exactly without stuffing their whole memory at once.
This paper teaches a language-model agent to explore smarter by combining two ways of learning (on-policy and off-policy) with a simple, self-written memory.
LatentMem is a new memory system that helps teams of AI agents remember the right things for their specific jobs without overloading them with text.
The paper asks a simple question: if a language model becomes better at step-by-step reasoning (using RLVR), do its text embeddings also get better? The short answer is no.