LMEB is a new test that checks whether text-embedding models can remember and find information across long stretches of time, not just in short, neat passages.
This paper explains how AI agents remember things across long conversations and why many current tests don't truly measure that memory.
This survey links how human brains remember things to how AI agents should remember things so they can act smarter over time.