HERMES is a training-free way to make video-language models understand live, streaming video quickly and accurately.
This survey asks how close AI memory systems are to human memory and organizes the answer into three parts: implicit memory (stored inside the model), explicit memory (external storage the model can look up), and agentic memory (what an AI agent keeps over time to plan and act).
This paper shows how to get strong text embeddings from decoder-only language models without any training.
MorphAny3D is a training-free way to smoothly change one 3D object into another, even when the two are very different (like a bee into a biplane).
LitePT is a new AI backbone for 3D point clouds that uses convolutions in early layers and attention in later layers to be both fast and accurate.
Scone is a new AI method that generates images from instructions while correctly picking the right subject even when several candidates look similar.