Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline
BeginnerGuo Chen, Lidong Lu et al.Mar 5arXiv
This paper introduces MM-Lifelong, a 181-hour, multi-scale video dataset designed to test AI on true long-term (lifelong) understanding across days to months.
#multimodal lifelong understanding#long video reasoning#working memory bottleneck