Agentic Very Long Video Understanding
Intermediate · Aniket Rege, Arka Sadhu et al. · Jan 26 · arXiv
The paper tackles understanding very long first-person videos (spanning days to a week) by giving an AI a smarter memory and better tools.
#entity scene graph · #agentic planning · #long-horizon video understanding