Video-Browser: Towards Agentic Open-web Video Browsing
BeginnerZhengyang Liang, Yan Shu et al.Dec 28arXiv
The paper tackles how AI agents can truly research the open web when the answers are hidden inside long, messy videos, not just text.
#agentic video browsing#pyramidal perception#video understanding