Papers5

#video reasoning

A Very Big Video Reasoning Suite

Maijunxian Wang, Ruisi Wang et al.Feb 23arXiv

This paper builds a gigantic library of video puzzles (VBVR) so AI can practice not just making pretty videos, but actually thinking through what happens over time.

#video reasoning#rule-based evaluation#in-domain generalization

Not triaged yet

Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning

Intermediate

Chengzu Li, Zanyi Wang et al.Jan 28arXiv

This paper shows that making short videos can help AI plan and reason in pictures better than writing out steps in text.

#video reasoning#visual planning#test-time scaling

Not triaged yet

VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Intermediate

Shuming Liu, Mingchen Zhuge et al.Jan 8arXiv

The paper asks a simple question: do video AIs really need to “think out loud” every time, or can they answer quickly most of the time and think deeply only when needed?

#video reasoning#adaptive reasoning#early exit

Not triaged yet

AdaTooler-V: Adaptive Tool-Use for Images and Videos

Intermediate

Chaoyang Wang, Kaituo Feng et al.Dec 18arXiv

AdaTooler-V teaches an image-and-video AI to first ask, “Do I really need a tool?” before using one, which saves time and boosts accuracy.

#adaptive tool-use#multimodal chain-of-thought#visual tool interactions

Not triaged yet

Rethinking Chain-of-Thought Reasoning for Videos

Intermediate

Yiwu Zhong, Zi-Yuan Hu et al.Dec 10arXiv

The paper shows that video AIs do not need long, human-like chains of thought to reason well.

#video reasoning#chain-of-thought#concise reasoning

Not triaged yet