This paper teaches AI to pay better attention by training its focus, not just its words.
The paper shows that video AIs do not need long, human-like chains of thought to reason well.