Kimi K2.5 is a new open-source AI model that can read both text and visuals (images and videos) and act like a team of helpers to finish big tasks faster.
The paper tackles how AI agents can research the open web when the answers are hidden inside long, messy videos rather than in text alone.