VideoLoom is a single AI model that can tell both when something happens in a video and where it happens, at the pixel level.
Zoom-Zero helps AI answer questions about videos by first finding the right moment and then zooming in to double-check tiny details.
Long Video Understanding (LVU) is hard because the important clues are tiny, far apart in time, and buried in hours of mostly unimportant footage.
ReVSeg teaches an AI to segment objects in videos by thinking step-by-step instead of guessing everything at once.