RANKVIDEO is a video-native reasoning reranker that helps search engines find the right videos for a text query by directly looking at the video's visuals and audio, not just text captions.
JudgeRLVR teaches a model to be a strict judge of answers before it learns to generate them, which trims bad ideas early.
This paper builds two complementary models, Qwen3-VL-Embedding and Qwen3-VL-Reranker, that understand text, images, visual documents, and videos in one shared space so search works across all of them.