Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking
IntermediateMingxin Li, Yanzhao Zhang et al.Jan 8arXiv
This paper builds two teamwork models, Qwen3-VL-Embedding and Qwen3-VL-Reranker, that understand text, images, visual documents, and videos in one shared space so search works across all of them.
#multimodal retrieval#unified embedding space#cross-encoder reranker