VG-Refiner: Towards Tool-Refined Referring Grounded Reasoning via Agentic Reinforcement Learning
Intermediate | Yuji Wang, Wenlong Liu et al. | Dec 6 | arXiv
VG-Refiner is a new way for AI to find the right object in a picture when given a description, even if helper tools make mistakes.
#visual grounding | #referring expression comprehension | #tool-integrated visual reasoning