A2Eval: Agentic and Automated Evaluation for Embodied Brain
IntermediateShuai Zhang, Jiayu Hu et al.Feb 2arXiv
A2Eval is a two-agent system that automatically builds and runs fair tests for robot-style vision-language models, cutting wasted work while keeping results trustworthy.
#Embodied AI#Vision-Language Models#Agentic Evaluation