A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning
IntermediateZixin Zhang, Kanghao Chen et al.Dec 16arXiv
This paper builds A4-Agent, a smart three-part helper that figures out where to touch or use an object just from a picture and a written instruction, without any extra training.
#affordance prediction#zero-shot learning#vision-language models