What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models
BeginnerDasol Choi, Guijin Son et al.Jan 7arXiv
Real people often ask vague questions with pictures, and todayβs vision-language models (VLMs) struggle with them.
#vision-language models#under-specified queries#query explicitation