Few Tokens Matter: Entropy Guided Attacks on Vision-Language Models
IntermediateMengqi He, Xinyu Tian et al.Dec 26arXiv
The paper shows that when vision-language models write captions, only a small set of uncertain words (about 20%) act like forks that steer the whole sentence.
#vision-language models#autoregressive generation#entropy