Big picture: Vision-language models process each image as hundreds of small pieces (tokens), which makes them slow and prone to confidently stating things that aren't there, mistakes called hallucinations.
The paper shows that changing the language a model 'thinks in' (its language of thought) can make its English answers more varied with little loss in quality.