LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs
Beginner · Benno Krojer, Shravan Nayak et al. · Jan 31 · arXiv
LatentLens is a simple, training-free way to translate what a model "sees" in image patches into clear words and phrases.
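To make the idea concrete, here is a minimal sketch of one way a training-free lookup like this could work: compare a patch's vector against a bank of labeled text embeddings and return the closest labels. This is an illustration under stated assumptions, not the authors' implementation. The function names (`cosine`, `describe_patch`), the choice of cosine similarity, and the toy 3-dimensional vectors are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def describe_patch(patch_vec, text_bank, k=2):
    """Return the k labels in text_bank whose embeddings are most similar to patch_vec.

    text_bank is a list of (label, vector) pairs. No training is involved:
    this is a plain nearest-neighbor lookup.
    """
    ranked = sorted(text_bank, key=lambda item: cosine(patch_vec, item[1]), reverse=True)
    return [label for label, _ in ranked[:k]]


# Toy, made-up embeddings (assumption: real ones would come from the model itself).
TEXT_BANK = [
    ("dog", [0.9, 0.1, 0.0]),
    ("puppy", [0.8, 0.3, 0.1]),
    ("sky", [0.0, 0.2, 0.9]),
]

patch = [1.0, 0.2, 0.0]  # hypothetical vector for one image patch
print(describe_patch(patch, TEXT_BANK))
```

In a real setting the patch vector and the text bank would be read from the model's own hidden states rather than written by hand, and the bank would hold many words or phrases.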
#LatentLens · #visual tokens · #contextual embeddings