๐ŸŽ“How I Study AIHISA
๐Ÿ“–Read
๐Ÿ“„Papers๐Ÿ“ฐBlogs๐ŸŽฌCourses
๐Ÿ’กLearn
๐Ÿ›ค๏ธPaths๐Ÿ“šTopics๐Ÿ’กConcepts๐ŸŽดShorts
๐ŸŽฏPractice
๐Ÿ“Daily Log๐ŸŽฏPrompts๐Ÿง Review
SearchSettings
How I Study AI - Learn AI Papers & Lectures the Easy Way

Papers2

AllBeginnerIntermediateAdvanced
All SourcesarXiv
#QwenVL

MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning

Intermediate
Chenhao Zhang, Yazhe Niu et al.Feb 11arXiv

Pictures can hide deeper meanings, like a wilted plant meaning someone feels burned out; most AI models miss these hints.

#image metaphor understanding#image implication#visual reinforcement learning

SAMTok: Representing Any Mask with Two Words

Intermediate
Yikang Zhou, Tao Zhang et al.Jan 22arXiv

SAMTok turns any objectโ€™s mask in an image into just two special โ€œwordsโ€ so language models can handle pixels like they handle text.

#SAMTok#mask tokenizer#residual vector quantization