How I Study AI - Learn AI Papers & Lectures the Easy Way

Specificity-aware reinforcement learning for fine-grained open-world classification

Intermediate

Samuele Angheben, Davide Berasi et al.Mar 3arXiv

This paper teaches AI to name things in pictures very specifically (like “golden retriever” instead of just “dog”) without making more mistakes.

#open-world classification#fine-grained recognition#large multimodal models

A Very Big Video Reasoning Suite

Intermediate

Maijunxian Wang, Ruisi Wang et al.Feb 23arXiv

This paper builds a gigantic library of video puzzles (VBVR) so AI can practice not just making pretty videos, but actually thinking through what happens over time.

#video reasoning#rule-based evaluation#in-domain generalization

See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning

Intermediate

Shuoshuo Zhang, Yizhen Zhang et al.Dec 26arXiv

The paper teaches vision-language models (AIs that look and read) to pay attention to the right picture parts without needing extra tools during answering time.

#BiPS#perceptual shaping#vision-language models

Papers3

Specificity-aware reinforcement learning for fine-grained open-world classification

A Very Big Video Reasoning Suite

See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning