🎓How I Study AIHISA

📖Read

📄Papers 📰Blogs 🎬Courses

💡Learn

🛤️Paths 📚Topics 💡Concepts 🎴Shorts

🎯Practice

📝Daily Log 🎯Prompts 🧠Review

Search Settings

How I Study AI - Learn AI Papers & Lectures the Easy Way

Papers2

All Beginner Intermediate Advanced

All Sources arXiv

#benchmarking MLLMs

BabyVision: Visual Reasoning Beyond Language

Liang Chen, Weichu Xie et al.Jan 10arXiv

BabyVision is a new test that checks if AI can handle the same basic picture puzzles that young children can do, without leaning on language tricks.

#BabyVision#visual reasoning#multimodal large language models

Not triaged yet

Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification

Qihao Liu, Chengzhi Mao et al.Dec 18arXiv

AuditDM is a friendly 'auditor' model that hunts for where vision-language models get things wrong and then creates the right practice to fix them.

#AuditDM#model auditing#cross-model divergence

Not triaged yet