MAEB is a giant, fair report card for audio AI that tests 50+ models on 30 tasks covering speech, music, environmental sounds, and audio–text settings in 100+ languages.
ResearchGym is a new "gym" where AI agents are tested on real research projects end to end, not just on toy problems.
The paper tackles the problem of understanding super long first‑person videos (lasting days to a week) by giving an AI a smarter memory and better tools.
DanQing is a fresh, 100-million-pair Chinese image–text dataset collected from 2024–2025 web pages and carefully cleaned for training AI that understands pictures and Chinese text together.