This paper is the first big map of how AI can fix real software problems, not just write short code snippets.
This paper builds LIBERTy, a new way to fairly judge how well AI explains its decisions about big, human ideas like age, race, or experience.
FOFPred is a new AI that reads one or two images plus a short instruction like “move the bottle left to right,” and then predicts how every pixel will move in the next moments.
PACEvolve is a new recipe that helps AI agents improve their ideas step by step over long periods without getting stuck.
Molmo2 is a family of vision-language models that can watch videos, understand them, and point to or track things over time using fully open weights, data, and code.
Action100M is a gigantic video dataset with about 100 million labeled action moments built automatically from 1.2 million instructional videos.
This paper teaches video-making AIs to follow real-world physics better without retraining them.
HeartMuLa is a family of open-source music AI models that can understand and generate full songs with clear lyrics and strong musical structure.
Cities are full of places defined by people, like schools and parks, which are hard to see clearly from space without extra clues.
This paper builds an AI agent, ML-Master 2.0, that can work on machine learning projects for a very long time without forgetting what matters.
Language models can act like many characters, but they usually aim to be a helpful Assistant after post-training.
The paper shows a new way to teach AI assistants how to use tools in many-step conversations by mining ordinary text on the internet for step-by-step “how-to” knowledge.