Voxtral Realtime is a speech-to-text model that types what you say almost instantly, while keeping accuracy close to the best offline systems.
Knowledge graphs are like giant fact maps, and keeping every fact correct is hard and important.
LiveMedBench is a new, always-updating test for medical AIs that keeps test questions safely separated from training data to avoid cheating by memorization.
Modern image editors can now follow visual prompts like arrows and scribbles, which opens a new way for attackers to hide harmful instructions inside images.
The paper asks a simple question: which kind of step-by-step reasoning helps small language models learn best, and why?
SceneSmith is a smart team of AI helpers that turns a short text like 'a cozy study with books and a desk' into a full 3D home scene you can drop right into a robot simulator.
WorldCompass teaches video world models to follow actions better and keep pictures pretty by using reinforcement learning after pretraining.
Robots often get confused by wordy instructions, so this paper tells them exactly where to touch instead of what to do in sentences.
This paper teaches a language model to improve its own math answers by first writing several drafts and then learning to beat its best draft.
LLaDA2.1 teaches a diffusion-style language model to write fast rough drafts and then fix its own mistakes by editing tokens it already wrote.
This paper teaches AI to learn how-to steps from demonstrations in the moment, the way people do.
NarraScore turns a video's changing story into a matching soundtrack by using emotion as the bridge.