The paper asks when reinforcement learning (RL) really makes language models better at reasoning beyond what they learned in pre-training.
This paper shows a new way to teach an autoencoder to shape its hidden space (the 'latent space') to look like any distribution we want, not just a simple bell curve.
DeepCode is an AI coding system that turns long, complicated papers into full, working code repositories.
This paper teaches a language model to think along several paths at the same time instead of one step after another.
LLM multi-agent systems often fail quietly (no crash) and leave long, twisty logs that are hard to debug by hand.
Long Video Understanding (LVU) is hard because the important clues are tiny, far apart in time, and buried in hours of mostly unimportant footage.
This paper argues that the fastest and safest path to super-smart AI is for humans and AIs to improve together, not for AI to improve alone.
COOPER is a single AI model that both “looks better” (perceives depth and object boundaries) and “thinks smarter” (reasons step by step) to answer spatial questions about images.
VideoSSM is a new way to make long, stable, and lively videos by giving the model two kinds of memory: a short-term window and a long-term state-space memory.
Diffusion language models (dLLMs) can write all parts of an answer in parallel, but they usually take many tiny cleanup steps, which makes them slow.