The paper proposes the Laws of Reasoning (LORE), simple rules that say how much a model should think and how accurate it can be as problems get harder.
RadarGen is a tool that learns to generate realistic car radar point clouds just from multiple camera views.
ReCo is a new way to edit videos just by describing in words what to change, with no extra masks needed.
Robust-R1 teaches vision-language models to notice how a picture is damaged, think through what that damage hides, and then answer as if the picture were clear.
InsertAnywhere is a two-stage system that lets you add a new object into any video so it looks like it was always there.
Visual grounding is when an AI finds the exact thing in a picture that a sentence is talking about, and this paper shows today’s big vision-language AIs are not as good at it as we thought.
3D-RE-GEN turns a single photo of a room into a full 3D scene with separate, textured objects and a usable background.
The paper introduces UCoder, a way to teach a code-generating AI to get better without using any outside datasets, not even unlabeled code.
The paper introduces Canon layers, tiny add-ons that let nearby words share information directly, like passing notes along a row of desks.
Humans keep a big-picture memory (a “mindscape”) when reading long texts; this paper teaches AI to do the same.
Reasoning Palette gives a language or vision-language model a tiny hidden “mood” (a latent code) before it starts answering, so it chooses a smarter plan rather than just rolling dice on each next word.
This paper teaches AI agents to learn new reusable skills and get better over time by using reinforcement learning, not just prompts.