The paper introduces PRIVASIS, a large, fully synthetic dataset (1.4 million records) filled with realistic-looking private details but created from scratch, so none of it belongs to any real person.
FASA is a training-free method that makes large language models faster and lighter on memory by keeping only the most useful past tokens during decoding.
When rewards are rare, a popular training method for language models (GRPO) often stops learning because every try in a group gets the same score, so there is nothing to compare.
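The stall described above follows from how group-relative advantages are computed. Here is a minimal sketch, assuming the common GRPO formulation in which each reward is standardized against the mean and standard deviation of its own group. The function name, the `eps` value, and the use of the population standard deviation are illustrative assumptions, not details taken from the paper.

```python
from statistics import mean, pstdev

def group_relative_advantages(rewards, eps=1e-8):
    """Standardize each reward against its own group (GRPO-style sketch).

    eps is an assumed small constant that avoids division by zero.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; implementations may differ
    return [(r - mu) / (sigma + eps) for r in rewards]

# One success in the group: the successful try stands out from the rest.
mixed = group_relative_advantages([1.0, 0.0, 0.0, 0.0])

# Rare rewards: every try fails, so every advantage is exactly zero
# and the policy gradient carries no signal for this group.
all_failed = group_relative_advantages([0.0, 0.0, 0.0, 0.0])

print(mixed)
print(all_failed)
```

When every try scores the same, each numerator `r - mu` is zero, so the group contributes nothing to the update no matter how large the group is.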
The paper solves a big problem in fast image generators: they became quick, but they lost variety and kept producing similar pictures.
This paper shows that many hard math and AI problems can be solved with one shared idea called homotopy, where we move from an easy version of a problem to the real one step by step.
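To make the "easy version to real one" idea concrete, here is a small sketch of classic homotopy continuation for root finding. It is a textbook illustration, not the paper's method: the target equation `f`, the easy equation `g`, the step count, and all function names are assumptions chosen for the example. The blend H(x, t) = (1 - t) g(x) + t f(x) equals the easy problem at t = 0 and the real problem at t = 1, and each step reuses the previous solution as the starting guess.

```python
def f(x):
    """Target problem (assumed example): solve x^3 + 2x - 5 = 0."""
    return x**3 + 2 * x - 5

def df(x):
    return 3 * x**2 + 2

def g(x):
    """Easy problem (assumed example) with the known root x = 1."""
    return x - 1.0

def dg(x):
    return 1.0

def homotopy_root(steps=20, newton_iters=10):
    """Track the root of H(x, t) = (1 - t) * g(x) + t * f(x) from t = 0 to t = 1."""
    x = 1.0  # exact root of the easy problem at t = 0
    for k in range(1, steps + 1):
        t = k / steps
        # Newton's method on H(., t), warm-started from the previous root.
        for _ in range(newton_iters):
            h = (1 - t) * g(x) + t * f(x)
            dh = (1 - t) * dg(x) + t * df(x)
            x -= h / dh
    return x

root = homotopy_root()
print(root, f(root))
```

In this example the derivative of H with respect to x stays positive for every t, so the tracked root never splits or disappears along the path. Harder problems need more careful path tracking than this fixed-step loop.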
Large language models learn better when we spend more practice time on the right questions at the right moments.
LatentMem is a new memory system that helps teams of AI agents remember the right things for their specific jobs without overloading them with text.
This paper teaches AI how to fix broken Lean math proofs by learning from the compiler’s feedback, not just from finished, perfect proofs.
Auto-regressive video models make videos one chunk at a time but run out of GPU memory because the KV-cache grows with history.
FIRE-Bench is a new test that checks whether AI agents can fully redo real scientific discoveries, step by step, not just guess answers.
AdaptMMBench is a new test that checks if AI models know when to just look and think, and when to use extra visual tools like zooming or brightening an image.
MARS is an AI agent that runs AI research like a careful scientist and thrifty engineer at the same time.