This paper introduces SOP, a system that lets many real robots learn new skills online at the same time while keeping one shared brain (policy).
MMFormalizer is a new system that turns problems with pictures and words (like physics scenes or geometry diagrams) into strict, checkable math statements and proofs.
The authors built a simple six-agent system to see if today’s AI models could plan, run, and write a research paper mostly on their own.
Large reasoning models can often find the right math answer in their “head” before finishing their written steps, but this works best in languages with lots of training data like English and Chinese.
Large language models (LLMs) are good at many math problems but often mess up simple counting when the list gets long.
DreamStyle is a single video-stylizing model that can follow text, copy a style image, or continue from a stylized first frame—without switching tools.
MiMo-V2-Flash is a giant but efficient language model that uses a team-of-experts design to think well while staying fast.
AnyDepth is a new, simple way for a computer to tell how far things are in a picture using just one image (monocular depth).
SimpleMem is a new memory system that helps AI remember long conversations without wasting space or tokens.
VINO is a single AI model that can make and edit both images and videos by listening to text and looking at reference pictures and clips at the same time.
Talk2Move is a training recipe that lets an image editor move, rotate, and resize the exact object you mention using plain text, while keeping the rest of the picture stable.
Falcon-H1R is a small (7B) AI model that thinks really well without needing giant computers.