This paper speeds up and improves AI image editing by giving hard edits more attention and easy edits less, just like a smart coach.
This paper shows that when AI models grade university-level math proofs, they often disagree with human experts in systematic ways.
Vision-Language-Action (VLA) models for robots are powerful but too big and slow for many real-world devices.
Mobile-O is a small but smart AI that can both understand pictures and make new images, and it runs right on your phone.
This paper builds a gigantic library of video puzzles (VBVR) so AI can practice not just making pretty videos, but actually thinking through what happens over time.
ManCAR helps recommendation systems think step by step while keeping their thoughts on realistic paths, using a map of how items connect.
LLMs trained with simple rewards often latch onto just a few ways of solving problems and stop exploring, which hurts their ability to find other correct answers.
CFE-BENCH is a new, teacher-verified "Classroom Final Exam" for AI that uses real college STEM problems to test deep, step-by-step reasoning.
Hepato-LLaVA is a special AI that reads giant microscope pictures of the liver and answers medical questions about cancer.
This paper explains how AI agents remember things across long conversations and why many current tests don’t truly measure that memory.
Robots learn better when they get small hints at every step instead of only a final thumbs-up or thumbs-down.
JavisDiT++ is a new AI that makes short videos and matching sounds from a text prompt, keeping sight and sound in sync.