CL-bench is a new test that checks whether AI can truly learn new things from the information you give it right now, not just from what it memorized before.
The paper introduces PRIVASIS, a huge, fully synthetic dataset (1.4 million records) filled with realistic-looking private details, but created from scratch so it does not belong to any real person.
The paper fixes a hidden mistake many fast video generators were making when turning a "see-everything" model into a "see-past-only" model.
The paper studies how to teach a smaller language model using a bigger one by only focusing on the most useful bits instead of everything.
This paper builds a new test called AgentIF-OneDay that checks if AI helpers can follow everyday instructions the way people actually give them.
Real instructions often have logic like and first-then and if-else and this paper teaches models to notice and obey that logic.
ThinkRL-Edit teaches an image editor to think first and draw second, which makes tricky, reasoning-heavy edits much more accurate.
Unified Thinker separates “thinking” (planning) from “drawing” (image generation) so complex instructions get turned into clear, doable steps before any pixels are painted.
VINO is a single AI model that can make and edit both images and videos by listening to text and looking at reference pictures and clips at the same time.
T2AV-Compass is a new, unified test to fairly grade AI systems that turn text into matching video and audio.
IC-Effect is a new way to add special effects to existing videos by following a text instruction while keeping everything else unchanged.
Vector Prism helps computers animate SVG images by first discovering which tiny shapes belong together as meaningful parts.