This paper shows how to make diffusion language models write high‑quality text in just a few steps, which makes them much faster.
ExStrucTiny is a new benchmark (test) that checks whether AI can pull many connected facts from all kinds of documents and organize them into JSON, even when the question style and schema change.
QRRanker is a lightweight way to sort many long text chunks by how helpful they are to a question, using the model’s own attention to score relevance.
Sci-CoE is a two-stage training method that helps one language model learn to both solve science problems and check those solutions with very little labeled data.
DreamID-Omni is one model that can create, edit, and animate human-centered videos with matching voices, all in sync.
Diffusion Large Language Models (dLLMs) can write many parts of an answer at once, rather than only left to right as usual chatbots do.
This paper introduces P-GenRM, a personalized generative reward model that judges AI answers using a custom scorecard built just for each user and situation.
This paper gives language models a 'wand' to manage their own memory, instead of relying on humans to stuff the prompt for them.
GigaBrain-0.5M* is a robot brain that sees, reads, and acts, and it gets smarter by imagining the future before moving.
DeepSight is a free, all-in-one safety toolkit that both tests how models behave (DeepSafe) and looks inside them to see how they think (DeepScan).
LawThinker is a legal AI agent that double-checks every research step before using it, so small mistakes don’t snowball into big ones.
This paper shows a simple way to turn many 'too-easy' questions into harder, still-checkable ones so that AI keeps learning instead of stalling.