The paper introduces VDR-Bench, a new benchmark of 2,000 carefully built questions that cannot be answered without both seeing (images) and reading (web text).
Idea2Story is a two-stage system that first studies many accepted research papers offline and then uses that knowledge online to turn a vague idea into a full scientific plan.
Multi-step RAG systems often struggle with long documents because their memory is just a pile of isolated facts, not a connected understanding.
Capitalization tie-out checks whether a company’s ownership table (its cap table) matches what its legal documents say.
Digital humans used to just copy motions; this paper makes them think, speak, and move in sync like real people.