SPARK is a new way to train AI agents that saves compute by exploring more only at the most important moments.
LLMs are usually trained by treating every question the same and giving each one the same number of tries, which wastes compute on easy problems and neglects hard ones.
AdaReasoner teaches AI to pick the right visual tools, use them in the right order, and stop using them when they aren’t helping.
LLM agents are usually trained in a few worlds but asked to work in many different, unseen worlds, which often hurts their performance.
This paper teaches a language-model agent to look up facts in millions of scientific paper summaries and answer clear, single-answer questions.
Typhoon-S is a simple, open recipe that turns a basic language model into a helpful assistant and then teaches it important local skills, all on small budgets.
This paper teaches AI to turn simple dialogue into full movie scenes by first writing a detailed script and then filming it step by step.
SAMTok turns any object’s mask in an image into just two special “words” so language models can handle pixels like they handle text.
Academic rebuttals are not just about being polite; they are about smart, strategic persuasion under hidden information.
Small AI models often stumble when a tool call fails and then get stuck repeating bad calls instead of fixing the mistake.
Diffusion language models can write tokens in any order, but that freedom can accidentally hurt their ability to reason well.
The paper introduces Intervention Training (InT), a simple way for a language model to find and fix the first wrong step in its own reasoning using a short, targeted correction.