Parallel-Probe is a simple add-on that lets many AI “thought paths” think at once but stop early when they already agree.
Re-TRAC is a new way for AI search agents to learn from each try, write a clean summary of what happened, and then use that summary to do better on the next try.
DARC teaches big language models to get smarter by splitting training into two calm, well-organized steps instead of one chaotic loop.
EvasionBench is a new, very large dataset that helps computers spot when company leaders dodge questions during earnings call Q&A.
X-Coder shows that models can learn expert-level competitive programming using data that is 100% synthetic—no real contest problems needed.
The paper shows how a vision-language model (VLM) can train itself to be a fair judge of answers about images without using any human preference labels.