RelayGen is a training-free way to switch between a big model and a small model while one answer is being generated.
The paper introduces a new way to sample text from masked diffusion language models that is smarter and less greedy.
TSRBench is a giant test that checks if AI models can understand and reason about data that changes over time, like heartbeats, stock prices, and weather.
This paper is the first big map of how AI can fix real software problems, not just write short code snippets.