This paper teaches a language-model agent to look up facts in millions of scientific paper summaries and answer clear, single-answer questions.
This paper teaches a model to turn a question about a table into both a short answer and a clear, correct chart.
The paper teaches language models using extra 'language homework' made from the same raw text so they learn grammar and meaning, not just next-word guessing.
Large reasoning models can often find the right math answer in their βheadβ before finishing their written steps, but this works best in languages with lots of training data like English and Chinese.