Papers5

All Beginner Intermediate Advanced

All Sources arXiv

#Qwen2.5

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Intermediate

Leheng Sheng, Yongtao Zhang et al.Feb 11arXiv

Long texts overwhelm many language models, which forget important bits and slow down as the context grows.

#gated recurrent memory#update gate#exit gate

PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR

Intermediate

James Burgess, Jan N. Hansen et al.Jan 26arXiv

This paper teaches a language-model agent to look up facts in millions of scientific paper summaries and answer clear, single-answer questions.

#RLVR#search agents#PaperSearchQA

Aligning Text, Code, and Vision: A Multi-Objective Reinforcement Learning Framework for Text-to-Visualization

Intermediate

Mizanur Rahman, Mohammed Saidul Islam et al.Jan 8arXiv

This paper teaches a model to turn a question about a table into both a short answer and a clear, correct chart.

#Text-to-Visualization#Reinforcement Learning#GRPO

Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks

Beginner

Atsuki Yamaguchi, Maggie Mi et al.Jan 6arXiv

The paper teaches language models using extra 'language homework' made from the same raw text so they learn grammar and meaning, not just next-word guessing.

#language model pretraining#causal language modeling#linguistic competence

Large Reasoning Models Are (Not Yet) Multilingual Latent Reasoners

Beginner

Yihong Liu, Raoyuan Zhao et al.Jan 6arXiv

Large reasoning models can often find the right math answer in their “head” before finishing their written steps, but this works best in languages with lots of training data like English and Chinese.

#latent reasoning#chain-of-thought#multilingual LLMs