Papers (4)

#Spearman correlation

Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment

Yuming Yang, Mingyoung Lai et al. · Jan 20 · arXiv

The paper asks a simple question: Which step-by-step explanations from a teacher model actually help a student model learn to reason better?

#Rank-Surprisal Ratio · #data-student suitability · #chain-of-thought distillation


Entropy Sentinel: Continuous LLM Accuracy Monitoring from Decoding Entropy Traces in STEM

Beginner

Pedro Memoli Buffa, Luciano Del Corro · Jan 13 · arXiv

The paper introduces Entropy Sentinel, a simple way to watch how accurate an AI is by reading its “uncertainty heartbeat” during generation.

#LLM monitoring · #entropy profile · #top-k probabilities


When Reasoning Meets Its Laws

Intermediate

Junyu Zhang, Yifan Sun et al. · Dec 19 · arXiv

The paper proposes the Laws of Reasoning (LORE), simple rules that say how much a model should think and how accurate it can be as problems get harder.

#Large Reasoning Models · #Laws of Reasoning · #Compute Law


Enriching Word Vectors with Subword Information

Intermediate

Piotr Bojanowski, Edouard Grave et al. · Jul 15 · arXiv

This paper teaches computers to understand words by also looking at the smaller pieces inside words, like 'un-', 'play', and '-ing'.

#subword embeddings · #character n-grams · #skip-gram
