How I Study AI - Learn AI Papers & Lectures the Easy Way

Papers (2)

#Test-Time Training

Reinforcement Learning via Self-Distillation

Intermediate
Jonas Hübotter, Frederike Lübeck et al. · Jan 28 · arXiv

The paper teaches large language models to learn from detailed feedback (like error messages) instead of only a simple pass/fail score.

#Self-Distillation #Reinforcement Learning with Rich Feedback #SDPO

End-to-End Test-Time Training for Long Context

Intermediate
Arnuv Tandon, Karan Dalal et al. · Dec 29 · arXiv

This paper shows how a language model can keep learning while you use it, so it handles very long inputs without slowing down.

#Test-Time Training #Meta-learning #Long-context language modeling