How I Study AI - Learn AI Papers & Lectures the Easy Way

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models

This paper shows a simple way to turn many 'too-easy' questions into harder, still-checkable ones so that AI keeps learning instead of stalling.

#Reinforcement Learning with Verifiable Rewards#Compositional prompts#Sequential Prompt Composition

Not triaged yet

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Intermediate

Qi Qian, Chengsong Huang et al.Jan 7arXiv

Everyone uses tests (benchmarks) to judge how smart AI models are, but not all tests are good tests.

#LLM evaluation#benchmark quality#ranking consistency

Not triaged yet

Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory

Beginner

Mirac Suzgun, Mert Yuksekgonul et al.Apr 10arXiv

The paper introduces Dynamic Cheatsheet (DC), a simple way for language models to keep a tiny, smart notebook of useful tricks while they are being used.

#Dynamic Cheatsheet#test-time learning#memory curation

Not triaged yet

Papers3

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory