How I Study AI - Learn AI Papers & Lectures the Easy Way

Papers (1)

#Cold-start SFT

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

Intermediate
Yuchen Yan, Liang Jiang et al. | Feb 6 | arXiv

Long chains of thought make AI smarter but also slower, pricier, and limited by memory windows.

#Iterative reasoning #Reinforcement learning for LLMs #Trajectory-level optimization