DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution
IntermediateShengda Fan, Xuyan Ye et al.Jan 20arXiv
DARC teaches big language models to get smarter by splitting training into two calm, well-organized steps instead of one chaotic loop.
#DARC#self-play#curriculum learning