QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management
Intermediate · Weizhou Shen, Ziyi Yang et al. · Dec 15 · arXiv
QwenLong-L1.5 is a post-training recipe that helps AI models read and reason over very long documents by improving the training data, the training procedure, and the way the model manages memory.
#long-context reasoning #reinforcement learning #GRPO