LSRIF: Logic-Structured Reinforcement Learning for Instruction Following
IntermediateQingyu Ren, Qianyu He et al.Jan 10arXiv
Real instructions often have logic like and first-then and if-else and this paper teaches models to notice and obey that logic.
#instruction following#logical structures#parallel constraints