D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use
IntermediateBowen Xu, Shaoyu Wu et al.Feb 2arXiv
This paper fixes a common problem in reasoning AIs called Lazy Reasoning, where the model rambles instead of making a good plan.
#task decomposition#tool use#large reasoning models