ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models
IntermediateLong Lian, Sida Wang et al.Nov 24arXiv
ThreadWeaver teaches a language model to split big problems into smaller parts it can solve at the same time, like teammates working in parallel.
#adaptive parallel reasoning#fork–join#threaded inference