DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs
IntermediateShidong Cao, Hongzhan Lin et al.Jan 7arXiv
DiffCoT treats a modelβs step-by-step thinking (Chain-of-Thought) like a messy draft that can be cleaned up over time, not something fixed forever.
#Chain-of-Thought#Diffusion models#Autoregressive decoding