Recursive Think-Answer Process for LLMs and VLMs
Intermediate · Byung-Kwan Lee, Youngchae Chee et al. · Mar 2 · arXiv
This paper teaches AI models to judge how sure they are about an answer and to think again if they are not sure.
#Recursive Think–Answer · #Confidence-guided reasoning · #Reinforcement learning for LLMs