PRISM: Pushing the Frontier of Deep Think via Process Reward Model-Guided Inference
IntermediateRituraj Sharma, Weiyuan Chen et al.Mar 3arXiv
PRISM is a new way to help AI think through hard problems by checking each step, not just the final answer.
#DEEPTHINK#Process Reward Model#step-level verification