$V_1$: Unifying Generation and Self-Verification for Parallel Reasoners
IntermediateHarman Singh, Xiuyu Li et al.Mar 4arXiv
The paper shows that when a model compares two of its own answers head-to-head, it picks the right one more often than when it judges each answer alone.
#pairwise self-verification#test-time scaling#parallel reasoning