When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning
IntermediateShoubin Yu, Yue Zhang et al.Feb 9arXiv
Visual spatial reasoning often fails when a model only looks at one picture and must imagine new viewpoints.
#Adaptive Test-Time Scaling#World Models#Visual Spatial Reasoning