UniT: Unified Multimodal Chain-of-Thought Test-time Scaling
IntermediateLeon Liangyu Chen, Haoyu Ma et al.Feb 12arXiv
UniT teaches one multimodal model to think in steps with pictures and words, so it can check its own work and fix mistakes as it goes.
#Unified multimodal model#Chain-of-thought#Test-time scaling