WorldCompass: Reinforcement Learning for Long-Horizon World Models
Beginner · Zehan Wang, Tengfei Wang et al. · Feb 9 · arXiv
WorldCompass applies reinforcement learning after pretraining so that video world models follow actions more accurately while keeping visual quality high.
#world models #reinforcement learning #clip-level rollout