The paper argues that making and using pictures inside an AIโs thinking can help it reason more like humans, especially for real-world, physical and spatial problems.
This paper fixes a common problem in video-making AIs where tiny mistakes snowball over time and ruin long videos.