The paper shows that Test-Time Training (TTT) with key-value (KV) binding is not really memorizing like a notebook; it is acting like a learned linear attention layer.
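One simple special case makes the link to linear attention concrete. The sketch below is an illustration under stated assumptions, not the paper's own derivation: the inner model is a single linear map W, the inner loss is 0.5 * ||W k - v||^2, and every gradient is taken at the initial weights W0 = 0. Under those assumptions each inner step adds eta * v k^T to W, so reading out with a query gives the same result as unnormalized causal linear attention. The function names and the `eta` parameter are hypothetical.

```python
import numpy as np

def ttt_linear_outputs(K, V, Q, eta=1.0):
    """TTT with a linear fast weight, gradients evaluated at W0 = 0 (assumption)."""
    d_v, d_k = V.shape[1], K.shape[1]
    W0 = np.zeros((d_v, d_k))
    W = W0.copy()
    outs = []
    for k, v, q in zip(K, V, Q):
        # Gradient of 0.5 * ||W0 @ k - v||^2 with respect to W0.
        grad = np.outer(W0 @ k - v, k)
        # Inner-loop update that "binds" key k to value v.
        W = W - eta * grad
        # Read out with the query, using the updated fast weight.
        outs.append(W @ q)
    return np.stack(outs)

def causal_linear_attention(K, V, Q, eta=1.0):
    """Unnormalized causal linear attention: o_t = eta * sum_{i<=t} (q_t . k_i) v_i."""
    scores = np.tril(Q @ K.T)  # scores[t, i] = q_t . k_i, zero for i > t
    return eta * scores @ V

# Small random example: T tokens, key dimension d_k, value dimension d_v.
rng = np.random.default_rng(0)
T, d_k, d_v = 6, 4, 3
K = rng.normal(size=(T, d_k))
V = rng.normal(size=(T, d_v))
Q = rng.normal(size=(T, d_k))

ttt_out = ttt_linear_outputs(K, V, Q, eta=0.5)
attn_out = causal_linear_attention(K, V, Q, eta=0.5)
print(np.allclose(ttt_out, attn_out))
```

The two functions compute the same quantity in two ways, one as a sequence of gradient updates and one as masked attention scores. This only covers the linear, fixed-gradient-point case assumed here; it says nothing on its own about nonlinear inner models or gradients taken at the running weights.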
WorldWarp is a new method that turns a single photo plus a planned camera path into a long, steady, 3D-consistent video.