Generative Visual Code Mobile World Models
IntermediateWoosung Koh, Sungjun Han et al.Feb 2arXiv
This paper shows a new way to predict what a phone screen will look like after you tap or scroll: generate web code (like HTML/CSS/SVG) and then render it to pixels.
#mobile GUI#world model#vision-language model