Yume-1.5: A Text-Controlled Interactive World Generation Model
IntermediateXiaofeng Mao, Zhen Li et al.Dec 26arXiv
Yume1.5 is a model that turns text or a single image into a living, explorable video world you can move through with keyboard keys.
#interactive world generation#video diffusion#temporal-spatial-channel modeling