LingBot-World is an open-source world model that turns video generation into an interactive, real-time simulator.
This paper builds a real-time talking-listening head avatar that reacts naturally to your words, tone, nods, and smiles in about half a second.
Autoregressive (AR) models write one word at a time, which is accurate but slow, especially when your computer or GPU canβt keep many tasks in memory at once.