SPARK is a new way to train AI agents that saves compute by concentrating extra exploration on the most important moments.
LLM agents are usually trained in a few worlds but are asked to work in many different, unseen worlds, and this mismatch often hurts their performance.
This paper asks if large language models (LLMs) can act like "world models" that predict what happens next in text-based environments, not just the next word in a sentence.