This paper teaches image generators to place objects in the right spots by building a special teacher called a reward model focused on spatial relationships.
This paper teaches a language-model agent to explore smarter by combining two ways of learning (on-policy and off-policy) with a simple, self-written memory.
MAI-UI is a family of AI agents that can see, understand, and control phone and computer screens using plain language.