Phi-4-reasoning-vision-15B is a small, open-weight AI model that understands pictures and text together and is especially good at math, science, and navigating computer screens.
This paper teaches image generators to place objects in the right spots by building a reward model, a special kind of teacher, that focuses on spatial relationships.
GUI-Libra is a training recipe that helps computer-using AI agents both think carefully and click precisely on screens.
SWE-Master is a fully open, step-by-step recipe for turning a regular coding model into a strong software-fixing agent that works across many steps, files, and tests.