This paper shows a new way to help AI think through long problems faster by turning earlier text steps into small pictures the AI can reread.
Traditional supervised fine-tuning (SFT) makes a model copy one answer too exactly, which can cause overfitting to the exact wording instead of the real idea.
MiMo-V2-Flash is a giant but efficient language model that uses a team-of-experts design to think well while staying fast.
Falcon-H1R is a small (7B) AI model that thinks really well without needing giant computers.