Render-of-Thought (RoT) turns the model's step-by-step thinking from long text into slim images so the model can think faster with fewer tokens.
VQRAE is a new kind of image tokenizer that lets one model both understand images (via continuous features) and generate or reconstruct them (via discrete tokens).
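
To make the continuous-versus-discrete distinction concrete, here is a minimal sketch of generic vector quantization: each continuous feature vector is mapped to the ID of its nearest codebook entry. This is not VQRAE's actual implementation. The `quantize` function, the toy 2-D codebook, and the feature values are all assumptions made up for illustration.

```python
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def squared_distance(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def quantize(
    features: List[Vector], codebook: List[Vector]
) -> Tuple[List[int], List[Vector]]:
    """Map each continuous feature to its nearest codebook entry.

    Returns the discrete token IDs and the corresponding codebook vectors.
    (Hypothetical helper, not taken from the VQRAE paper.)
    """
    token_ids = [
        min(range(len(codebook)), key=lambda k: squared_distance(f, codebook[k]))
        for f in features
    ]
    quantized = [codebook[i] for i in token_ids]
    return token_ids, quantized


# Toy data: three codebook entries and three continuous "patch features".
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
features = [(0.9, 0.1), (0.1, 0.2), (0.2, 0.8)]

token_ids, quantized = quantize(features, codebook)
print(token_ids)  # discrete tokens, usable for generation or reconstruction
print(features)   # continuous features, usable for understanding
```

In this picture, the continuous `features` would feed an understanding model, while the integer `token_ids` would be what a generator predicts and a decoder turns back into pixels.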