This paper introduces PLaT, a way for AI to think silently in a hidden space (the brain) and only speak when needed (the mouth).
The paper asks a simple question: if a language model becomes better at step-by-step reasoning (using RLVR), do its text embeddings also get better? The short answer is no.