The paper tests a simple but bold idea: show code to AI as pictures instead of plain text, then shrink those pictures to save tokens and time.
DeepSeek-OCR 2 teaches a computer to “read” pictures of documents in a smarter order, more like how people read.
AgentOCR turns an agent’s long text history into pictures so it can remember more using fewer tokens.
Long texts are expensive for AI to read because each extra token costs a lot of compute and memory.