This paper speeds up how AI models read very long texts by carefully choosing which words (tokens) to focus on at each step.
LongCat-Flash-Thinking-2601 is a huge 560-billion-parameter Mixture-of-Experts model built to act like a careful helper that can use tools, browse, code, and solve multi-step tasks.