TSRBench is a large benchmark that checks whether AI models can understand and reason about time series, meaning data that changes over time, such as heartbeats, stock prices, and weather.
MAXS is a new way for AI agents to think a few steps ahead while using tools such as search and code, so that they make smarter choices.