AOrchestra is like a smart conductor that builds the right mini-helpers (sub-agents) on demand to solve big, multi-step tasks.
This paper builds a tough new test called O3-BENCH to check if AI can truly think with images, not just spot objects.