TRIP-Bench is a new test that checks if AI travel agents can plan real trips over many chat turns while following strict rules and changing user requests.
This paper introduces OmniAgent, a smart video-and-audio detective that actively decides when to listen and when to look.