SAW-Bench is a new test that checks if AI can understand the world from a first-person view, like wearing smart glasses.
This paper asks a simple question with big consequences: can todayβs AI models actively explore a new space and build a trustworthy internal map of it?