Cities are full of places defined by people, like schools and parks, which are hard to see clearly from space without extra clues.
OpenVoxel is a training-free way to understand 3D scenes by grouping tiny 3D blocks (voxels) into objects and giving each object a clear caption.