This paper builds a Google-for-theorems: a semantic search engine that finds exact theorems, lemmas, and propositions instead of just entire papers.
IVRA is a simple, training-free add-on that helps robot brains keep the 2D shape of pictures while following language instructions.
This paper introduces CGPT, a way to help computers find the right tables by building smarter mini-versions of tables and training with tough practice questions.
InfiniteVGGT is a streaming 3D vision system that can keep working forever on live video without running out of memory.
Most image-similarity tools only notice how things look (color, shape, class) and miss deeper, human-like connections.