ShapeR builds clean, correctly sized 3D objects from messy, casual phone or glasses videos by using images, camera poses, sparse SLAM points, and short text captions together.
SHARP turns a single photo into a 3D scene you can look around in, and it does this in under one second on a single GPU.