🎓How I Study AIHISA
📖Read
📄Papers📰Blogs🎬Courses
💡Learn
🛤️Paths📚Topics💡Concepts🎴Shorts
🎯Practice
🧩Problems🎯Prompts🧠Review
Search
How I Study AI - Learn AI Papers & Lectures the Easy Way

Papers6

AllBeginnerIntermediateAdvanced
All SourcesarXiv
#camera pose estimation

Think3D: Thinking with Space for Spatial Reasoning

Beginner
Zaibin Zhang, Yuhan Wu et al.Jan 19arXiv

Think3D lets AI models stop guessing from flat pictures and start exploring real 3D space, like walking around a room in a video game.

#Think3D#spatial reasoning#3D reconstruction

V-DPM: 4D Video Reconstruction with Dynamic Point Maps

Intermediate
Edgar Sucar, Eldar Insafutdinov et al.Jan 14arXiv

V-DPM is a new way for AI to turn a short video into a moving 3D world, capturing both the shape and the motion of everything in it.

#Dynamic Point Maps#4D reconstruction#scene flow

InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams

Intermediate
Shuai Yuan, Yantai Yang et al.Jan 5arXiv

InfiniteVGGT is a streaming 3D vision system that can keep working forever on live video without running out of memory.

#InfiniteVGGT#rolling memory#causal attention

3D-RE-GEN: 3D Reconstruction of Indoor Scenes with a Generative Framework

Intermediate
Tobias Sautter, Jan-Niklas Dihlmann et al.Dec 19arXiv

3D-RE-GEN turns a single photo of a room into a full 3D scene with separate, textured objects and a usable background.

#single-image 3D reconstruction#scene composition#context-aware inpainting

Efficiently Reconstructing Dynamic Scenes One D4RT at a Time

Intermediate
Chuhan Zhang, Guillaume Le Moing et al.Dec 9arXiv

D4RT is a new AI model that turns regular videos into moving 3D scenes (4D) quickly and accurately.

#D4RT#dynamic 4D reconstruction#query-based decoding

TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels

Intermediate
Jiahao Lu, Weitao Xiong et al.Dec 9arXiv

TrackingWorld turns a regular single-camera video into a map of where almost every pixel moves in 3D space over time.

#monocular 3D tracking#world-centric coordinates#camera pose estimation