How I Study AI - Learn AI Papers & Lectures the Easy Way

Papers (11)

#3D Gaussian Splatting

PLANING: A Loosely Coupled Triangle-Gaussian Framework for Streaming 3D Reconstruction

Intermediate
Changjian Jiang, Kerui Ren et al. · Jan 29 · arXiv

PLANING is a new way to build 3D worlds from a single moving camera by combining two kinds of primitives: sharp triangles for shape and soft Gaussians for appearance.

#Streaming 3D Reconstruction #Triangle Primitives #Neural Gaussians

Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering

Intermediate
Jieying Chen, Jeffrey Hu et al. · Jan 14 · arXiv

This paper shows how to make long, camera-controlled videos much faster by generating only a few smart keyframes with diffusion, then filling in the rest using a 3D scene and rendering.

#camera-controlled video generation #sparse keyframes #3D reconstruction

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding

Intermediate
Sheng-Yu Huang, Jaesung Choe et al. · Jan 14 · arXiv

OpenVoxel is a training-free way to understand 3D scenes by grouping tiny 3D blocks (voxels) into objects and giving each object a clear caption.

#OpenVoxel #Sparse Voxel Rasterization #training-free 3D understanding

Choreographing a World of Dynamic Objects

Intermediate
Yanzhe Lyu, Chen Geng et al. · Jan 7 · arXiv

CHORD is a new way to animate 3D scenes over time (4D) where many objects move and interact, guided only by a text prompt.

#4D generation #Rectified Flow #Score Distillation Sampling

GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction

Intermediate
Yi-Chuan Huang, Hao-Jen Chien et al. · Dec 31 · arXiv

GaMO is a new way to rebuild 3D scenes from just a few photos by expanding each photo’s edges (outpainting) instead of inventing whole new camera views.

#3D reconstruction #outpainting #multi-view diffusion

Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems

Beginner
Song Wang, Lingdong Kong et al. · Dec 30 · arXiv

Autonomous systems such as self-driving cars and drones see the world through many different sensors (cameras, LiDAR, radar, and even event cameras), and this paper lays out a roadmap for teaching them to understand space by learning from all of these together.

#Spatial Intelligence #Multi-Modal Pre-Training #Self-Supervised Learning

Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting

Intermediate
Yoonwoo Jeong, Cheng Sun et al. · Dec 24 · arXiv

This paper speeds up how 3D Gaussian Splatting scenes handle large, 512-dimensional features without throwing away important information.

#3D Gaussian Splatting #Quantile Rendering #Open-vocabulary segmentation

WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion

Beginner
Hanyang Kong, Xingyi Yang et al. · Dec 22 · arXiv

WorldWarp is a new method that turns a single photo plus a planned camera path into a long, steady, 3D-consistent video.

#Novel View Synthesis #3D Gaussian Splatting #Spatio-Temporal Diffusion

MatSpray: Fusing 2D Material World Knowledge on 3D Geometry

Intermediate
Philipp Langsteiner, Jan-Niklas Dihlmann et al. · Dec 20 · arXiv

MatSpray turns 2D guesses about what materials look like (color, shininess, metal) into a clean 3D model you can relight realistically.

#MatSpray #3D Gaussian Splatting #Gaussian Ray Tracing

Animate Any Character in Any World

Intermediate
Yitong Wang, Fangyun Wei et al. · Dec 18 · arXiv

AniX is a system that lets you place any character into any 3D world and control them with plain language, like “run forward” or “play a guitar.”

#AniX #3D Gaussian Splatting #world models

Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform

Beginner
Yuning Gong, Yifei Liu et al. · Dec 9 · arXiv

Visionary is a web-based platform that lets you view and interact with advanced 3D scenes directly in your browser with a single click.

#WebGPU #3D Gaussian Splatting #ONNX Runtime Web