Papers9

#Chamfer Distance

VecGlypher: Unified Vector Glyph Generation with Language Models

Xiaoke Huang, Bhavul Gauri et al.Feb 25arXiv

VecGlypher is a single language-model-based system that writes SVG code to draw crisp, editable letters (glyphs) directly from text descriptions or a few example images.

#VecGlypher#vector glyph generation#SVG path tokens

Not triaged yet

CADEvolve: Creating Realistic CAD via Program Evolution

Intermediate

Maksim Elistratov, Marina Barannikov et al.Feb 18arXiv

AI models that make CAD designs used to learn mostly from simple “draw-then-extrude” examples, so they struggled with real, complex parts.

#CAD#CadQuery#Image2CAD

Not triaged yet

Stroke3D: Lifting 2D strokes into rigged 3D model via latent diffusion models

Intermediate

Ruisi Zhao, Haoren Zheng et al.Feb 10arXiv

Stroke3D lets you draw simple 2D stick-figure strokes plus a short text, and it builds a ready-to-animate 3D model with a skeleton and textures.

#Stroke3D#rigged 3D generation#skeleton-first pipeline

Not triaged yet

PLANING: A Loosely Coupled Triangle-Gaussian Framework for Streaming 3D Reconstruction

Intermediate

Changjian Jiang, Kerui Ren et al.Jan 29arXiv

PLANING is a new way to build 3D worlds from a moving single camera by combining two kinds of pieces: sharp triangles for shape and soft Gaussians for looks.

#Streaming 3D Reconstruction#Triangle Primitives#Neural Gaussians

Not triaged yet

Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis

Intermediate

Hongyuan Chen, Xingyu Chen et al.Jan 20arXiv

Motion 3-to-4 turns a single regular video into a moving 3D object over time (a 4D asset) by first getting the object’s shape and then figuring out how every part moves.

#4D synthesis#monocular video#motion reconstruction

Not triaged yet

3D-RE-GEN: 3D Reconstruction of Indoor Scenes with a Generative Framework

Intermediate

Tobias Sautter, Jan-Niklas Dihlmann et al.Dec 19arXiv

3D-RE-GEN turns a single photo of a room into a full 3D scene with separate, textured objects and a usable background.

#single-image 3D reconstruction#scene composition#context-aware inpainting

Not triaged yet

CRISP: Contact-Guided Real2Sim from Monocular Video with Planar Scene Primitives

Intermediate

Zihan Wang, Jiashun Wang et al.Dec 16arXiv

CRISP turns a normal phone video of a person into a clean 3D world and a virtual human that can move in it without breaking physics.

#real-to-sim#human-scene interaction#planar primitives

Not triaged yet

Feedforward 3D Editing via Text-Steerable Image-to-3D

Intermediate

Ziqi Ma, Hongqiao Chen et al.Dec 15arXiv

Steer3D lets you change a 3D object just by typing what you want, like “add a roof rack,” and it does it in one quick pass.

#3D editing#image-to-3D#ControlNet

Not triaged yet

SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling

Intermediate

Elisabetta Fedele, Francis Engelmann et al.Dec 5arXiv

SpaceControl lets you steer a powerful 3D generator with simple shapes you draw, without retraining the model.

#3D generative modeling#test-time guidance#latent space intervention

Not triaged yet