My research focuses on geometry modeling, machine learning, and shape analysis.
I am particularly interested in building generative models for fine-grained, controllable 3D shape synthesis
guided by geometry, images, or text. I am also interested in image and video foundation models for customized
content generation.
This thesis introduces several feed-forward approaches for generating high-quality 3D shapes, which aim to address the challenges of
structure-aware 3D content creation and provide interactive and controllable 3D modeling experience.
ART-DECO is a 3D detailizer that instantly transforms coarse 3D shape proxies into high-quality, textured 3D assets guided by text prompts.
Trained via SDS from MVDream, ART-DECO enables interactive modeling, style-consistent details, and creative structure control.
GenVDM is the first method for generating Vector Displacement Maps (VDMs): parameterized geometric stamps commonly used in 3D modeling.
It generates multi-view normal maps from a single input image and then reconstructs a VDM via a novel reconstruction pipeline.
DECOLLAGE is a learning-based method that enables novice users to add geometric details to a coarse 3D shape by selecting regions on it and
assigning them the styles of exemplar shapes with compelling geometric details.
DAE-Net is an unsupervised 3D shape co-segmentation method that learns a set of deformable part templates from a shape collection, which
yields high-quality, consistent, and fine-grained 3D shape co-segmentation.
The first example-based deep generative neural network for generating a high-resolution textured 3D shape through geometry detailization
and conditional texture generation applied to an input coarse voxel shape.
D2CSG is a neural model composed of two dual and complementary network branches, with dropouts, for unsupervised
learning of compact constructive solid geometry (CSG) representations of 3D CAD shapes.
The first deep neural implicit model for general-purpose, unpaired shape-to-shape translation, in both 2D and 3D domains.
UNIST can learn both style-preserving content alteration and content-preserving style transfer.
CoralNet is a cloud-based website and platform for manual, semi-automatic and automatic analysis of coral
reef images. Users access CoralNet through optimized web-based workflows for common tasks, other systems
can interface through API.
Composing volumetric-based generative model with topology-awareness auto-encoder allows them to learn
high-level topological properties such as genus and connectivity for 3D shape reconstruction.
* CoralNet is currently one of the largest platforms for
coral reef image annotation and analysis, supporting over 7,000 data sources, 5.4 million images, and 289 million point annotations contributed by researchers worldwide
Invited talks
Mar 2026
3DV 2026 Nectar Track, Vancouver - ART-DECO: Arbitrary Text Guidance for 3D Detailizer Construction