Infinite Script

Research Fellow · MMLab@NTU

Haozhe Xie 谢浩哲

General Inquiries: root [at] haozhexie [dot] com · Academic Matters: academic [at] haozhexie [dot] com

Research Interests: 3D Vision, Robotics
Biography

I am currently a Research Fellow at MMLab@NTU, Nanyang Technological University, working with Prof. Ziwei Liu. Previously, I was a Senior Research Scientist at Tencent AI Lab from 2021 to 2023.

I received my Ph.D. from the VILab at Harbin Institute of Technology in 2021, supervised by Prof. Hongxun Yao. During my Ph.D., I also interned at SenseTime Research, mentored by Dr. Wenxiu Sun.

2026
arXiv 2601.22153 🏆 CVPRW'26 Best Paper
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation
  • Haozhe Xie*
  • Beichen Wen*
  • Jiarui Zheng
  • Zhaoxi Chen
  • Fangzhou Hong
  • Haiwen Diao
  • Ziwei Liu
arXiv 2603.19227
Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer
  • Chenyang Gu*
  • Mingyuan Zhang*
  • Haozhe Xie*
  • Zhongang Cai
  • Lei Yang
  • Ziwei Liu
ECCV 2026
MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction
  • Haitian Li*
  • Haozhe Xie*
  • Junxiang Xu
  • Beichen Wen
  • Fangzhou Hong
  • Ziwei Liu
ECCV 2026
HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human–Scene Interactions
  • Yukang Cao
  • Haozhe Xie
  • Fangzhou Hong
  • Long Zhuo
  • Zhaoxi Chen
  • Liang Pan
  • Ziwei Liu
ECCV 2026
InfiniteDance: Scalable 3D Dance Generation Towards in-the-wild Generalization
  • Ronghui Li
  • Zhongyuan Hu
  • Siyao Li
  • Youliang Zhang
  • Haozhe Xie
  • Mingyuan Zhang
  • Jie Guo
  • Xiu Li
  • Ziwei Liu
TPAMI 48(1): 312-328
Compositional Generative Model of Unbounded 4D Cities
  • Haozhe Xie
  • Zhaoxi Chen
  • Fangzhou Hong
  • Ziwei Liu
IJCV 134(1): 29
CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation
  • Yukang Cao*
  • Xinying Guo*
  • Mingyuan Zhang
  • Haozhe Xie
  • Chenyang Gu
  • Ziwei Liu
2025
arXiv 2505.05474
3D Scene Generation: A Survey
  • Beichen Wen*
  • Haozhe Xie*
  • Zhaoxi Chen
  • Fangzhou Hong
  • Ziwei Liu
CVPR 2025
Generative Gaussian Splatting for Unbounded 3D City Generation
  • Haozhe Xie
  • Zhaoxi Chen
  • Fangzhou Hong
  • Ziwei Liu
CVPR 2025
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
  • Zhaoxi Chen
  • Jiaxiang Tang
  • Yuhao Dong
  • Ziang Cao
  • Fangzhou Hong
  • Yushi Lan
  • Tengfei Wang
  • Haozhe Xie
  • Tong Wu
  • Shunsuke Saito
  • Liang Pan
  • Dahua Lin
  • Ziwei Liu
ICLR 2025
DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes
  • Hengwei Bian
  • Lingdong Kong
  • Haozhe Xie
  • Liang Pan
  • Yu Qiao
  • Ziwei Liu
AAAI 2025
Multi-view Consistent 3D Panoptic Scene Understanding
  • Xianzhu Liu
  • Xin Sun
  • Haozhe Xie
  • Zonglin Li
  • Ru Li
  • Shengping Zhang
IJCV 133(3): 1306-1325
2D Semantic-Guided Semantic Scene Completion
  • Xianzhu Liu
  • Haozhe Xie
  • Shengping Zhang
  • Hongxun Yao
  • Rongrong Ji
  • Liqiang Nie
  • Dacheng Tao
2024
CVPR 2024
CityDreamer: Compositional Generative Model of Unbounded 3D Cities
  • Haozhe Xie
  • Zhaoxi Chen
  • Fangzhou Hong
  • Ziwei Liu
2023
IJCV 131(9): 2425-2445
Learning Geometric Transformation for Point Cloud Completion
  • Shengping Zhang
  • Xianzhu Liu
  • Haozhe Xie
  • Liqiang Nie
  • Huiyu Zhou
  • Dacheng Tao
  • Xuelong Li
2021
ACM Multimedia 2021
Long-Range Feature Propagating for Natural Image Matting
  • Qinglin Liu
  • Haozhe Xie
  • Shengping Zhang
  • Bineng Zhong
  • Rongrong Ji
PhD Thesis
3D Scene and Object Reconstruction from Multiple Sources and Viewpoints
  • Haozhe Xie
CVPR 2021
Efficient Regional Memory Network for Video Object Segmentation
  • Haozhe Xie
  • Hongxun Yao
  • Shangchen Zhou
  • Shengping Zhang
  • Wenxiu Sun
2020
ECCV 2020
GRNet: Gridding Residual Network for Dense Point Cloud Completion
  • Haozhe Xie
  • Hongxun Yao
  • Shangchen Zhou
  • Jiageng Mao
  • Shengping Zhang
  • Wenxiu Sun
IJCV 128(12): 2919-2935
Pix2Vox++: Multi-scale Context-aware 3D Object Reconstruction from Single and Multiple Images
  • Haozhe Xie
  • Hongxun Yao
  • Shengping Zhang
  • Shangchen Zhou
  • Wenxiu Sun
2019
ICCV 2019
Pix2Vox: Context-aware 3D Reconstruction from Single and Multi-view Images
  • Haozhe Xie
  • Hongxun Yao
  • Xiaoshuai Sun
  • Shangchen Zhou
  • Shengping Zhang
Research Experience

Research Fellow
Mar 2023 - Present | MMLab@NTU, Nanyang Technological University
Supervised by Prof. Ziwei Liu. Research on 3D generation and vision-language-action (VLA) models, with CityDreamer and DynamicVLA as representative works.

Senior Research Scientist
Aug 2021 - Mar 2023 | Tencent AI Lab
Worked with Dr. Hong Shang. Awards: Outstanding Contributor (2022H1) and Excellent Individual (2022H2).

Research Intern
Mar 2019 - Aug 2021 | SenseTime Research
Mentored by Dr. Wenxiu Sun. Award: Outstanding Intern (2019H2).

Invited Talks
MMLab@NTU Special Session on Embodied Intelligence. WiseModel, Online. [News]
Toward World Models: From 3D to 4D City Generation. 3D Computer Vision (3D 视觉工坊), Online. [Video playback]
3D Object and Scene Reconstruction Meets Neural Networks. Faculty of Computing, Harbin Institute of Technology, Harbin, China. [News]
Deep Learning Fundamentals. Zhejiang Institute of Research and Innovation, The University of Hong Kong, Hangzhou, China. [News]
Computer Fundamentals. Hangzhou No.14 High School, Hangzhou, China.
Data Structure. Hangzhou No.14 High School, Hangzhou, China.
Academic Services
NeurIPS, ICLR
CVPR, ICCV, ECCV, ICML, SIGGRAPH Asia, AAAI, Eurographics, 3DV, WACV
TPAMI, IJCV, TVCG, TOG, TIP, TMM, PR, CSUR
Teaching
NTU AI6126: Advanced Computer Vision — Teaching Assistant
HIT CS32261: Audio-Visual Signal Processing — Teaching Assistant