MiZhenxing/One4D

One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control

Zhenxing Mi, Yuxin Wang, Dan Xu
The Hong Kong University of Science and Technology (HKUST)
Accepted to ECCV 2026

cut_1.mp4

News

  • 2026.06: One4D has been accepted to ECCV 2026.

📝 Abstract

One4D is a unified framework for 4D generation and reconstruction. Through Unified Masked Conditioning (UMC), it can seamlessly transition between 4D generation from a single image, 4D reconstruction from a full video, mixed generation and reconstruction from sparse frames, and 4D generation from a text prompt. Decoupled LoRA Control (DLC) employs two modality-specific LoRA adapters to form decoupled computation branches for RGB frames and pointmaps. The branches are connected by lightweight, zero-initialized control links that gradually learn mutual pixel-level consistency. With DLC, One4D produces high-quality RGB frames and accurate pointmaps across both generation and reconstruction tasks.

🧠 Methodology

Unified Framework

Figure 1: The One4D Unified Framework architecture.

  • 🎛️ Unified Masked Conditioning (UMC): Enables seamless transitions between 4D generation from a single image, 4D reconstruction from a full video, and mixed generation and reconstruction from sparse frames, all with a single unified model.
  • 🧩 Decoupled LoRA Control (DLC): Decouples RGB and XYZ (pointmap) computation into separate LoRA branches to minimize interference, while maintaining pixel-wise cross-modal control between them.
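
To make the idea behind UMC more concrete, here is a minimal sketch of how one per-frame mask could describe every task. It is an illustration only and not the released One4D code: the function names, the task strings, the choice of frame 0 as the conditioning frame for single-image input, and the all-zero mask for text-only input are all assumptions.

```python
def build_condition_mask(num_frames, observed_frames):
    """Return a per-frame 0/1 mask (hypothetical helper).

    1 = the frame is provided as a condition (to be reconstructed),
    0 = the frame is unknown and must be generated.
    """
    observed = set(observed_frames)
    out_of_range = sorted(i for i in observed if not 0 <= i < num_frames)
    if out_of_range:
        raise ValueError(f"frame indices out of range: {out_of_range}")
    return [1 if i in observed else 0 for i in range(num_frames)]


def mask_for_task(task, num_frames, sparse_frames=()):
    """Map a task name to a conditioning mask (task names are assumptions)."""
    if task == "image_to_4d":
        observed = [0]                  # assumption: the image is the first frame
    elif task == "video_to_4d":
        observed = range(num_frames)    # every frame is observed
    elif task == "sparse_to_4d":
        observed = sparse_frames        # only the given frames are observed
    elif task == "text_to_4d":
        observed = []                   # assumption: no frame is observed
    else:
        raise ValueError(f"unknown task: {task}")
    return build_condition_mask(num_frames, observed)
```

For example, `mask_for_task("sparse_to_4d", 6, sparse_frames=[0, 3, 5])` marks frames 0, 3, and 5 as given and leaves the remaining frames to be generated. Because only the mask changes between tasks, the same model can in principle serve all of them.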

Architecture Comparison

Figure 2: Comparison of Decoupled LoRA Control against other architectures.
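
The following toy sketch illustrates why zero-initialized control links are a gentle way to couple two LoRA branches: at initialization each branch behaves exactly as if the other did not exist, and cross-modal influence appears only once the link weights move away from zero. This is plain Python on small vectors, not the One4D implementation. The class names, the single-layer structure, the additive form of the link, and the zero-initialized LoRA up-projection are assumptions made for illustration.

```python
import random


def matvec(matrix, vector):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def add(a, b):
    """Element-wise sum of two vectors."""
    return [x + y for x, y in zip(a, b)]


def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def zeros(rows, cols):
    return [[0.0] * cols for _ in range(rows)]


class LoRABranch:
    """Shared frozen weight plus a modality-specific low-rank update (hypothetical)."""

    def __init__(self, shared_weight, rank, rng):
        dim = len(shared_weight)
        self.w = shared_weight                 # shared by both branches
        self.a = rand_matrix(rank, dim, rng)   # LoRA down-projection
        self.b = zeros(dim, rank)              # LoRA up-projection, zero-init (assumed)

    def __call__(self, x):
        return add(matvec(self.w, x), matvec(self.b, matvec(self.a, x)))


class ControlLink:
    """Zero-initialized linear map carrying features from one branch to the other."""

    def __init__(self, dim):
        self.w = zeros(dim, dim)

    def __call__(self, h):
        return matvec(self.w, h)


def dlc_layer(rgb_x, xyz_x, rgb_branch, xyz_branch, xyz_to_rgb, rgb_to_xyz):
    """One toy layer: run both branches, then exchange features through the links."""
    rgb_h = rgb_branch(rgb_x)
    xyz_h = xyz_branch(xyz_x)
    rgb_out = add(rgb_h, xyz_to_rgb(xyz_h))
    xyz_out = add(xyz_h, rgb_to_xyz(rgb_h))
    return rgb_out, xyz_out
```

With freshly constructed links, `dlc_layer` returns the same values as calling each branch on its own input, so the coupling starts as a no-op and can be learned gradually.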

🎨 Results Showcase

1. Single Image to 4D

Generating a consistent 4D scene from a single input image.

cut_2.mp4
cut_6.mp4
cut_5.mp4
cut_4.mp4
cut_3.mp4

2. Sparse Frames to 4D

Mixed generation and reconstruction of a 4D scene given only a few sparse frames.

cut_13.mp4
cut_12.mp4
cut_11.mp4
cut_8.mp4
cut_7.mp4

3. Full Video to 4D

High-fidelity reconstruction from a full video input.

cut_16.mp4
cut_15.mp4
cut_14.mp4
cut_10.mp4
cut_9.mp4

4. Text to 4D

Generating a consistent 4D scene from a text prompt alone.

cut_19.mp4
cut_20.mp4

📖 BibTeX

If you find our work useful for your research, please consider citing us:

@inproceedings{mi2026one4d,
  title={One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control},
  author={Mi, Zhenxing and Wang, Yuxin and Xu, Dan},
  booktitle={European Conference on Computer Vision (ECCV)},
  year={2026}
}
