Jie-Ying Lee 李杰穎

Research Assistant @ NYCU CS
Software Engineer @ Google Pixel Camera

CV / Scholar / LinkedIn / GitHub / X / Threads / Blog

About Me

I’m a research assistant in Computer Science at National Yang Ming Chiao Tung University, working with Prof. Yu-Lun Liu, and a Software Engineer on Google’s Pixel Camera Team. I work on 3D scene synthesis, generative models for vision, and embodied AI, particularly focusing on Neural Radiance Fields, 3D Gaussian Splatting, vision-language navigation, and on-device perception.

I received my B.S. in Computer Science from National Yang Ming Chiao Tung University, with an exchange semester at ETH Zurich. My industry experience includes internships at Google (Pixel Camera Team), Microsoft, and Appier.

I am actively seeking research collaborations. If you are interested in working with me, don’t hesitate to reach out.

Google SWE (2025 - Present)

NYCU Research Assistant (2025 - Present)

NYCU B.S. in Computer Science (2021 - 2025)

ETH Zurich Exchange Student (2024 - 2025)

News

Jun. 2026: Skyfall-GS accepted to ECCV 2026
Feb. 2026: Pantheon360 accepted to CVPR 2026
Sep. 2025: Joined Google as a Software Engineer on the Pixel Camera Team and joined NYCU as a research assistant, working with Prof. Yu-Lun Liu
Jul. 2025: See, Point, Fly accepted to CoRL 2025
Jun. 2025: LightsOut accepted to ICCV 2025
Mar. 2025: Two papers accepted to CVPR 2025: AuraFusion360 and SpectroMotion
Sep. 2024: Exchange semester at ETH Zurich, D-INFK (blog post)
Jul. 2024: BEVGaussian placed 3rd in the NYCU CS Undergraduate Research Competition
Jun. 2024: Awarded NSTC Research Grant for University Students
May 2024: BoostMVSNeRFs accepted to SIGGRAPH 2024

Publications

Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery
Jie-Ying Lee, Yi-Ruei Liu, Shr-Ruei Tsai, Wei-Cheng Chang, Chung-Ho Wu, Jiewen Chan, Zhenjun Zhao, Chieh Hubert Lin, Yu-Lun Liu
ECCV, 2026
project page / arXiv / pdf / code / video

We present Skyfall-GS, a framework that synthesizes photorealistic, city-block-scale 3D urban scenes from satellite imagery using diffusion models, eliminating the need for expensive 3D scanning and manual annotation while enabling real-time exploration.

BRDFusion: Physics Meets Generation for Urban Scene Inverse Rendering
Yi-Ruei Liu, Jie-Ying Lee, Zheng-Hui Huang, Yu-Lun Liu†, Chih-Hao Lin† (†Corresponding Author)
arXiv, 2026
project page / arXiv / pdf / code

We present BRDFusion, an inverse rendering framework for urban scenes that unifies physics-based rendering with generative diffusion priors to recover explicit material and lighting properties from video, enabling high-quality relighting and simulation.

Pantheon360: Taming Digital Twin Generation via 3D-Aware 360° Video Diffusion
Ting-Hsuan Chen*, Ying-Huan Chen*, Tao Tu, Jie-Ying Lee, Cho-Ying Wu, Fangzhou Lin, Hengyuan Zhang, David Paz, Xinyu Huang, Yuliang Guo†, Yu-Lun Liu†, Yue Wang†, Liu Ren (*Equal Contribution, †Corresponding Author)
CVPR, 2026
project page / arXiv

We present Pantheon360, a controllable 360° video diffusion framework for digital twin generation that synthesizes high-fidelity videos from sparse 360° inputs, letting the diffusion model focus on photorealistic texture refinement while a 3D Cache enforces global geometric consistency.

LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal
Shr-Ruei Tsai, Wei-Cheng Chang, Jie-Ying Lee, Chih-Hai Su, Yu-Lun Liu
ICCV, 2025
project page / arXiv / code / video / demo

We present LightsOut, a diffusion-based outpainting framework that enhances lens flare removal by reconstructing off-frame light sources. Our approach combines a multitask regression module with LoRA fine-tuned diffusion models to produce realistic and physically consistent results.

See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation
Chih-Yao Hu*, Yang-Sen Lin*, Yuna Lee, Chih-Hai Su, Jie-Ying Lee, Shr-Ruei Tsai, Chin-Yang Lin, Kuan-Wen Chen, Tsung-Wei Ke, Yu-Lun Liu (*Equal Contribution)
CoRL, 2025
project page / arXiv / code / video

We present See, Point, Fly (SPF), a training-free framework for aerial vision-and-language navigation. By leveraging vision-language models and reformulating navigation as a 2D spatial grounding task, SPF enables universal unmanned aerial navigation without task-specific training.

AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting
Chung-Ho Wu*, Yang-Jung Chen*, Ying-Huan Chen, Jie-Ying Lee, Bo-Hsu Ke, Chun-Wei Tuan Mu, Yi-Chuan Huang, Chin-Yang Lin, Min-Hung Chen, Yen-Yu Lin, Yu-Lun Liu (*Equal Contribution)
CVPR, 2025
project page / arXiv / code / video

We introduce AuraFusion360, a reference-based 360° scene inpainting method with three key innovations: depth-aware occlusion identification, Adaptive Guided Depth Diffusion for zero-shot point placement, and SDEdit-based enhancement for multi-view coherence.

SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes
Cheng-De Fan, Chen-Wei Chang, Yi-Ruei Liu, Jie-Ying Lee, Jiun-Long Huang, Yu-Chee Tseng, Yu-Lun Liu
CVPR, 2025
project page / arXiv / code / video

We present SpectroMotion, the first 3D Gaussian Splatting method capable of reconstructing photorealistic dynamic specular scenes. By combining 3DGS with physically-based rendering and deformation fields, we achieve high-quality synthesis of challenging real-world dynamic reflective surfaces.

BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis in Large-scale Scenes
Chih-Hai Su*, Chih-Yao Hu*, Shr-Ruei Tsai*, Jie-Ying Lee*, Chin-Yang Lin, Yu-Lun Liu (*Equal Contribution)
SIGGRAPH, 2024
project page / arXiv / code / video

We present BoostMVSNeRFs, a method that enhances rendering quality for MVS-based NeRFs in large-scale scenes. Our approach addresses key limitations including restricted viewport coverage and artifacts from limited input views, enabling generalizable view synthesis in complex environments.

Service

Reviewer: ECCV, IJCV
Teaching Assistant: Image and Video Generation, NYCU (2025 Fall)

Misc.

Beyond research, I’m passionate about staying active through badminton and hip-hop dance. I also enjoy capturing moments through photography.

Music-wise, I’m into Taiwanese indie and hip-hop, frequently listening to artists like Gummy B, 草東沒有派對 (No Party For Cao Dong), and 國蛋 GorDoN.