Shen Zheng 「郑深」

I am a second-year PhD student at the Robotics Institute, Carnegie Mellon University (CMU), advised by Dr. Srinivasa Narasimhan. I have interned at Waymo and Momenta, and worked at Lucid Motors. I completed my M.S. in Computer Vision (MSCV) at CMU, and earned my B.S. in Mathematics from Wenzhou-Kean University (WKU), where I worked with Dr. Gaurav Gupta.

Email1: shenzhen@andrew.cmu.edu

Email2: lebronshenzheng@gmail.com

Resume  /  Google Scholar  /  Github  /  Leetcode

profile photo
Research Areas

My current research focuses on semantic-aware image warping and contrastive learning (e.g., WarpI2I, Instance-Warp, TPSeNCE) to enhance visual details in generative models and improve the understanding of small and long-tail objects (e.g., ROADWork) in discriminative models.

My earlier works focused on image restoration and enhancement (e.g., SGZ, LLIE_Survey).

Selected Publications

WarpI2I: Image Warping for Image-to-Image Translation
Shen Zheng, Anurag Ghosh, Gaurav Parmar, Srinivasa Narasimhan
Under Review

Motivation: Low-capacity image-to-image translation models often struggle to preserve fine details because they compress high-resolution inputs into a spatially downsampled latent space.

Solution: We warp the input image to enlarge salient regions (e.g., objects, faces, eyes) so they are better preserved during image-to-image translation in the compressed latent representation.

ROADWork Dataset: Learning to Recognize, Observe, Analyze and Drive Through Work Zones
Anurag Ghosh, Shen Zheng, Robert Tamburo, Juan R. Alvarez Padilla, Hailiang Zhu, Michael Cardei, Nicholas Dunn, Christoph Mertz, Srinivasa Narasimhan
ICCV 2025
[Paper] [Webpage] [GitHub]

Motivation: Navigating work zones is challenging due to the lack of large-scale datasets capturing their complexity and variability.

Solution: We introduce ROADWork, a large-scale open-source dataset and benchmark with fine-grained annotations and scene descriptions to support robust perception and navigation in complex construction zones.

Instance-Warp: Saliency Guided Image Warping for Unsupervised Domain Adaptation
Shen Zheng★, Anurag Ghosh★, Srinivasa Narasimhan
WACV 2025
[Paper] [Webpage] [Code]

Motivation: In domain adaptation, backgrounds occupy more pixels and exhibit larger cross-domain variations, making them harder to adapt, while foreground objects occupy fewer pixels and are less diverse, thus easier to adapt.

Solution: We warp the image at instance-level to oversample easier-to-adapt foreground objects and undersample harder-to-adapt background regions to improve feature representation in domain adaptation.

TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in Rain
Shen Zheng, Changjie Lu, Srinivasa Narasimhan
WACV 2024
[Paper] [Webpage] [Code] [Slides] [Poster]

Motivation: Previous image-to-image translation methods produce artifacts and distortions, and lack control over the amount of rain generated.

Solution: Introduce a Triangular Probability Similarity (TPS) loss to minimize the artifacts and distortions. Propose a Semantic Noise Contrastive Estimation (SeNCE) strategy to stabilize the amounts of generated rain. Show that realistic rain generation benefits deraining and object detection in rain.

Low-Light Image Enhancement: A Comprehensive Survey and Beyond
Shen Zheng, Yiling Ma, Jinqian Pan, Changjie Lu, Gaurav Gupta

[Paper] [Code]

Motivation: Existing LLIE datasets focus on either overexposure or underexposure, not both, and usually feature minimally degraded images captured from static positions.

Solution: Present a comprehensive survey of low-light image enhancement (LLIE). Propose the SICE_Grad and SICE_Mix image datasets, which include images with both overexposure and underexposure. Introduce Night Wenzhou, a large-scale, high-resolution video dataset captured in fast motion with diverse illuminations and degradation.

PointNorm: Dual Normalization is All You Need for Point Cloud Analysis
Shen Zheng, Jinqian Pan, Changjie Lu, Gaurav Gupta
IJCNN 2023 (Oral Presentation)
[Paper] [Webpage] [Code] [Slides]

Motivation: Current point cloud analysis methods struggle with irregular (i.e., unevenly distributed) point clouds.

Solution: PointNorm, a point cloud analysis network with a DualNorm module (Point Normalization & Reverse Point Normalization) that leverages local mean and global standard deviation.

Semantic-Guided Zero-Shot Learning for Low-Light Image/Video Enhancement
Shen Zheng, Gaurav Gupta
WACV 2022
[Paper] [Webpage] [Code] [Slides]

Motivation: Current low-light image enhancement methods cannot handle uneven illuminations, is computationally inefficient, and fail to preserve the semantic information.

Solution: Introduce SGZ, a zero-shot low-light image enhancement framework with pixel-wise light deficiency estimation, parameter-free recurrent image enhancement, and unsupervised semantic segmentation.

AS-IntroVAE: Adversarial Similarity Distance Makes Robust IntroVAE
Changjie Lu, Shen Zheng, Zirui Wang, Omar Dib, Gaurav Gupta
ACML 2022
[Paper] [Code] [Slides]

Motivation: Generative models experience posterior collapse and vanishing gradient due to no effective metric for real-fake image evaluation.

Solution: Propose Adversarial Similarity Distance Introspective Variational Autoencoder (AS-IntroVAE), which can address the posterior collapse and the vanishing gradient problem in image generation in one go.

Professional Experiences
Perception Software Engineer (Intern) at Waymo

Mentor: Guohao Zhang

Improved Online HD Map Construction using long-term and short-term memory fusion.

Perception Software Engineer (Full-Time) at Lucid Motors

Director: Dr. Feng Guo

Working as a perception software engineer in the ADAS perception team responsible for ADAS parking, traffic light detection, and blockage detection.

Perception Engineer (Intern) at Momenta

Director: Dr. Wangjiang Zhu

Responsible for long-tailed data augmentation, training data auto-labeling and cleaning, and model evaluation for traffic light detection algorithms.

Services
Technical Program Committee:
WCCI 2024

Conference Reviewers:
CVIP (2021, 2022), AAAI (2022), IJCNN (2023, 2024, 2025), WACV (2023, 2024, 2025), ECCV (2024), CVPR (2025,2026), ICCV (2025)

Journal Reviewers:
TNNLS, IJCV, TCSVT, ESWA, EAAI, JVCIR, Neurocomputing
Skills
Programming Languages:
Python, R, Java, C++, Matlab, HTML, Mathematica, Shell, LaTeX, Markdown

Frameworks & Platforms:
Pytorch, TensorFlow, Keras, Ubuntu, Docker, Git, ONNX, CUDA

Libraries:
Scikit-Learn, SciPy, NumPy, OpenCV, Matplotlib, Pandas

Fun Facts
Sports:
Basketball, Table Tennis, Swimming, Cycling, Hiking, Weightlifting

Games:
DOTA2, AOE2, Warcraft III

Beliefs: