Shen Zheng

Shen Zheng 「郑深」

I am a second-year PhD student at the Robotics Institute, Carnegie Mellon University (CMU), advised by Dr. Srinivasa Narasimhan. I have interned at Waymo and Momenta, and worked at Lucid Motors. I completed my M.S. in Computer Vision (MSCV) at CMU, and earned my B.S. in Mathematics from Wenzhou-Kean University (WKU), where I worked with Dr. Gaurav Gupta.

Email1: shenzhen@andrew.cmu.edu

Email2: lebronshenzheng@gmail.com

Google Scholar / Github / 知乎

Research Areas

Modern deep learning models are inefficient: they are trained on many common-scene images with large uninformative regions.

My research addresses this inefficiency from two angles:

Data efficiency: synthesizing or collecting long-tail scenes (e.g., TPSeNCE, ROADWork)
Compute efficiency: focusing on salient regions to reduce spatial redundancy (e.g., Instance-Warp, WarpI2I, MapThunder),

My earlier works focused on image restoration and enhancement (e.g., SGZ, LLIE_Survey).

Selected Publications

	MapThunder: Fewer Faster Better Online Vectorized HD Map Construction Shen Zheng, et al. Coming Soon! We propose an online vectorized HD map construction method with fewer tokens, faster inference, and better accuracy, generalizing to VLM, VLA, and virtual cities.
	WarpI2I: Image Warping for Image-to-Image Translation Shen Zheng, Anurag Ghosh, Gaurav Parmar, Srinivasa Narasimhan ECCV 2026 [Webpage] [GitHub] We warp the input image to enlarge salient regions (e.g., faces, eyes, objects) to better preserve fine details in the compressed latent space during I2I.
	ROADWork Dataset: Learning to Recognize, Observe, Analyze and Drive Through Work Zones Anurag Ghosh, Shen Zheng, Robert Tamburo, Juan R. Alvarez Padilla, Hailiang Zhu, Michael Cardei, Nicholas Dunn, Christoph Mertz, Srinivasa Narasimhan ICCV 2025 [Paper] [Webpage] [GitHub] We introduce ROADWork, a large-scale open-source dataset and benchmark with fine-grained annotations and scene descriptions for driving in work zones.
	Instance-Warp: Saliency Guided Image Warping for Unsupervised Domain Adaptation Shen Zheng★, Anurag Ghosh★, Srinivasa Narasimhan WACV 2025 [Paper] [Webpage] [Code] We warp the image at instance-level to oversample foreground objects and undersample background regions to improve domain adaptation.
	TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in Rain Shen Zheng, Changjie Lu, Srinivasa Narasimhan WACV 2024 [Paper] [Webpage] [Code] [Slides] [Poster] We introduce TPSeNCE, a rain generation framework with Triangular Probability Similarity (TPS) and Semantic Noise Contrastive Estimation (SeNCE) for realistic rainy scene synthesis.
	Low-Light Image Enhancement: A Comprehensive Survey and Beyond Shen Zheng, Yiling Ma, Jinqian Pan, Changjie Lu, Gaurav Gupta [Paper] [Code] Present a comprehensive survey of low-light image enhancement (LLIE) and introduce Night Wenzhou, a large-scale, high-resolution video dataset captured in fast motion with diverse illuminations and degradation.
	PointNorm: Dual Normalization is All You Need for Point Cloud Analysis Shen Zheng, Jinqian Pan, Changjie Lu, Gaurav Gupta IJCNN 2023 (Oral Presentation) [Paper] [Webpage] [Code] [Slides] PointNorm, a point cloud analysis network with a DualNorm module that leverages local mean and global standard deviation to address irregular (i.e., unevenly distributed) point clouds.
	Semantic-Guided Zero-Shot Learning for Low-Light Image/Video Enhancement Shen Zheng, Gaurav Gupta WACV 2022 [Paper] [Webpage] [Code] [Slides] We introduce SGZ, a zero-shot low-light image and video enhancement framework at 1000 FPS with pixel-wise light deficiency estimation, recurrent image enhancement, and unsupervised semantic segmentation
	AS-IntroVAE: Adversarial Similarity Distance Makes Robust IntroVAE Changjie Lu, Shen Zheng, Zirui Wang, Omar Dib, Gaurav Gupta ACML 2022 [Paper] [Code] [Slides] We propose Adversarial Similarity Distance Introspective Variational Autoencoder (AS-IntroVAE), which can address the posterior collapse and the vanishing gradient problem in image generation in one go.

Professional Experiences

Perception Software Engineer (Intern) at Waymo

Mentor: Guohao Zhang; Zhenyao Zhu

Improved Online HD Map Construction using long-term and short-term memory fusion; Worked on Gaussian Rendering to improve 3D occupancy prediction.

Perception Software Engineer (Full-Time) at Lucid Motors

Director: Dr. Feng Guo

Working as a perception software engineer in the ADAS perception team responsible for ADAS parking, traffic light detection, and blockage detection.

Perception Engineer (Intern) at Momenta

Director: Dr. Wangjiang Zhu

Responsible for long-tailed data augmentation, training data auto-labeling and cleaning, and model evaluation for traffic light detection algorithms.

Services

Technical Program Committee:
WCCI 2024

Conference Reviewers:
AAAI (2022), IJCNN (2023, 2024, 2025), WACV (2023, 2024, 2025), ECCV (2024), CVPR (2025,2026), ICCV (2025), BMVC (2026)

Journal Reviewers:
TNNLS, IJCV, TCSVT, ESWA, EAAI, JVCIR, Neurocomputing

Skills

Programming Languages:
Python, R, Java, C++, Matlab, HTML, Mathematica, Shell, LaTeX, Markdown

Frameworks & Platforms:
Pytorch, TensorFlow, Keras, Ubuntu, Docker, Git, ONNX, CUDA

Libraries:
Scikit-Learn, SciPy, NumPy, OpenCV, Matplotlib, Pandas

Fun Facts

Sports:
Basketball, Table Tennis, Swimming, Cycling, Hiking, Weightlifting

Games:
DOTA2, AOE2, Warcraft III

Beliefs:

予诺三观, 浩然道路, 立威思想, 谢航精神, 永富方法, 框-弘-内卷, 雯芯境界