|
Research Areas
Modern deep learning models are inefficient:
they are trained on many common-scene images with large uninformative regions
and often require manual labels,
wasting both compute and annotation effort.
My research addresses this inefficiency from two angles:
- Compute efficiency: synthesizing or collecting long-tail scenes (e.g., TPSeNCE, ROADWork) and oversampling salient regions (e.g., WarpI2I, Instance-Warp).
- Label efficiency: Weak supervision with foundation-model pseudo-labels to reduce manual annotation.
My earlier works focused on image restoration and enhancement
(e.g., SGZ, LLIE_Survey).
|
|
|
WarpI2I: Image Warping for Image-to-Image Translation
Shen Zheng, Anurag Ghosh, Gaurav Parmar, Srinivasa Narasimhan
Under Review
[Webpage]
We warp the input image to enlarge salient regions (e.g., objects, faces, eyes)
so they are better preserved during image-to-image translation in the compressed latent representation.
|
|
|
ROADWork Dataset: Learning to Recognize, Observe, Analyze and Drive Through Work Zones
Anurag Ghosh, Shen Zheng, Robert Tamburo, Juan R. Alvarez Padilla, Hailiang Zhu, Michael Cardei, Nicholas Dunn, Christoph Mertz, Srinivasa Narasimhan
ICCV 2025
[Paper]
[Webpage]
[GitHub]
We introduce ROADWork, a large-scale open-source dataset and benchmark
with fine-grained annotations and scene descriptions
to support robust perception and navigation in complex construction zones.
|
|
|
Instance-Warp: Saliency Guided Image Warping for Unsupervised Domain Adaptation
Shen Zheng★,
Anurag Ghosh★,
Srinivasa Narasimhan
WACV 2025
[Paper]
[Webpage]
[Code]
We warp the image at instance-level to oversample easier-to-adapt foreground objects
and undersample harder-to-adapt background regions to improve feature representation in domain adaptation.
|
|
|
TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in Rain
Shen Zheng,
Changjie Lu,
Srinivasa Narasimhan
WACV 2024
[Paper]
[Webpage]
[Code]
[Slides]
[Poster]
Introduce a Triangular Probability Similarity (TPS) loss to minimize the artifacts and distortions.
Propose a Semantic Noise Contrastive Estimation (SeNCE) strategy to stabilize the amounts of generated rain.
Show that realistic rain generation benefits deraining and object detection in rain.
|
|
|
Low-Light Image Enhancement: A Comprehensive Survey and Beyond
Shen Zheng,
Yiling Ma,
Jinqian Pan,
Changjie Lu,
Gaurav Gupta
[Paper]
[Code]
Present a comprehensive survey of low-light image enhancement (LLIE).
Propose the SICE_Grad and SICE_Mix image datasets, which include images with both overexposure and underexposure.
Introduce Night Wenzhou, a large-scale, high-resolution video dataset captured in fast motion with diverse illuminations and degradation.
|
|
|
PointNorm: Dual Normalization is All You Need for Point Cloud Analysis
Shen Zheng,
Jinqian Pan,
Changjie Lu,
Gaurav Gupta
IJCNN 2023 (Oral Presentation)
[Paper]
[Webpage]
[Code]
[Slides]
Solution: PointNorm, a point cloud analysis network with a DualNorm module (Point Normalization & Reverse Point Normalization)
that leverages local mean and global standard deviation to address irregular (i.e., unevenly distributed) point clouds
|
|
|
Semantic-Guided Zero-Shot Learning for Low-Light Image/Video Enhancement
Shen Zheng,
Gaurav Gupta
WACV 2022
[Paper]
[Webpage]
[Code]
[Slides]
Introduce SGZ, a zero-shot low-light image enhancement framework
with pixel-wise light deficiency estimation, parameter-free recurrent image enhancement, and unsupervised semantic segmentation
to enhance unevenly illuminated images and videos.
|
|
|
AS-IntroVAE: Adversarial Similarity Distance Makes Robust IntroVAE
Changjie Lu,
Shen Zheng,
Zirui Wang,
Omar Dib,
Gaurav Gupta
ACML 2022
[Paper]
[Code]
[Slides]
Propose Adversarial Similarity Distance Introspective Variational Autoencoder (AS-IntroVAE), which can address the posterior
collapse and the vanishing gradient problem in image generation in one go.
|
|
Perception Software Engineer (Intern) at Waymo
Mentor: Guohao Zhang
Improved Online HD Map Construction using long-term and short-term memory fusion.
|
|
Perception Software Engineer (Full-Time) at Lucid Motors
Director: Dr. Feng Guo
Working as a perception software engineer in the ADAS perception team responsible for ADAS parking, traffic light detection, and blockage detection.
|
|
Perception Engineer (Intern) at Momenta
Director: Dr. Wangjiang Zhu
Responsible for long-tailed data augmentation, training data auto-labeling and cleaning, and model evaluation for traffic light detection algorithms.
|
 |
Technical Program Committee:
WCCI 2024
Conference Reviewers:
CVIP (2021, 2022),
AAAI (2022),
IJCNN (2023, 2024, 2025),
WACV (2023, 2024, 2025),
ECCV (2024),
CVPR (2025,2026),
ICCV (2025)
Journal Reviewers:
TNNLS,
IJCV,
TCSVT,
ESWA,
EAAI,
JVCIR,
Neurocomputing
|
 |
Programming Languages:
Python, R, Java, C++, Matlab, HTML, Mathematica, Shell, LaTeX, Markdown
Frameworks & Platforms:
Pytorch, TensorFlow, Keras, Ubuntu, Docker, Git, ONNX, CUDA
Libraries:
Scikit-Learn, SciPy, NumPy, OpenCV, Matplotlib, Pandas
|
 |
Sports:
Basketball, Table Tennis, Swimming, Cycling, Hiking, Weightlifting
Games:
DOTA2, AOE2, Warcraft III
Beliefs:
予诺三观,
浩然道路,
立威思想,
谢航精神,
永富方法,
刘远哲学,
天远信念,
志鹏智慧,
框-弘-内卷,
婉莹心态,
雯芯菩萨
|
|