Haoming Cai
/haʊˈmɪŋ tsaɪ/
I am a thrid-year CS PhD student @ CS Department of University of Maryland, College Park, advised by Professor Christopher Metzler.
Previously, I had a wonderful journey (2020-2022) in X-Pixel with my supervisor Prof. Dong Chao (Shanghai AI Lab) and mentor Dr. Gu Jinjin (Univeristy of Sydney).
Contact : hmcai@umd.edu / helmut.choy@gmail.com / haomingcai@link.cuhk.edu.cn
My CV : PDF
GitHub /
Google Scholar /
Flickr /
|
|
Research Interest
- My research centers on the convergence of Computational Photography, Gen AI and Low-level Vision. I aim to synergize computational imaging techniques with advanced back-end processing algorithms to enhance the perceptual quality of human experiences on mobile and edge devices.
Sometimes Science Is More Art Than Science -- Rick Sanchez
|
3D Reconstruction in Computational Photography (2023-Now)
|
|
Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats
Mingyang Xie*, Haoming Cai* , Sachin Shah, Yiran Xu, Brandon Y. Feng, Jia-bin Huang, Christopher Metzler
The European Conference on Computer Vision (ECCV'24)
|
Seeing & Tracking Through Adverse Weather Condition (2023-Now)
|
|
Temporally Consistent Atmospheric Turbulence Mitigation with Neural Representations
Haoming Cai*, Jingxi Chen*, Brandon Y. Feng, Weiyun Jiang, Mingyang Xie, Kevin Zhang, Cornelia Fermuller, Yiannis Aloimonos, Ashok Veeraraghavan, Christopher Metzler.
Under Review - 2024
|
|
CodedEvents: Optimal Point-Spread-Function Engineering for 3D-Tracking with Event Cameras
Sachin Shah, Matthew Albert Chan, Haoming Cai, Jingxi Chen, Sakshum Kulshrestha, Chahat Deep Singh, Yiannis Aloimonos, Christopher Metzler.
The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR'24)
This paper explores PSF engineering for neuromorphic event cameras, designing optimized masks for superior 3D point localization and tracking.
|
|
Snow Removal in Video: A New Dataset and A Novel Method
Haoyu Chen, Jingjing Ren, Jinjin Gu, Hongtao Wu, Xuequan Lu, Haoming Cai, Lei Zhu
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV'23)
arxiv/
code /
website /
This paper presents a new deep learning framework for removing snow from videos, featuring a high-quality dataset and innovative modules for effective snow removal, outperforming existing methods.
|
Image/Video Generation through Diffusion Model (2023-Now)
|
|
TimeRewind: Rewinding Time with Image-and-Events Video Diffusion
Jingxi Chen, Brandon Y. Feng, Haoming Cai, Mingyang Xie, Christopher Metzler, Cornelia Fermuller, Yiannis Aloimonos
Under Review - 2024
|
Image Metric Design Inspired by Human Visual System (2020-Now)
|
|
Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment
Tianhe Wu, Shuwei Shi, Haoming Cai , Mingdeng Cao, Jing Xiao, Yinqiang Zheng, Yujiu Yang
Advances in Neural Information Processing Systems (NeurIPS'23)
arxiv/
code /
website /
Current omnidirectional image quality assessment lacks observer browsing modeling. We propose Assessor360, a novel multi-sequence network for BOIQA derived from realistic multi-assessor ODI quality assessment
|
|
Pipal: a large-scale image quality assessment dataset for perceptual image restoration
Jinjin Gu, Haoming Cai, Haoyu Chen, Xiaoxing Ye, Jimmy S Ren, Chao Dong.
The European Conference on Computer Vision (ECCV'20)
arxiv/
website /
This paper highlights the challenge IQA faces with emerging GAN-based image restoration methods, noting a growing disparity between quantitative metrics and perceptual quality. To address this, the authors introduce a large-scale IQA dataset and benchmarks to enhance IQA methods’ effectiveness.
|
Controllable & Efficient Image Restoration (2020 - 2022)
|
|
Super-resolution by predicting offsets: An ultra-efficient super-resolution network for rasterized images
Jinjin Gu, Haoming Cai, Chenyu Dong, Ruofan Zhang, Yulun Zhang, Wenming Yang, Chun Yuan.
European Conference on Computer Vision (ECCV'22)
arxiv/
code /
SRPO is a real-time super-resolution method for computer graphics, achieving superior visual effects with minimal computational cost by leveraging rasterized image features and offset prediction.
|
|
Efficient image super-resolution using vast-receptive-field attention
Haoming Cai*, Lin Zhou*, Jinjin Gu, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Yu Qiao, Chao .
AIM'22 @ ECCV
arxiv/
code /
This study improves super-resolution networks by refining the attention mechanism, leading to VapSR, which outperforms lightweight networks with fewer parameters, achieving similar results as IMDB and RFDN networks with significantly fewer parameters.
|
|
Blueprint separable residual network for efficient image super-resolution
Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Jinjin Gu, Yu Qiao, Chao Dong.
NTIRE'23 @ CVPR
arxiv/
code /
Winner of Efficient Image Super-Resolution Track @ New Trends in Image Restoration and Enhancement (NTIRE) workshop of CVPR’23
|
|
Toward interactive modulation for photo-realistic image restoration
Haoming Cai, Jingwen He, Yu Qiao, Chao Dong.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'21)
arxiv/
code /
This paper presents CUGAN, a Controllable Unet GAN, for modulating image restoration tasks with fine texture details. Through dynamic level adjustments and condition networks, CUGAN outperforms previous methods, offering smooth user control over output effects.
|
|