Haoming Cai /haʊˈmɪŋ tsaɪ/

I am a thrid-year CS PhD student @ CS Department of University of Maryland, College Park, advised by Professor Christopher Metzler.

Previously, I had a wonderful journey (2020-2022) in X-Pixel with my supervisor Prof. Dong Chao (Shanghai AI Lab) and mentor Dr. Gu Jinjin (Univeristy of Sydney).

Contact : hmcai@umd.edu / helmut.choy@gmail.com / haomingcai@link.cuhk.edu.cn

My CV : PDF

GitHub  /  Google Scholar  /  Flickr  / 

profile photo

Research Interest

- My research centers on the convergence of Computational Photography, Gen AI and Low-level Vision. I aim to synergize computational imaging techniques with advanced back-end processing algorithms to enhance the perceptual quality of human experiences on mobile and edge devices.
Sometimes Science Is More Art Than Science -- Rick Sanchez



Seeing, Tracking, and Reconstruction (2D & 3D) Under Challenging Imaging Conditions (2023-Now)

project image

Temporally Consistent Atmospheric Turbulence Mitigation with Neural Representations


Haoming Cai*, Jingxi Chen*, Brandon Y. Feng, Weiyun Jiang, Mingyang Xie, Kevin Zhang, Cornelia Fermuller, Yiannis Aloimonos, Ashok Veeraraghavan, Christopher Metzler.
NeurIPS'24

project image

Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats


Mingyang Xie*, Haoming Cai* , Sachin Shah, Yiran Xu, Brandon Y. Feng, Jia-bin Huang, Christopher Metzler
ECCV'24
website /

project image

CodedEvents: Optimal Point-Spread-Function Engineering for 3D-Tracking with Event Cameras


Sachin Shah, Matthew Albert Chan, Haoming Cai, Jingxi Chen, Sakshum Kulshrestha, Chahat Deep Singh, Yiannis Aloimonos, Christopher Metzler.
CVPR'24

This paper explores PSF engineering for neuromorphic event cameras, designing optimized masks for superior 3D point localization and tracking.

project image

Snow Removal in Video: A New Dataset and A Novel Method


Haoyu Chen, Jingjing Ren, Jinjin Gu, Hongtao Wu, Xuequan Lu, Haoming Cai, Lei Zhu
ICCV'23
arxiv/ code / website /

This paper presents a new deep learning framework for removing snow from videos, featuring a high-quality dataset and innovative modules for effective snow removal, outperforming existing methods.




Image/Video Generation through Diffusion Model (2023-Now)

project image

TimeRewind: Rewinding Time with Image-and-Events Video Diffusion


Jingxi Chen, Brandon Y. Feng, Haoming Cai, Mingyang Xie, Christopher Metzler, Cornelia Fermuller, Yiannis Aloimonos
Under Review - 2024

Image Metric Design Inspired by Human Visual System (2020-Now)

project image

Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment


Tianhe Wu, Shuwei Shi, Haoming Cai , Mingdeng Cao, Jing Xiao, Yinqiang Zheng, Yujiu Yang
NeurIPS'23
arxiv/ code / website /

Current omnidirectional image quality assessment lacks observer browsing modeling. We propose Assessor360, a novel multi-sequence network for BOIQA derived from realistic multi-assessor ODI quality assessment

project image

Pipal: a large-scale image quality assessment dataset for perceptual image restoration


Jinjin Gu, Haoming Cai, Haoyu Chen, Xiaoxing Ye, Jimmy S Ren, Chao Dong.
ECCV'20
arxiv/ website /

This paper highlights the challenge IQA faces with emerging GAN-based image restoration methods, noting a growing disparity between quantitative metrics and perceptual quality. To address this, the authors introduce a large-scale IQA dataset and benchmarks to enhance IQA methods’ effectiveness.

Controllable & Efficient Image Restoration (2020 - 2022)

project image

Super-resolution by predicting offsets: An ultra-efficient super-resolution network for rasterized images


Jinjin Gu, Haoming Cai, Chenyu Dong, Ruofan Zhang, Yulun Zhang, Wenming Yang, Chun Yuan.
ECCV'22
arxiv/ code /

SRPO is a real-time super-resolution method for computer graphics, achieving superior visual effects with minimal computational cost by leveraging rasterized image features and offset prediction.

project image

Efficient image super-resolution using vast-receptive-field attention


Haoming Cai*, Lin Zhou*, Jinjin Gu, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Yu Qiao, Chao .
AIM'22 @ ECCV
arxiv/ code /

This study improves super-resolution networks by refining the attention mechanism, leading to VapSR, which outperforms lightweight networks with fewer parameters, achieving similar results as IMDB and RFDN networks with significantly fewer parameters.

project image

Blueprint separable residual network for efficient image super-resolution


Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Jinjin Gu, Yu Qiao, Chao Dong.
NTIRE'23 @ CVPR
arxiv/ code /

Winner of Efficient Image Super-Resolution Track @ New Trends in Image Restoration and Enhancement (NTIRE) workshop of CVPR’23

project image

Toward interactive modulation for photo-realistic image restoration


Haoming Cai, Jingwen He, Yu Qiao, Chao Dong.
CVPRW'21
arxiv/ code /

This paper presents CUGAN, a Controllable Unet GAN, for modulating image restoration tasks with fine texture details. Through dynamic level adjustments and condition networks, CUGAN outperforms previous methods, offering smooth user control over output effects.


Design and source code from Jon Barron's website