Publications

You can also find my publications on Google Scholar.

*equal contribution; †corresponding author.

2025

Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction
Runmin Zhang, Zhu Yu†, Si-Yuan Cao†, Lingyu Zhu, Guangyi Zhang, Xiaokai Bai, Hui-Liang Shen
International Conference on Computer Vision (ICCV), 2025
[Paper] [Code]
Language Driven Occupancy Prediction
Zhu Yu, Bowen Pang, Lizhe Liu, Runmin Zhang, Qiang Li, Si-Yuan Cao, Maochun Luo, Mingxia Chen, Sheng Yang, Hui-Liang Shen
International Conference on Computer Vision (ICCV), 2025
[Paper] [Code]
EDFFDNet: Towards Accurate and Efficient Unsupervised Multi-Grid Image Registration
Haokai Zhu, Bo Qu, Si-Yuan Cao, Runmin Zhang, Shujie Chen, Bailin Yang, Hui-Liang Shen
International Conference on Computer Vision (ICCV), 2025
SSHNet: Unsupervised Cross-modal Homography Estimation via Problem Reformulation and Split Optimization
Junchen Yu, Si-Yuan Cao†, Runmin Zhang, Chenghao Zhang, Zhu Yu, Shujie Chen, Bailin Yang, and Hui-Liang Shen
Computer Vision and Pattern Recognition (CVPR), 2025 (Highlight)
[Paper] [Code]
Learned Image Transmission with Hierarchical Variational Autoencoder
Guangyi Zhang, Hanlei Li, Yunlong Cai†, Qiyu Hu, Guanding Yu, Runmin Zhang
AAAI Conference on Artificial Intelligence (AAAI), 2025
[Paper]
Structure-Aware Radar-Camera Depth Estimation
Fuyi Zhang, Zhu Yu†, Chunhao Li, Runmin Zhang, Xiaokai Bai, Zili Zhou, Si-Yuan Cao, Fang Wang, and Hui-Liang Shen†
IEEE International Conference on Robotics and Automation (ICRA), 2025
[Code]
SGDFormer: One-stage Transformer-based Architecture for Cross-Spectral Stereo Image Guided Denoising
Runmin Zhang, Zhu Yu, Zehua Sheng, Jiacheng Ying†, Si-Yuan Cao, Shu-Jie Chen, Bailin Yang, Junwei Li, and Hui-Liang Shen†
Information Fusion, 2025
[Paper] [Code]
STARNet: Low-light Video Enhancement using Spatio-temporal Consistency Aggregation
Zhe Wu, Zehua Sheng, Xue Zhang, Si-Yuan Cao, Runmin Zhang, Beinan Yu, Chenghao Zhang, Bailin Yang, Hui-Liang Shen†
Pattern Recognition, 2025
[Paper]

2024

Context and Geometry Aware Voxel Transformer for Semantic Scene Completion
Zhu Yu, Runmin Zhang, Jiacheng Ying, Junchen Yu, Xiaohai Hu, Lun Luo, Si-Yuan Cao†, and Hui-Liang Shen†
Conference on Neural Information Processing Systems (NeurIPS), 2024 (Spotlight)
[Paper] [Code]
SCPNet: Unsupervised Cross-modal Homography Estimation via Intra-modal Self-supervised Learning
Runmin Zhang*, Jun Ma*, Si-Yuan Cao*†, Lun Luo, Beinan Yu, Shu-Jie Chen, Junwei Li, and Hui-Liang Shen
European Conference on Computer Vision (ECCV), 2024
[Paper] [Code]
Rethinking Early-fusion Strategies for Improved Multispectral Object Detection
Xue Zhang, Si-Yuan Cao†, Fang Wang, Runmin Zhang, Zhe Wu, Xiaohan Zhang, Xiaokai Bai, and Hui-Liang Shen†
IEEE Transactions on Intelligent Vehicles (TIV), 2024
[Paper] [Code]
RestorerID: Towards Tuning-Free Face Restoration with ID Preservation
Jiacheng Ying*, Mushui Liu*, Zhe Wu, Runmin Zhang, Zhu Yu, Siming Fu, Si-Yuan Cao, Chao Wu, Yunlong Yu, Hui-Liang Shen†
arXiv, 2024
[Paper] [Code]

2023

Recurrent Homography Estimation Using Homography-Guided Image Warping and Focus Transformer
Si-Yuan Cao, Runmin Zhang, Lun Luo†, Beinan Yu, Zehua Sheng, Junwei Li, and Hui-Liang Shen
Computer Vision and Pattern Recognition (CVPR), 2023
[Paper] [Code]
PCNet: A Structure Similarity Enhancement Method for Multispectral and Multimodal Image Registration
Si-Yuan Cao, Beinan Yu, Lun Luo, Runmin Zhang, Shu-Jie Chen, Chunguang Li, Hui-Liang Shen†
Information Fusion, 2023
[Paper]