Publications
You can also find my publications on Google Scholar.
*equal contribution; †corresponding author.
2025
Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction Runmin Zhang, Zhu Yu†, Si-Yuan Cao†, Lingyu Zhu, Guangyi Zhang, Xiaokai Bai, Hui-Liang Shen International Conference on Computer Vision (ICCV), 2025 [Paper] [Code]
Language Driven Occupancy Prediction Zhu Yu, Bowen Pang, Lizhe Liu, Runmin Zhang, Qiang Li, Si-Yuan Cao, Maochun Luo, Mingxia Chen, Sheng Yang, Hui-Liang Shen International Conference on Computer Vision (ICCV), 2025 [Paper] [Code]
EDFFDNet: Towards Accurate and Efficient Unsupervised Multi-Grid Image Registration Haokai Zhu, Bo Qu, Si-Yuan Cao, Runmin Zhang, Shujie Chen, Bailin Yang, Hui-Liang Shen International Conference on Computer Vision (ICCV), 2025
SSHNet: Unsupervised Cross-modal Homography Estimation via Problem Reformulation and Split Optimization Junchen Yu, Si-Yuan Cao†, Runmin Zhang, Chenghao Zhang, Zhu Yu, Shujie Chen, Bailin Yang, and Hui-Liang Shen Computer Vision and Pattern Recognition (CVPR), 2025 (Highlight) [Paper] [Code]
Learned Image Transmission with Hierarchical Variational Autoencoder Guangyi Zhang, Hanlei Li, Yunlong Cai†, Qiyu Hu, Guanding Yu, Runmin Zhang AAAI Conference on Artificial Intelligence (AAAI), 2025 [Paper]
Structure-Aware Radar-Camera Depth Estimation Fuyi Zhang, Zhu Yu†, Chunhao Li, Runmin Zhang, Xiaokai Bai, Zili Zhou, Si-Yuan Cao, Fang Wang, and Hui-Liang Shen† IEEE International Conference on Robotics and Automation (ICRA), 2025 [Code]
SGDFormer: One-stage Transformer-based Architecture for Cross-Spectral Stereo Image Guided Denoising Runmin Zhang, Zhu Yu, Zehua Sheng, Jiacheng Ying†, Si-Yuan Cao, Shu-Jie Chen, Bailin Yang, Junwei Li, and Hui-Liang Shen† Information Fusion, 2025 [Paper] [Code]
STARNet: Low-light Video Enhancement using Spatio-temporal Consistency Aggregation Zhe Wu, Zehua Sheng, Xue Zhang, Si-Yuan Cao, Runmin Zhang, Beinan Yu, Chenghao Zhang, Bailin Yang, Hui-Liang Shen† Pattern Recognition, 2025 [Paper]
2024
Context and Geometry Aware Voxel Transformer for Semantic Scene Completion Zhu Yu, Runmin Zhang, Jiacheng Ying, Junchen Yu, Xiaohai Hu, Lun Luo, Si-Yuan Cao†, and Hui-Liang Shen† Conference on Neural Information Processing Systems (NeurIPS), 2024 (Spotlight) [Paper] [Code]
SCPNet: Unsupervised Cross-modal Homography Estimation via Intra-modal Self-supervised Learning Runmin Zhang*, Jun Ma*, Si-Yuan Cao*†, Lun Luo, Beinan Yu, Shu-Jie Chen, Junwei Li, and Hui-Liang Shen European Conference on Computer Vision (ECCV), 2024 [Paper] [Code]
Rethinking Early-fusion Strategies for Improved Multispectral Object Detection Xue Zhang, Si-Yuan Cao†, Fang Wang, Runmin Zhang, Zhe Wu, Xiaohan Zhang, Xiaokai Bai, and Hui-Liang Shen† IEEE Transactions on Intelligent Vehicles (TIV), 2024 [Paper] [Code]
RestorerID: Towards Tuning-Free Face Restoration with ID Preservation Jiacheng Ying*, Mushui Liu*, Zhe Wu, Runmin Zhang, Zhu Yu, Siming Fu, Si-Yuan Cao, Chao Wu, Yunlong Yu, Hui-Liang Shen† arXiv, 2024 [Paper] [Code]
2023
Recurrent Homography Estimation Using Homography-Guided Image Warping and Focus Transformer Si-Yuan Cao, Runmin Zhang, Lun Luo†, Beinan Yu, Zehua Sheng, Junwei Li, and Hui-Liang Shen Computer Vision and Pattern Recognition (CVPR), 2023 [Paper] [Code]
PCNet: A Structure Similarity Enhancement Method for Multispectral and Multimodal Image Registration Si-Yuan Cao, Beinan Yu, Lun Luo, Runmin Zhang, Shu-Jie Chen, Chunguang Li, Hui-Liang Shen† Information Fusion, 2023 [Paper]