About Me

I am a Ph.D. student in computer vision at Zhejiang University, advised by Prof. Hui-Liang Shen and Dr. Si-Yuan Cao. Previously, I earned my bachelor's degree from Zhejiang University.

My research interests lie in image matching, 3D reconstruction/perception, and multi-modal image processing, etc. Feel free to contact me using Email (runmin_zhang@zju.edu.cn).

📚 Selected Publications

*equal contribution; †corresponding author.

3D Reconstruction/Perception

Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction
Runmin Zhang, Zhu Yu†, Si-Yuan Cao†, Lingyu Zhu, Guangyi Zhang, Xiaokai Bai, Hui-Liang Shen
International Conference on Computer Vision (ICCV), 2025
[Paper] [Code]
Language Driven Occupancy Prediction
Zhu Yu, Bowen Pang, Lizhe Liu, Runmin Zhang, Qiang Li, Si-Yuan Cao, Maochun Luo, Mingxia Chen, Sheng Yang, Hui-liang Shen
International Conference on Computer Vision (ICCV), 2025
[Paper] [Code]
Context and Geometry Aware Voxel Transformer for Semantic Scene Completion
Zhu Yu, Runmin Zhang, Jiacheng Ying, Junchen Yu, Xiaohai Hu, Lun Luo, Si-Yuan Cao†, and Hui-Liang Shen†
Conference on Neural Information Processing Systems (NeurIPS), 2024 (Spotlight)
[Paper] [Code]

Image Matching

SSHNet: Unsupervised Cross-modal Homography Estimation via Problem Reformulation and Split Optimization
Junchen Yu, Si-Yuan Cao†, Runmin Zhang, Chenghao Zhang, Zhu Yu, Shujie Chen, Bailin Yang, and Hui-Liang Shen
Computer Vision and Pattern Recognition (CVPR), 2025 (Highlight)
[Paper] [Code]
SCPNet: Unsupervised Cross-modal Homography Estimation via Intra-modal Self-supervised Learning
Runmin Zhang*, Jun Ma*, Si-Yuan Cao*†, Lun Luo, Beinan Yu, Shu-Jie Chen, Junwei Li, and Hui-Liang Shen
European Conference on Computer Vision (ECCV), 2024
[Paper] [Code]
Recurrent Homography Estimation Using Homography-Guided Image Warping and Focus Transformer
Si-Yuan Cao, Runmin Zhang, Lun Luo†, Beinan Yu, Zehua Sheng, Junwei Li, and Hui-Liang Shen
Computer Vision and Pattern Recognition (CVPR), 2023
[Paper] [Code]

Multi-modal Image Processing

SGDFormer: One-stage Transformer-based Architecture for Cross-Spectral Stereo Image Guided Denoising
Runmin Zhang, Zhu Yu, Zehua Sheng, Jiacheng Ying†, Si-Yuan Cao, Shu-Jie Chen, Bailin Yang, Junwei Li, and Hui-Liang Shen†
Information Fusion, 2025
[Paper] [Code]

Services

Journal Reviewer:TIP

Conference Reviewer:CVPR, ICCV, NeurIPS, ICML, AAAI