About Me
I am a Ph.D. student in computer vision at Zhejiang University, advised by Prof. Hui-Liang Shen and Dr. Si-Yuan Cao. Previously, I earned my bachelor's degree from Zhejiang University.
My research interests include image matching, 3D reconstruction/perception, and multi-modal image processing. Feel free to contact me by email (runmin_zhang@zju.edu.cn).
📚 Selected Publications
*equal contribution; †corresponding author.
3D Reconstruction/Perception
![]() | Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction Runmin Zhang, Zhu Yu†, Si-Yuan Cao†, Lingyu Zhu, Guangyi Zhang, Xiaokai Bai, Hui-Liang Shen International Conference on Computer Vision (ICCV), 2025 [Paper] [Code] |
![]() | Language Driven Occupancy Prediction Zhu Yu, Bowen Pang, Lizhe Liu, Runmin Zhang, Qiang Li, Si-Yuan Cao, Maochun Luo, Mingxia Chen, Sheng Yang, Hui-Liang Shen International Conference on Computer Vision (ICCV), 2025 [Paper] [Code] |
![]() | Context and Geometry Aware Voxel Transformer for Semantic Scene Completion Zhu Yu, Runmin Zhang, Jiacheng Ying, Junchen Yu, Xiaohai Hu, Lun Luo, Si-Yuan Cao†, and Hui-Liang Shen† Conference on Neural Information Processing Systems (NeurIPS), 2024 (Spotlight) [Paper] [Code] |
Image Matching
![]() | SSHNet: Unsupervised Cross-modal Homography Estimation via Problem Reformulation and Split Optimization Junchen Yu, Si-Yuan Cao†, Runmin Zhang, Chenghao Zhang, Zhu Yu, Shujie Chen, Bailin Yang, and Hui-Liang Shen Computer Vision and Pattern Recognition (CVPR), 2025 (Highlight) [Paper] [Code] |
![]() | SCPNet: Unsupervised Cross-modal Homography Estimation via Intra-modal Self-supervised Learning Runmin Zhang*, Jun Ma*, Si-Yuan Cao*†, Lun Luo, Beinan Yu, Shu-Jie Chen, Junwei Li, and Hui-Liang Shen European Conference on Computer Vision (ECCV), 2024 [Paper] [Code] |
![]() | Recurrent Homography Estimation Using Homography-Guided Image Warping and Focus Transformer Si-Yuan Cao, Runmin Zhang, Lun Luo†, Beinan Yu, Zehua Sheng, Junwei Li, and Hui-Liang Shen Computer Vision and Pattern Recognition (CVPR), 2023 [Paper] [Code] |
Multi-modal Image Processing
![]() | SGDFormer: One-stage Transformer-based Architecture for Cross-Spectral Stereo Image Guided Denoising Runmin Zhang, Zhu Yu, Zehua Sheng, Jiacheng Ying†, Si-Yuan Cao, Shu-Jie Chen, Bailin Yang, Junwei Li, and Hui-Liang Shen† Information Fusion, 2025 [Paper] [Code] |
Services
Journal Reviewer: TIP
Conference Reviewer: CVPR, ICCV, NeurIPS, ICML, AAAI