Jiaming Zhang

Professor

Hunan University (HNU)

I am a Professor at the School of AI and Robotics at Hunan University (HNU), where I lead the Integrated Sensing and Assistive Intelligence (InSAI) Lab. Our research focuses on developing inclusive and intelligent systems at the intersection of computer vision and assistive technology, with the goal of creating solutions that assist people, especially those with visual impairments.

I collaborate with Prof. Rainer Stiefelhagen at CV:HCI Lab and ACCESS@KIT Lab at Karlsruhe Institute of Technology (KIT). I am happy to work with Prof. Philip Torr at TVG at University of Oxford, and with Prof. Marc Pollefeys at CVG Group at ETH Zurich. Previously, I was a Postdoc at KIT. I received my Ph.D. (summa cum laude) and M.Sc. from KIT and my B.Sc. from Shenzhen University.

📣[News] The InSAI Lab is currently hiring! We are actively looking for Postdocs, Ph.D./Master/Bachelor Students, Research Assistants, and Visiting Scholars/Interns, please email me and join us!

Interest

Robotics
Computer Vision
Assistive Technology
Human-Computer Interaction
2D/3D Scene Understanding
Document Understanding
Embodied Intelligence
Autonomous Driving

Experience

Professor, 2025 - Present
School of AI and Robotics, HNU
Postdoc, 2023 - 2025
CV:HCI, KIT
Academic Guest, 2024 - 2025
CVG, ETH Zurich
Research Assistant, 2020 - 2023
CV:HCI, KIT

Education

Ph.D. in Computer Science, 2023
KIT, Germany
Visiting Ph.D. Student, 2023
University of Oxford, UK
M.Sc. in Computer Science, 2020
KIT, Germany
B.Sc. in Computer Science, 2015
Shenzhen University, China

Selected Publications

(* equal contribution, † corresponding author.)

Unlocking Constraints: Source-Free Occlusion-Aware Seamless Segmentation
Y. Cao*, J. Zhang*, X. Zheng, H. Shi, K. Peng, H. Liu, K. Yang, H. Zhang
ICCV 2025 Paper Code

Scene-agnostic Pose Regression for Visual Localization
J. Zheng, R. Liu, Y. Chen, Z. Chen, K. Yang, J. Zhang†, R. Stiefelhagen
CVPR 2025 Project page Paper Code

SAMBLE: Shape-Specific Point Cloud Sampling for an Optimal Trade-Off Between Local Detail and Global Uniformity
C. Wu, Y. Wan*, H. Fu*, J. Pfrommer, Z. Zhong, J. Zheng†, J. Zhang, J. Beyerer
CVPR 2025 Project page Paper Code

GraphDoc: A Graph-based Document Structure Analysis
Y. Chen, R. Liu, J. Zheng, D. Wen, K. Peng, J. Zhang†, R. Stiefelhagen
ICLR 2025 Project page Paper Code

@Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology
X. Jiang*, J. Zheng*, R. Liu, J. Li, J. Zhang†, S. Matthiesen, R. Stiefelhagen
WACV 2025 Project page Paper Code

OneBEV: Using One Panoramic Image for Bird's-Eye-View Semantic Mapping
J. Wei, J. Zheng, R. Liu, J. Hu, J. Zhang†, R. Stiefelhagen
ACCV 2024 ( Best paper finalist) Paper Code

Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation
J. Zhang, K. Yang, H. Shi, S. Reiß, K. Peng, C. Ma, H. Fu, P. Torr, K. Wang, R. Stiefelhagen.
IEEE T-PAMI 2024 Paper Code

CoBEV: Elevating Roadside 3D Object Detection with Depth and Height Complementarity
H. Shi*, C. Peng*, J. Zhang*, K. Yang, Y. Wu, H. Ni, Y, Lin, R. Stiefelhagen, K. Wang.
IEEE T-IP 2024 Paper Code

Open Panoramic Segmentation
J. Zheng, R. Liu, Y. Chen, K. Peng, C. Wu, K. Yang, J. Zhang†, R. Stiefelhagen.
ECCV 2024 Project page Paper Code

Occlusion-Aware Seamless Segmentation
Y. Cao*, J. Zhang*, H. Shi, K. Peng, Y. Zhang, H. Zhang, R. Stiefelhagen, K. Yang.
ECCV 2024 Paper Code

Referring Atomic Video Action Recognition
K. Peng, J. Fu, K. Yang, D. Wen, Y. Chen, R. Liu, J. Zheng, J. Zhang, S. Sarfraz, R. Stiefelhagen, A. Roitberg.
ECCV 2024 Paper Code

RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
Y. Chen, J. Zhang†, K. Peng, J. Zheng, R. Liu, P. Torr, R. Stiefelhagen.
CVPR 2024 Project page Paper Code

MateRobot: Material Recognition in Wearable Robotics for People with Visual Impairments
J. Zheng*, J. Zhang*, K. Yang, K. Peng, R. Stiefelhagen.
ICRA 2024 ( Best paper finalist on HRI) Project page Paper Code

Delivering Arbitrary-Modal Semantic Segmentation
J. Zhang*, R. Liu*, S. Hao, K. Yang, S. Reiß, K. Peng, H. Fu, K. Wang, R. Stiefelhagen.
CVPR 2023 Project page Paper Code Dataset

Bending Reality: Distortion-aware Transformers for Adapting to Panoramic Semantic Segmentation
J. Zhang, K. Yang, C. Ma, S. Reiß, K. Peng, R. Stiefelhagen.
CVPR 2022 Paper Code

CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation With Transformers
J. Zhang*, H. Liu*, K. Yang*, X. Hu, R. Liu, R. Stiefelhagen.
IEEE Trans. on Intelligent Transportation Systems ( T-ITS) 2023 Paper Code

Trans4Trans: Efficient Transformer for Transparent Object and Semantic Scene Segmentation in Real-World Navigation Assistance
J. Zhang, K. Yang, A. Constantinescu, K. Peng, K. Müller, R. Stiefelhagen.
IEEE Trans. on Intelligent Transportation Systems ( T-ITS) 2022 Paper Code

Exploring Event-Driven Dynamic Context for Accident Scene Segmentation
J. Zhang, K. Yang, R. Stiefelhagen.
IEEE Trans. on Intelligent Transportation Systems ( T-ITS) 2021 Paper Code Dataset

Transfer beyond the Field of View: Dense Panoramic Semantic Segmentation via Unsupervised Domain Adaptation
J. Zhang, C. Ma, K. Yang, A. Roitberg, K. Peng, R. Stiefelhagen.
IEEE Trans. on Intelligent Transportation Systems ( T-ITS) 2021 Paper Code Dataset

360BEV: Panoramic Semantic Mapping for Indoor Bird's-Eye View
Z. Teng*, J. Zhang*†, K. Yang, K. Peng, H. Shi, S. Reiß, K. Cao, R. Stiefelhagen.
WACV 2024 Project page Paper Code Dataset

Trans4Map: Revisiting Holistic Bird's-Eye-View Mapping from Egocentric Images to Allocentric Semantics with Vision Transformers
C. Chen, J. Zhang†, K. Yang, K. Peng, R. Stiefelhagen.
WACV 2023 Paper Code

MatchFormer: Interleaving Attention in Transformers for Feature Matching
Q. Wang*, J. Zhang*, K. Yang, K. Peng, R. Stiefelhagen.
ACCV 2023 Paper Code

Capturing Omni-Range Context for Omnidirectional Segmentation
K. Yang, J. Zhang, S. Reiß, X. Hu, R. Stiefelhagen
CVPR 2021 Paper Code

ISSAFE: Improving semantic segmentation in accidents by fusing event-based data
J. Zhang, K. Yang, R. Stiefelhagen
IROS 2021 Paper Code Dataset

Flying Guide Dog: Walkable Path Discovery for the Visually Impaired Utilizing Drones and Transformer-based Semantic Segmentation
H. Tan, C. Chen, X. Luo, J. Zhang, C. Seibold, K. Yang, R. Stiefelhagen.
IEEE ROBIO 2021 Paper Code Video

HIDA: Towards Holistic Indoor Understanding for the Visually Impaired via Semantic Instance Segmentation with a Wearable Solid-State LiDAR Sensor
H. Liu, R. Liu, K. Yang, J. Zhang, K. Peng, R. Stiefelhagen
ICCV Workshop on Assistive Computer Vision and Robotics ( ACVR) 2021 Paper

Trans4Trans: Efficient Transformer for Transparent Object Segmentation to Help Visually Impaired People Navigate in the Real World
J. Zhang, K. Yang, A. Constantinescu, K. Peng, K. Müller, R. Stiefelhagen
ICCV Workshop on Assistive Computer Vision and Robotics ( ACVR) 2021 Paper Code

DensePASS: Dense Panoramic Semantic Segmentation via Unsupervised Domain Adaptation with Attention-Augmented Context Exchange
C. Ma, J. Zhang, K. Yang, A. Roitberg, R. Stiefelhagen
IEEE ITSC 2021 Paper Code Dataset

Pose2Drone: A Skeleton-Pose-based Framework for Human-Drone Interaction
Z. Marinov, S. Vasileva, Q. Wang, C. Seibold, J. Zhang, R. Stiefelhagen
IEEE EUSIPCO 2021 Paper Code

Panoptic Lintention Network: Towards Efficient Navigational Perception for the Visually Impaired
Wei Mao*, J. Zhang*, K. Yang, R. Stiefelhagen
IEEE RCAR 2021 Paper Code

Perception Framework through Real-Time Semantic Segmentation and Scene Recognition on a Wearable System for the Visually Impaired
Y. Zhang, H. Chen, K. Yang, J. Zhang, R. Stiefelhagen
IEEE RCAR 2021 Paper

Teaching

Lectures

Teaching Assistant, Deep Learning for Computer Vision I: Basics, SS 2024, SS 2025

Teaching Assistant, Deep Learning for Computer Vision II: Advanced Topics, WS 21/22, WS 22/23, WS 23/24

Teaching Assistant, Practical Course: Computer Vision for HCI, WS 20/21, SS 2021, SS 2022, SS 2023, SS 2024, SS 2025

Flying Guide Dog: Walkable Path Discovery for the Visually Impaired Utilizing Drones and Transformer-based Semantic Segmentation, Paper Code Video
Pose2Drone: A Skeleton-Pose-based Framework for Human-Drone Interaction, Paper Code

Teaching Assistant, Seminar: Computer Vision for HCI, WS 20/21, WS 21/22, WS 23/24, WS 24/25

Teaching Assistant, Seminar: Multimodal Large Language Models, SS 2024

Supervision

Since 2023 Junwei Zheng. PhD student at KIT. Vision-Language Navigation. Co-supervised with Prof. Rainer Stiefelhagen.
Since 2023 Ruiping Liu. PhD student at KIT. Multimodal Scene Understanding. Co-supervised with Prof. Rainer Stiefelhagen.
Since 2023 Yufan Chen. PhD student at KIT. Document Analysis. Co-supervised with Prof. Rainer Stiefelhagen.
Since 2024 Fei Teng. PhD student at HNU. Panoramic Understanding. Co-supervised with Prof. Kailun Yang.
2025 Mar. Sebastian Tewes. Master student (co-supervised). Source-free Document Layout Analysis. Paper Code
2025 Jan. Alexander Vogel. Master student (co-supervised). RefChartQA: Grounding Reasoning on Chart Images through Instruction-tuning.
2025 Jan. Jie Hu. Master student (co-supervised). Deformable Mamba for Wide Field of View Segmentation. Paper Code
2024 Nov. Qihao Yuan. Master student. Solving Zero-Shot 3D Visual Grounding as Constraint Satisfaction Problems. Paper Code
2024 Jul. Jiale Wei. Master student. OneBEV: Using One Panoramic Image for Bird's-Eye-View Semantic Mapping. Paper Code
2024 Feb. Xin Jiang. Master student. Unified Vision-Language Models for Assistive Technology. Paper Code
2024 Feb. Jonas Schmitt. Master student. Global Hessian-Based Importance Pruning of Neural Networks in Combination with Knowledge Distillation. Paper Code
2023 Nov. Daniel Bucher. Master student (co-supervised). Improving Robustness of 3D Semantic Sementation with Transformer-based Fusion and Knowledge Distillation. Paper
2023 Aug. Leon Kanstinger. Bachelor student. Improving Accessibility of User Interface in Mobility Assistance Systems.
2023 Juli. Fei Teng. Master student (co-supervised). OAFuser: Towards Omni-Aperture Fusion for Light Field Semantic Segmentation. Paper Code
2023 Apr. Zhifeng Teng. Master student. 360BEV: Panoramic Semantic Mapping for Indoor Bird's-Eye View. Paper Code Page
2022 Aug. Chang Chen. Master student. Transformer-based Mapping from Egocentric Images to Top-view Semantics for Scene Understanding. Paper Code Video
2022 Feb. Qing Wang. Master student. MatchFormer: Interleaving Attention in Transformers for Feature Matching. Paper Code
2021 Oct. Chaoxiang Ma. Master student (co-supervised). Unsupervised Domain Adaptation for Panoramic Semantic Segmentation. Paper Code

Invited Talks

2025 Jan. Multimodal Scene Understanding for Mobility Assistance HKUST (Guangzhou)
2024 Dec. Towards Holistic and Robust Visual Assistive Systems Jihua Lab
2024 Dec. Multimodal Scene Understanding for Inclusive Mobility ETH Zurich
2024 Dec. Towards Holistic and Robust Visual Assistive Systems SCUT
2024 Dec. Intelligent Visual Assistance Southeast University
2024 Nov. Multimodal Scene Understanding for Mobility Assistance HNU
2024 Oct. Intelligent Assistance System Based on Scene Understanding GCI Germany
2023 Oct. Vision4Blind : Assistance Systems for People with Visual Impairments ICCV Demo
2023 Sep. Scene Understanding for Intelligent Transportation Systems University of Oxford
2022 Jul. Scene Understanding for Mobility Assistance Helmholtz Workshop, KIT
2021 Oct. Efficient Transformer for Transparent Object Segmentation ACVR
2021 Sep. Improving Semantic Segmentation in Accidents by Fusing Event-based Data IROS

Awards

IEEE ITSS Germany Dissertation Award (The First Price), 2024.

KIT Doctoral Award, 2024.

ACCV 2024 Best Paper Finalist, 2024.

KIT KHYS Research Travel Grant, 2024.

ICM Future Mobility Grants, 2024.

IEEE ICRA 2024 HRI Best Paper Finalist, 2024.

DAAD IFI Program Fellowship, 2023.

KIT Computer Science The Best Practical Course (Teaching Award), 2021.

Services

Associate Editors:
IEEE RA-L, IEEE IV 2024, IEEE IV 2025, IEEE ITSC 2025, IEEE ICVES 2025
Journal Reviewers:
T-PAMI, T-RO, T-IP, IJCV, CVIU, TNNLS, T-ITS, RA-L, T-IV, TCSVT, IJHCI, TGRS, T-ASE
Conference Reviewers:
CVPR, ICCV, ECCV, ICML, ICLR, NeurIPS, AAAI, IJCAI, SIGGRAPH, ACMMM, ACCV, WACV, BMVC, ICRA, IROS, ITSC, IV
Workshop Co-organizers:
BSL@IV2022, iCARE@CoRL2025

Contact

jiamingzhang@hnu.edu.cn
Hunan University (HNU)
School of AI and Robotics
Fenghuangshan Road 66,
410082 Yuelu District, Changsha, Hunan Province
Visitor traffic