Shan Wang

I am a Computer Vision PhD student at Australian National University (ANU), under the guidance of Dr. Chuong Nguyen and Prof. Hongdong Li. My research focuses on Autonomous Driving and 3D reconstruction. Prior to joining ANU, I accumulated 16 years of experience as an embedded software engineer in the automobile industry.

Email / Google Scholar / Linkedin / Github

Publications

	View Consistent Purification for Accurate Cross-View Localization Shan Wang, Yanhao Zhang, Akhil Perincherry, Ankit Vora, and Hongdong Li ICCV, 2023 paper / project page This paper proposes a fine-grained self-localization method for outdoor robotics that utilizes a flexible number of onboard cameras and readily accessible satellite images. The proposed method addresses limitations in existing cross-view localization methods that struggle to handle noise sources such as moving objects and seasonal variations, achieving significant performance improvement.
	Homography Guided Temporal Fusion for Road Line and Marking Segmentation Shan Wang, Chuong Nguyen, Jiawei Liu, Kaihao Zhang, Wenhan Luo, Yanhao Zhang, Sundaram Muthu, Fahira Afzalmaken and Hongdong Li ICCV, 2023 paper / code Reliable segmentation of road lines and markings is critical to autonomous driving. Our work is motivated by the observations that road lines and markings are (1) frequently occluded in the presence of moving vehicles, shadow, and glare and (2) highly structured with low intra-class shape variance and overall high appearance consistency. To solve these issues, we propose a Homography Guided Fusion (HomoFusion) module to exploit temporally-adjacent video frames for complementary cues facilitating the correct classification of the partially occluded road lines or markings.
	Model Calibration in Dense Classification with Adaptive Label Perturbation J Liu, C Ye, S Wang, R Cui, J Zhang, K Zhang, N Barnes ICCV, 2023 paper / code For safety-related applications, it is crucial to produce trustworthy deep neural networks whose prediction is associated with confidence that can represent the likelihood of correctness for subsequent decision-making. Existing dense binary classification models are prone to being over-confident. To improve model calibration, we propose Adaptive Stochastic Label Perturbation (ASLP) which learns a unique label perturbation level for each training image.
	Satellite image based cross-view localization for autonomous vehicle Shan Wang, Yanhao Zhang, Ankit Vora, Akhil Perincherry, and Hongdong Li ICRA, 2023 paper / project page Existing spatial localization techniques for autonomous vehicles mostly use a pre-built 3D-HD map, often constructed using a survey-grade 3D mapping vehicle, which is not only expensive but also laborious. This paper shows that by using an off-the-shelf high-definition satellite image as a ready-to-use map, we are able to achieve cross-view vehicle localization up to a satisfactory accuracy, providing a cheaper and more practical way for localization. Our method is validated on KITTI and Ford Multi-AV Seasonal datasets as ground view and Google Maps as the satellite view. The results demonstrate the superiority of our method in cross-view localization with median spatial and angular errors within 1 meter and 1∘, respectively.
	CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization Yujiao Shi, Xin Yu, Shan Wang, and Hongdong Li ACCV, 2022 paper / code This work addresses city-scale satellite image-based camera localization by using a sequence of ground-view images.

Template from Jon Barron's website.