Shan Wang

I am a Computer Vision PhD student at the Australian National University (ANU), under the guidance of Dr. Chuong Nguyen and Prof. Hongdong Li. My research focuses on autonomous driving and 3D reconstruction. Prior to joining ANU, I accumulated 16 years of experience as an embedded software engineer in the automotive industry.

Email  /  Google Scholar  /  LinkedIn  /  GitHub

Publications
View Consistent Purification for Accurate Cross-View Localization
Shan Wang, Yanhao Zhang, Akhil Perincherry, Ankit Vora, and Hongdong Li
ICCV, 2023
paper / project page

This paper proposes a fine-grained self-localization method for outdoor robotics that utilizes a flexible number of onboard cameras and readily accessible satellite images. The proposed method addresses limitations in existing cross-view localization methods that struggle to handle noise sources such as moving objects and seasonal variations, achieving significant performance improvement.

Homography Guided Temporal Fusion for Road Line and Marking Segmentation
Shan Wang, Chuong Nguyen, Jiawei Liu, Kaihao Zhang, Wenhan Luo, Yanhao Zhang, Sundaram Muthu, Fahira Afzal Maken, and Hongdong Li
ICCV, 2023
paper / code

Reliable segmentation of road lines and markings is critical to autonomous driving. Our work is motivated by the observations that road lines and markings are (1) frequently occluded in the presence of moving vehicles, shadows, and glare and (2) highly structured, with low intra-class shape variance and high overall appearance consistency. Building on these observations, we propose a Homography Guided Fusion (HomoFusion) module that exploits temporally adjacent video frames for complementary cues, facilitating the correct classification of partially occluded road lines and markings.

Model Calibration in Dense Classification with Adaptive Label Perturbation
J Liu, C Ye, S Wang, R Cui, J Zhang, K Zhang, N Barnes
ICCV, 2023
paper / code

For safety-related applications, it is crucial to produce trustworthy deep neural networks whose prediction is associated with confidence that can represent the likelihood of correctness for subsequent decision-making. Existing dense binary classification models are prone to being over-confident. To improve model calibration, we propose Adaptive Stochastic Label Perturbation (ASLP) which learns a unique label perturbation level for each training image.

Satellite image based cross-view localization for autonomous vehicle
Shan Wang, Yanhao Zhang, Ankit Vora, Akhil Perincherry, and Hongdong Li
ICRA, 2023
paper / project page

Existing spatial localization techniques for autonomous vehicles mostly use a pre-built 3D-HD map, often constructed using a survey-grade 3D mapping vehicle, which is both expensive and laborious. This paper shows that by using an off-the-shelf high-definition satellite image as a ready-to-use map, we are able to achieve cross-view vehicle localization to a satisfactory accuracy, providing a cheaper and more practical way to localize. Our method is validated using the KITTI and Ford Multi-AV Seasonal datasets for the ground view and Google Maps for the satellite view. The results demonstrate the superiority of our method in cross-view localization, with median spatial and angular errors within 1 meter and 1°, respectively.

CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization
Yujiao Shi, Xin Yu, Shan Wang, and Hongdong Li
ACCV, 2022
paper / code

This work addresses city-scale satellite image-based camera localization by using a sequence of ground-view images.

Template from Jon Barron's website.