(Data Augmentation with Semantic Information in Visual Localization)
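Abstract (Chinese) We propose a method that combines data augmentation with semantic information to reduce the increased localization error that dynamic environments cause in visual localization. Visual localization is essential in applications such as autonomous vehicles, robotics, and augmented reality (AR) / virtual reality (VR). In dynamic environments, however, especially those with frequent human movement, localization accuracy and stability often drop significantly.
Abstract (English) We propose a method that combines data augmentation techniques with semantic information to address the increased positioning errors that dynamic environments cause in visual localization. Visual localization is crucial in applications such as autonomous vehicles, robotics, and augmented reality (AR) / virtual reality (VR). However, in dynamic environments, especially where there is frequent human movement, localization accuracy and stability often decline significantly.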
To solve this problem, we adopted the Random Erasing technique from data augmentation. Random Erasing simulates object movement or occlusion by randomly masking parts of the image, allowing the model to learn more diverse features and improve its robustness in dynamic environments. However, we believe the model should learn more useful features. Therefore, we further integrated semantic segmentation techniques to extract human regions in the images and applied special processing to these areas.
This combined approach aims to enhance the model's adaptability in dynamic environments, ensuring localization accuracy in practical applications.
We conducted experiments on our own datasets, recorded in indoor environments such as factories, with varying dynamic characteristics. Experimental results show that this method reduces localization errors caused by human movement. In areas with human movement, our method reduces translation errors by at least 35.8% and improves system stability. Additionally, in static environments, our method maintains high accuracy, demonstrating its adaptability across various settings.
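The augmentation described in the abstract can be illustrated with a short sketch. The abstract does not say what "special processing" is applied to human regions, so the code below assumes the simplest option: with some probability, every pixel labelled as human is overwritten with a constant value. The function names, the parameters (`p`, `fill`, `area_range`, `aspect_range`), and the use of NumPy are illustrative choices, not the thesis implementation. `random_erase` shows plain Random Erasing for comparison.

```python
import numpy as np


def random_erase(image, rng, area_range=(0.02, 0.2), aspect_range=(0.3, 3.3), fill=0):
    """Plain Random Erasing: overwrite one randomly placed rectangle."""
    h, w = image.shape[:2]
    out = image.copy()
    for _ in range(10):  # retry until a rectangle fits inside the image
        area = rng.uniform(*area_range) * h * w
        aspect = rng.uniform(*aspect_range)
        eh = int(round(np.sqrt(area * aspect)))
        ew = int(round(np.sqrt(area / aspect)))
        if 0 < eh < h and 0 < ew < w:
            top = rng.integers(0, h - eh + 1)
            left = rng.integers(0, w - ew + 1)
            out[top:top + eh, left:left + ew] = fill
            break
    return out


def semantic_erase(image, human_mask, rng, p=0.5, fill=0):
    """Assumed semantic variant: with probability p, erase all pixels
    that a segmentation model labelled as human (human_mask is boolean, HxW)."""
    out = image.copy()
    if rng.random() < p:
        out[human_mask] = fill
    return out


# Toy usage: an 8x8 white image with a hypothetical 3x3 "human" region.
rng = np.random.default_rng(0)
img = np.full((8, 8, 3), 255, dtype=np.uint8)
mask = np.zeros((8, 8), dtype=bool)
mask[2:5, 3:6] = True
augmented = semantic_erase(img, mask, rng, p=1.0)
```

In a real pipeline the mask would come from a semantic segmentation network, and the erasing probability would be tuned experimentally, as the thesis does in its section on the impact of different erasing probabilities.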
Keywords (Chinese) ★ Visual Localization
★ Data Augmentation
★ Semantic Segmentation
★ Random Erasing
Keywords (English) ★ Visual Localization
★ Data Augmentation
★ Semantic Segmentation
★ Random Erasing
Table of Contents Chinese abstract i
English abstract ii
Table of contents
1 Introduction 1
2 Related Works 4
2.1 Data Augmentation 4
2.2 Data Augmentation in Visual Localization 5
2.3 Semantic Information in Visual Localization 5
3 Semantic Augmentation 6
3.1 Erasing region 7
3.2 Erasing Probability 9
3.3 Model and Training Strategy 10
4 Dataset 12
4.1 Hardware Setting 12
4.2 Sequence Information 12
4.3 Data Pre-Processing 13
5 Experimental Setting and Results 15
5.1 Data 15
5.2 Experimental Setup 16
5.3 Impact of different erasing probability 16
5.4 Testing Set Errors 17
5.5 Testing Set Errors in Human Active Section 19
5.6 Testing Set Errors in Non-human Active Section 22
5.7 Compare with Grid Erasing Method 22
6 Conclusion and Future Work 24
6.1 Conclusion 24
6.2 Future Work 24
Bibliography 25
