Name 陳元娣 (Yuan-Di Chen)   Department Graduate Institute of Software Engineering
Thesis Title 結合前景感知與多尺度注意力機制之語意分割模型應用於土石流偵測
(Foreground-Aware and Multi-scale Convolutional Attention Mechanism for Remote Sensing Images Semantic Segmentation in Landslide Detection)
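Abstract (Chinese) With advances in satellite and unmanned aerial vehicle (UAV) technology, high-resolution remote sensing imagery is increasingly easy to obtain, which has led to extensive research on and application of remote sensing images in many fields. Remote sensing image semantic segmentation is a special semantic segmentation task: besides the multi-scale challenge, it has two distinctive difficulties. The first is an extremely imbalanced foreground-background distribution; the second is the coexistence of many small objects with complex backgrounds. Existing semantic segmentation methods, however, mainly study scale variation in natural scenes, overlook the problems specific to remote sensing images, and lack foreground modeling. To address these problems, this thesis proposes a foreground-aware remote sensing semantic segmentation model.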
Abstract (English) As satellite and aerial camera technology advances, high-resolution remote sensing images have become easier to acquire, leading to widespread research and applications in various fields. Remote sensing image semantic segmentation is a crucial task that provides semantic and localization information for target objects. Besides the large scale variation common to most semantic segmentation datasets, aerial images present unique challenges, including high background complexity and imbalanced foreground-background ratios. However, general semantic segmentation methods primarily address scale variation in natural scenes; they often neglect the challenges specific to remote sensing images and lack adequate foreground modeling.
In this thesis, we present a foreground-aware remote sensing semantic segmentation model. To handle objects at multiple scales, the model introduces a multi-scale convolutional attention mechanism and uses a Feature Pyramid Network (FPN) architecture to extract multi-scale features. Additionally, a foreground-scene relation module enhances foreground features by modeling the relationship between the foreground and the scene, which mitigates false alarms. In the loss function, a Soft Focal Loss focuses training on foreground samples, alleviating the foreground-background imbalance.
Experimental results indicate that the proposed method surpasses current state-of-the-art general semantic segmentation and transformer-based methods on the LS dataset benchmark while balancing speed and accuracy.
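The abstract names a Soft Focal Loss but does not define it, so the sketch below shows only the standard binary focal loss that such losses build on, not the thesis's own formulation. It illustrates how a focusing term down-weights pixels that are already classified well, which is the general idea behind countering foreground-background imbalance. The function name `binary_focal_loss` and the default `gamma=2.0` are assumptions chosen for illustration.

```python
import math


def binary_focal_loss(probs, labels, gamma=2.0, eps=1e-7):
    """Mean standard binary focal loss over per-pixel predictions.

    Illustrative stand-in only: this is NOT the thesis's Soft Focal Loss.

    probs  : predicted foreground probabilities, each in [0, 1]
    labels : ground-truth labels (1 = foreground, 0 = background)
    gamma  : focusing parameter; gamma = 0 reduces to cross-entropy
    """
    total = 0.0
    count = 0
    for p, y in zip(probs, labels):
        # p_t is the probability the model assigns to the true class.
        p_t = p if y == 1 else 1.0 - p
        # Clamp to avoid log(0) on fully confident wrong predictions.
        p_t = min(max(p_t, eps), 1.0 - eps)
        # (1 - p_t)^gamma shrinks the loss of well-classified pixels.
        total += -((1.0 - p_t) ** gamma) * math.log(p_t)
        count += 1
    if count == 0:
        raise ValueError("probs and labels must be non-empty")
    return total / count
```

With `gamma=0` the function returns ordinary cross-entropy. Raising `gamma` reduces the contribution of confident, correct pixels (typically the abundant background) relative to hard ones.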
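Keywords (Chinese) ★ Remote Sensing Semantic Segmentation
★ Feature Pyramid Network
★ Convolutional Attention Mechanism
★ Multi-scale Feature Fusion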
Keywords (English) ★ Remote Sensing
★ Semantic Segmentation
★ Convolutional Attention Mechanism
★ Multi-scale Feature Fusion
Table of Contents Abstract
Abstract (Chinese)
Table of Contents
List of Figures
List of Tables
1 Introduction
1.1 Research Background and Motivation
1.2 Thesis Organization
2 Related Work
2.1 Background and Development
2.1.1 Semantic Segmentation
2.1.2 Remote Sensing Semantic Segmentation
2.2 Related Techniques
2.2.1 VAN
2.2.2 GoogLeNet
2.2.3 HRNet
2.2.4 SegFormer
2.2.5 SegNeXt
2.2.6 FPN
2.2.7 FarSeg
3 Methodology
3.1 Model Architecture
3.1.1 Encoder
3.1.2 Foreground-Scene Relation Module
3.1.3 Decoder
3.2 Loss Function
4 Experimental Results
4.1 Implementation Details
4.2 Datasets
4.2.1 LS Dataset
4.2.2 Bijie Dataset
4.3 Evaluation Metrics
4.3.1 Precision and Recall
4.3.2 IoU (Intersection over Union)
4.3.3 Pixel Accuracy
4.3.4 F1 Score
4.4 Main Results
4.5 Ablation Study
5 Conclusion
References
Advisor 鄭旭詠 (Hsu-Yung Cheng)   Approval Date 2024-7-22
