Application of Convolutional Neural Networks to Aerial Images for Parcel-Based Rice Interpretation

Name

Chao-Wei Chen (陳昭瑋)

Department

Computer Science and Information Engineering

Thesis title

Application of Convolutional Neural Networks to Aerial Images for Parcel-Based Rice Interpretation
(應用卷積神經網路於航攝影像做基於坵塊的水稻判釋之研究)

Full text in the thesis system: never open to the public

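Abstract (translated from the Chinese)

Rice is one of Taiwan's most important crops. The government periodically needs to know the state of rice cultivation, such as planting areas and acreage, in order to estimate yields and set related policy. Traditionally, each remote sensing image is interpreted and digitized by manual annotation. With recent advances in artificial intelligence, using such techniques to assist experts in interpreting remote sensing images would reduce the demand for human labor. Our research team therefore uses deep learning to build a rice interpretation model that takes aerial images as input and outputs a rice/non-rice classification, reducing the need for people to inspect and annotate rice in aerial images.
After obtaining parcel vector maps, our team previously built on its earlier pixel-based UNet-VGG16 [1]: the parcel vector maps were added to the dataset, parcel information was extracted from them, and the goal was to achieve parcel-based interpretation with minimal changes. The result was the parcel-based UNet-FNN model [2]. Its design leaves UNet-VGG16 unmodified; instead, feature maps from the later layers of UNet-VGG16 are processed together with the parcel information to produce parcel-based data, which serve as input to a second model, an FNN (fully connected neural network). This achieves parcel-based interpretation with more accurate test results than the pixel-based UNet-VGG16. However, using parcel information in this way makes model training and testing too time-consuming, and the FNN design requires a large amount of data that conforms to certain rules.
This study uses parcel information differently from UNet-FNN: the parcel information is used directly to extract parcel-based image data from the aerial images. It also proposes a network architecture different from UNet-FNN, VGG16BN-G, which performs parcel-based interpretation directly on the parcel-based image data.
Apart from data that cannot be changed because of constraints in the UNet-FNN design, this study trains the UNet-VGG16 model, the UNet-FNN model, and the proposed VGG16BN-G model on training data that are as similar as possible.
The contribution of this study is a parcel-based rice interpretation model, VGG16BN-G. Under the same test design, t-test results show that VGG16BN-G needs only about 20% of the training time and about 6% of the training data of UNet-FNN to reach performance close to, and not significantly different from, that of UNet-VGG16 and UNet-FNN. Box plots of the evaluation metrics show that VGG16BN-G is about as stable as UNet-FNN and more stable than UNet-VGG16. Finally, drawing on our team's experience, we propose guidelines for preparing aerial images for rice interpretation, including a suggested workflow for checking image quality.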

Abstract (English)

Rice is one of the most important crops in Taiwan. The government periodically needs to know the status of rice planting, such as planting locations and planted area, for yield statistics and decision making. Traditionally, each remote sensing image is interpreted and digitized by manual annotation. With the recent development of artificial intelligence, using such technology to assist experts in interpreting remote sensing images can reduce the demand for human resources, reduce possible misjudgments caused by manual interpretation, and improve operational efficiency. Our team therefore uses deep learning to build a rice interpretation model that takes aerial images as input and outputs a classification result showing which areas are rice and which are not, reducing the need for people to view and annotate rice on aerial images.
After obtaining parcel vector map data, our team previously added the parcel vector maps to the dataset, extracted parcel information from them, and, starting from our earlier pixel-based UNet-VGG16 [1], tried to implement parcel-based interpretation with minimal changes. Rather than modifying the UNet-VGG16 model, the design takes feature maps from the later layers of UNet-VGG16 and processes them together with the parcel information to generate parcel-based data as input to a second model, an FNN (fully connected neural network). The resulting UNet-FNN model [2] achieves parcel-based interpretation with more accurate test results than the pixel-based UNet-VGG16. However, this way of using parcel information makes model training and testing too time-consuming, and the FNN design requires a large amount of data that conforms to certain rules.
This study uses parcel information differently from UNet-FNN: it uses the parcel information directly to extract parcel-based image data from aerial images, and it proposes a network architecture different from UNet-FNN, VGG16BN-G, which performs parcel-based interpretation on the parcel-based image data.
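The idea of using parcel information to extract parcel-based image data can be illustrated with a minimal sketch. It is not the thesis's implementation: it assumes a parcel is given as a simple polygon in pixel coordinates and the image as a nested list of pixel values, and the names `point_in_polygon` and `extract_parcel_image` are hypothetical. The sketch crops the parcel's bounding box and fills every pixel whose centre lies outside the polygon with a constant.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float]  # (x, y) in pixel coordinates; an assumed representation


def point_in_polygon(x: float, y: float, polygon: Sequence[Point]) -> bool:
    """Even-odd ray-casting test of a point against a simple polygon."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through y can be crossed.
        if (y1 > y) != (y2 > y):
            # x coordinate where the edge meets that horizontal line.
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def extract_parcel_image(image: List[list], polygon: Sequence[Point], fill=0) -> List[list]:
    """Crop the polygon's bounding box and mask pixels outside the polygon."""
    height, width = len(image), len(image[0])
    xs = [p[0] for p in polygon]
    ys = [p[1] for p in polygon]
    # Bounding box in whole pixels, clamped to the image extent.
    col_start = max(0, math.floor(min(xs)))
    col_stop = min(width, math.ceil(max(xs)))
    row_start = max(0, math.floor(min(ys)))
    row_stop = min(height, math.ceil(max(ys)))

    crop = []
    for row in range(row_start, row_stop):
        out_row = []
        for col in range(col_start, col_stop):
            # A pixel belongs to the parcel if its centre is inside the polygon.
            keep = point_in_polygon(col + 0.5, row + 0.5, polygon)
            out_row.append(image[row][col] if keep else fill)
        crop.append(out_row)
    return crop


if __name__ == "__main__":
    # Toy 5x5 "image" whose pixel values encode their position.
    image = [[r * 5 + c + 1 for c in range(5)] for r in range(5)]
    square_parcel = [(1, 1), (4, 1), (4, 4), (1, 4)]
    for line in extract_parcel_image(image, square_parcel):
        print(line)
```

In practice the parcel vector map would be in map coordinates and would first have to be projected into the pixel grid of the aerial image; that step is omitted here.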
The contribution of this study is the parcel-based rice interpretation model VGG16BN-G. Under the same test design, t-test results show that VGG16BN-G requires only about 20% of the training time and about 6% of the training data of UNet-FNN to achieve performance similar to that of UNet-VGG16 and UNet-FNN, with no significant difference. Box plots show that VGG16BN-G has stability similar to UNet-FNN and better than UNet-VGG16. Finally, based on the experience of our research team, we propose guidelines for preparing aerial images for rice interpretation, including a suggested workflow for checking image quality.
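The performance comparison relies on an evaluation metric that the table of contents names "Parcel-based kappa with area weight" (section 4-1), built on Cohen's kappa [10]. This record does not give the exact formula, so the following is only a sketch under one plausible assumption: each parcel contributes its area, rather than a count of one, to the agreement statistics. The function name `area_weighted_kappa` and its argument layout are hypothetical.

```python
from typing import Hashable, Sequence


def area_weighted_kappa(
    y_true: Sequence[Hashable],
    y_pred: Sequence[Hashable],
    areas: Sequence[float],
) -> float:
    """Cohen's kappa where every parcel is weighted by its area (assumed scheme)."""
    if not (len(y_true) == len(y_pred) == len(areas)):
        raise ValueError("y_true, y_pred and areas must have the same length")
    total = float(sum(areas))
    if total <= 0:
        raise ValueError("total parcel area must be positive")

    # Observed agreement: share of the total area that is labelled correctly.
    p_observed = sum(a for t, p, a in zip(y_true, y_pred, areas) if t == p) / total

    # Chance agreement: for each class, the product of its area share in the
    # ground truth and its area share in the predictions.
    p_chance = 0.0
    for label in set(y_true) | set(y_pred):
        true_share = sum(a for t, a in zip(y_true, areas) if t == label) / total
        pred_share = sum(a for p, a in zip(y_pred, areas) if p == label) / total
        p_chance += true_share * pred_share

    if p_chance == 1.0:
        # Only one class occurs anywhere; kappa is undefined, so this sketch
        # reports perfect agreement by convention.
        return 1.0
    return (p_observed - p_chance) / (1.0 - p_chance)


if __name__ == "__main__":
    truth = ["rice", "rice", "other", "other"]
    predicted = ["rice", "other", "other", "other"]
    parcel_areas = [2.0, 1.0, 1.0, 1.0]  # hypothetical areas, any consistent unit
    print(area_weighted_kappa(truth, predicted, parcel_areas))
```

With area weighting, misclassifying one large parcel lowers the score more than misclassifying several small ones, which matches the goal of estimating planted area.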

Keywords

★ Parcel
★ Aerial image
★ Rice interpretation
★ Semantic segmentation
★ Convolutional neural network

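Table of Contents

Abstract (Chinese)
Abstract (English)
Table of Contents
List of Figures
List of Tables
1. Introduction
1-1 Research Background
1-2 Research Motivation and Objectives
1-3 Research Contributions
1-4 Background Knowledge
1-4-1 Rice
1-4-2 Parcels
1-5 Thesis Organization
2. Related Work
2-1 Transfer Learning
2-2 VGG16
2-3 Global Average Pooling
2-4 Batch Normalization
2-5 Pixel-based Rice Interpretation Model UNet-VGG16
2-6 Parcel-based Rice Interpretation Model UNet-FNN
2-7 Cohen's Kappa Coefficient
3. Solution
3-1 System Architecture of the Proposed Method
3-2 Data Preprocessing
3-2-1 Data Preprocessing 1 (for UNet and UNet-FNN)
3-2-2 Data Preprocessing 2 (for the Model Proposed in This Study)
3-2-3 Image Enhancement
3-2-4 Parcel Map Preprocessing
3-2-5 K-means Clustering
3-2-6 Generating the Masked Parcel Dataset
3-2-7 Generating Parcel Images
3-2-8 Masking
3-3 Proposed Parcel-based Rice Interpretation Model
3-3-1 Network Architecture VGG16-D
3-3-2 Network Architecture VGG16-G
3-3-3 Network Architecture VGG16BN-G
3-3-4 Model Training and Testing Procedure
4. Experiments and Discussion of Results
4-1 Model Performance Evaluation Method: Parcel-based Kappa with Area Weight
4-2 Dataset
4-2-1 Aerial Images
4-2-2 Parcel Maps with Rice Labels
4-2-3 Image Types
4-3 Experiment Preparation
4-4 Experiment 1: Parcel-based Rice Interpretation Model
4-4-1 Motivation and Objectives
4-4-2 Problem Definition
4-4-3 Method
4-4-4 Results and Discussion
4-5 Experiment 2: Replacing the Dense Layer with a GAP Layer
4-5-1 Motivation and Objectives
4-5-2 Problem Definition
4-5-3 Method
4-5-4 Results and Discussion
4-6 Experiment 3: Adding a Batch Normalization Layer
4-6-1 Motivation and Objectives
4-6-2 Problem Definition
4-6-3 Method
4-6-4 Results and Discussion
4-7 Experiment 4: Adding Rare-Case Data to the Training Dataset
4-7-1 Motivation and Objectives
4-7-2 Problem Definition
4-7-3 Method
4-7-4 Results and Discussion
5. Conclusions and Future Work
5-1 Conclusions
5-1-1 Model Performance Comparison
5-1-2 Guidelines for Preparing Aerial Image Datasets for Rice Interpretation
5-2 Future Work
References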

References

[1] Wahyuni, Ida, et al. "Rice Semantic Segmentation Using Unet-VGG16: A Case Study in Yunlin, Taiwan." 2021 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS). IEEE, 2021.
[2] 施品妤. "結合語義分割與全連接網路做基於坵塊的水稻判釋之初步研究" [A preliminary study on parcel-based rice interpretation combining semantic segmentation and a fully connected network]. Master's thesis, National Central University, January 2022.
[3] 陳益凰, and 曾義星. "應用多時段衛星影像辨識水稻田之研究" [A study on identifying paddy rice fields using multi-temporal satellite images]. 航測及遙測學刊 4.3 (1999): 1-15.
[4] Pan, Sinno Jialin, and Qiang Yang. "A survey on transfer learning." IEEE Transactions on Knowledge and Data Engineering 22.10 (2009): 1345-1359.
[5] Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for large-scale image recognition." International Conference on Learning Representations. 2015.
[6] Tammina, Srikanth. "Transfer learning using VGG-16 with deep convolutional neural network for classifying images." International Journal of Scientific and Research Publications (IJSRP) 9.10 (2019): 143-150.
[7] Lin, Min, Qiang Chen, and Shuicheng Yan. "Network in network." arXiv preprint arXiv:1312.4400 (2013).
[8] Ioffe, Sergey, and Christian Szegedy. "Batch normalization: Accelerating deep network training by reducing internal covariate shift." International Conference on Machine Learning. PMLR, 2015.
[9] Ronneberger, Olaf, Philipp Fischer, and Thomas Brox. "U-net: Convolutional networks for biomedical image segmentation." International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, Cham, 2015.
[10] Cohen, Jacob. "A coefficient of agreement for nominal scales." Educational and Psychological Measurement 20.1 (1960): 37-46. doi: 10.1177/001316446002000104.
[11] Landis, J. Richard, and Gary G. Koch. "The measurement of observer agreement for categorical data." Biometrics 33.1 (1977): 159-174. doi: 10.2307/2529310.
[12] Hartigan, John A., and Manchek A. Wong. "Algorithm AS 136: A k-means clustering algorithm." Journal of the Royal Statistical Society. Series C (Applied Statistics) 28.1 (1979): 100-108.

Advisors

De-Ron Liang (梁德容), Chin-Chun Chang (張欽圳)

Approval date

2022-09-30
