用於3D物體辨識基於視圖的注意力圖卷積監督式對比學習神經網路

以作者查詢圖書館館藏

、以作者查詢臺灣博碩士

、以作者查詢全國書目

、勘誤回報

、線上人數：18

、訪客IP：18.191.178.45

姓名

涂珮涓(Pei-Chuan Tu) 查詢紙本館藏

畢業系所

工業管理研究所

論文名稱

用於3D物體辨識基於視圖的注意力圖卷積監督式對比學習神經網路

相關論文

★ 應用失效模式效應分析於產品研發時程之改善	★ 服務品質因子與客戶滿意度關係研究-以汽車保修廠服務為例
★ 家庭購車決策與行銷策略之研究	★ 計程車車隊派遣作業之研究
★ 電業服務品質與服務失誤之探討-以台電桃園區營業處為例	★ 應用資料探勘探討筆記型電腦異常零件-以A公司為例
★ 車用配件開發及車主購買意願探討(以C公司汽車配件業務為實例)	★ 應用田口式實驗法於先進高強度鋼板阻抗熔接條件最佳化研究
★ 以層級分析法探討評選第三方物流服務要素之研究-以日系在台廠商為例	★ 變動良率下的最佳化批量研究
★ 供應商庫存管理架構下運用層級分析法探討供應商評選之研究-以某電子代工廠為例	★ 台灣地區快速流通消費產品銷售預測模型分析研究–以聯華食品可樂果為例
★ 競爭優勢與顧客滿意度分析以中華汽車為例	★ 綠色採購導入對電子代工廠的影響-以A公司為例
★ 以德菲法及層級分析法探討軌道運輸業之供應商評選研究–以T公司為例	★ 應用模擬系統改善存貨管理制度與服務水準之研究-以電線電纜製造業為例

檔案

[Endnote RIS 格式]

[Bibtex 格式]

[相關文章]

[文章引用]

[完整記錄]

[館藏目錄]

[檢視]

[下載]

本電子論文使用權限為同意立即開放。
已達開放權限電子全文僅授權使用者為學術研究之目的，進行個人非營利性質之檢索、閱讀、列印。
請遵守中華民國著作權法之相關規定，切勿任意重製、散佈、改作、轉貼、播送，以免觸法。

摘要(中)

工業革命自18、19世紀興起，歐美國家透過機器取代手工生產，演進出四次工業革命而目前正處於第四次。本研究針對工業革命的核心自動化，以提高生產效率、降低成本、提升品質為目標，特別關注於製造業中應用的機器視覺系統。傳統三維物體辨識方法多利用二維多視角圖片，但未充分利用多視角圖片間的相關性，以及現實生活中的拍攝環境可能會影響圖片品質增加模型辨識難度。因此，本研究旨在提出一套辨識三維產品的系統，包括基於視圖的圖卷積神經網路、圖片重要特徵提取以及對比學習訓練方法。具體目標為提高辨識效能、提升對圖片重點的捕捉能力以及增強在現實生活中的穩健性。為達成此目的，本研究將採用有效聚合多視角圖片訊息的基於視圖的圖卷積神經網路、注意力機制以提取重要特徵資訊，以及監督式對比學方法來訓練神經網路以提升模型泛化能力。這些方法的詳細內容將在後續章節中詳細探討。

摘要(英)

The Industrial Revolution emerged in the 18th and 19th centuries, during which European and American countries replaced manual labor with machines, leading to four distinct industrial revolutions, with the current era being the fourth. This study focuses on the core of the Industrial Revolution, automation, aiming to improve production efficiency, reduce costs, and enhance quality, particularly through the application of machine vision systems in the manufacturing industry. Traditional methods of three-dimensional object recognition often utilize two-dimensional multi-view images but fail to fully exploit the correlation between these images and the potential impact of real-life shooting conditions on image quality, thereby increasing the difficulty of model recognition. Therefore, this study aims to propose a system for recognizing three-dimensional products, comprising a view-based convolutional neural network, feature extraction from images, and contrastive learning training methods. The specific objectives are to improve recognition efficiency, enhance the capture of key features in images, and strengthen robustness in real-life scenarios. To achieve these goals, the study will adopt a view-based convolutional neural network that effectively aggregates information from multiple-view images, an attention mechanism to extract important feature information, and supervised contrastive learning methods to train neural networks and enhance model generalization capabilities. The detailed implementation of these methods will be discussed in subsequent chapters.

關鍵字(中)

★ 工業自動化
★ 多視圖三維物體辨識
★ 注意力機制
★ 對比學習

關鍵字(英)

★ automated industry
★ multi-view 3D object recognition
★ attention mechanism
★ contrastive learning

論文目次

目錄
摘要 ii
Abstract iv
目錄 v
表目錄 2
第一章緒論 3
1.1 研究背景與動機 3
1.2 研究挑戰 4
1.3 研究目的 4
1.4 研究方法 5
第二章文獻回顧 6
2.1 神經網路 6
2.1.1殘差網路 6
2.1.2基於視圖的圖卷積神經網路 8
2.2 注意力機制 9
2.3 對比學習 10
第三章方法論 13
3.1 基於視圖的圖卷積網路模型 14
3.1.1 ResNet18預訓練 14
3.1.2 圖卷積層 16
3..1.3 訊息傳遞層 16
3.1.4 選擇性視圖採樣層 17
3.2 注意力機制 18
3.3 監督式對比學習 19
第四章實驗 22
4.1 數據集和前處理 22
4.1.1實際產品資料集 22
4.1.2 渲染資料集 23
4.1.3 ModelNet40資料集 24
4.2 實驗設置 25
4.3 實驗結果與分析 25
4.3.1 實驗一: 探討模型辨識實際圖片穩健性 25
4.3.2實驗二: 探討模型學習多類型圖片特徵效能 27
4.3.3實驗三: 探討模型在辨識公開資料集效能 29
第五章結論 31
參考文獻 32

參考文獻

參考文獻
[1] Bahdanau, D., K. Cho & Y. Bengio (2014). Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473.
[2] Fukushima, K. (1980). Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biological cybernetics, 36(4), 193-202.
[3] Golnabi, H. & A. Asadpour (2007). Design and application of industrial machine vision systems. Robotics and Computer-Integrated Manufacturing, 23(6), 630-637.
[4] He, K., X. Zhang, S. Ren & J. Sun (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770-778).
[5] Khosla, P., P. Teterwak, C. Wang, A. Sarna, Y. Tian, P. Isola, ... & D. Krishnan (2020). Supervised contrastive learning. Advances in neural information processing systems, 33, 18661-18673.
[6] Kipf, T. N., & M. Welling (2016). Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907.
[7] Krizhevsky, A., I. Sutskever & G. E. Hinton (2012). Imagenet classification with deep convolutional neural networks. Communications of the ACM, 60(6), 84-90.
[8] LeCun, Y., L. Bottou, Y. Bengio & P. Haffner (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11), 2278-2324.
[9] Mnih, V., N. Heess, & A. Graves (2014). Recurrent models of visual attention. Advances in neural information processing systems, 27.
[10] Niu, Z., G. Zhong & H. Yu (2021). A review on the attention mechanism of deep learning. Neurocomputing, 452, 48-62.
[11] Qi, C. R., Yi, L., Su, H., & L. J. Guibas (2017). Pointnet++: Deep hierarchical feature learning on point sets in a metric space. Advances in neural information processing systems, 30.
[12] Simonyan, K., & A. Zisserman (2014). Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556.
[13] Su, H., S. Maji, E. Kalogerakis, & E. Learned-Miller (2015). Multi-view convolutional neural networks for 3d shape recognition. In Proceedings of the IEEE international conference on computer vision (pp. 945-953).
[14] Szegedy, C., W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, ... & A. Rabinovich (2015). Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1-9).
[15] Thoben, K. D., S. Wiesner, & T. Wuest (2017). “Industrie 4.0” and smart manufacturing-a review of research issues and application examples. International journal of automation technology, 11(1), 4-16.
[16] Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, ... & I. Polosukhin (2017). Attention is all you need. Advances in neural information processing systems, 30.
[17] Wei, X., R. Yu & J. Sun (2020). View-gcn: View-based graph convolutional network for 3d shape analysis. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 1850-1859).
[18] Wu, Z., Y. Xiong, S. X. Yu & D. Lin (2018). Unsupervised feature learning via non-parametric instance discrimination. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3733-3742).
[19] Zeiler, M. D., & R. Fergus (2014). Visualizing and understanding convolutional networks. In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part I 13 (pp. 818-833). Springer International Publishing.

指導教授

葉英傑(Yin-Gjie Ye)

審核日期

2024-7-22

推文