基於深度學習之人員追蹤辨識系統

、線上人數：17

、訪客IP：3.17.77.161

姓名	洪?綸(Qi-Lun Hong) 查詢紙本館藏	畢業系所	電機工程學系
論文名稱	基於深度學習之人員追蹤辨識系統
檔案	[Endnote RIS 格式] [Bibtex 格式] [相關文章] [文章引用] [完整記錄] [館藏目錄] [檢視] [下載] 本電子論文使用權限為同意立即開放。已達開放權限電子全文僅授權使用者為學術研究之目的，進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定，切勿任意重製、散佈、改作、轉貼、播送，以免觸法。
摘要(中)	本論文旨在設計並實現多人員追蹤辨識系統，並在魚眼影像上進行分析，為了避免魚眼影像經過校正處理後會有部份影像被裁切掉，故我們不對魚眼影像做任何的校正處理。我們將魚眼攝影機架設於天花板上且視角向下拍攝，影像中的人員物件會隨著與魚眼影像中心點相對應的位置不同，而產生不同的樣貌姿態，為了能夠有效的偵測到魚眼影像中變形的人員物件，我們使用近年來迅速發展的深度學習演算法來達成人員追蹤辨識的目標，透過深度學習的技術讓機器從龐大的資料庫中，其中亦包含變形的人員圖片，學習並辨識圖片的特徵，其目的為解決魚眼影像所導致的變形問題且透過大量的資料訓練可強健網路對人員的姿態變化的適應能力，不因人員的姿態變化而需另行設計系統進行辨識。本文所實現之演算法主要分成兩大部份，第一部份為利用YOLOv2深度網路架構偵測魚眼影像中的人員物件並進行定位，第二部份為使用Deep SORT架構分析在連續影像中所偵測到的人員物件是否為同一人員並進行追蹤，並使用AlignedReID網路架構將已辨識出的人員再進行辨識，進而確認其人員的身份，整個系統的設計主要是藉由深度學習的技術所完成，進而強健此系統的功能。
摘要(英)	This thesis attempts to design and implement a multiple human tracking and identification system that is applicable to omnidirectional surveillance by a fisheye camera. In this work, a top-view fisheye camera is mounted on the ceiling. Image distortion correction is not performed on each image frame because the human objects far from the image center might be cropped after lens undistortion. The appearance of human varies depending on the position in an image, and thus it is hard to detect and recognize human by traditional approaches. Accordingly, the deep learning technique developed rapidly in recent years is employed for achieving efficient human tracking and recognition. By learning from a big amount of training images, the computer will have the ability of extracting features, detecting and identifying human. The main algorithm of the proposed deep neural network includes two stages of networks. The first part is a YOLO-based deep architecture for detecting and locating human objects. The second part combines location and appearance information to track an identical person. Through the entire process of tracking a person, the identification will be kept as the same by using a so-called AlignedReID network. From the experimental results, the efficiency and robustness of the proposed algorithm have been verified.
關鍵字(中)	★ 人員追蹤 ★ 魚眼影像 ★ 深度學習 ★ 人員再辨識	關鍵字(英)	★ human tracking and identification ★ fisheye camera ★ deep learning ★ YOLO ★ AlignedReID
論文目次	摘要 i Abstract ii 致謝 iii 目錄 iv 圖目錄 vi 表目錄 vii 第一章緒論 1 1.1 研究背景與動機 1 1.2 文獻回顧 1 1.3 論文目標 3 1.4 論文架構 3 第二章系統架構與硬體介紹 4 2.1 系統架構 4 2.2 硬體介紹 5 2.3 軟體介紹 6 第三章主要方法與演算法 7 3.1 深度學習追蹤辨識 7 3.2 YOLOv2偵測網路 9 3.3 追蹤處理與狀態估計 10 3.4 分配問題(Assignment Problem) 11 3.4.1運動訊息匹配 11 3.4.2外觀訊息匹配 11 3.4.3結合運動訊息與外觀訊息 12 3.4.4匈牙利演算法 13 3.5 匹配級聯(Matching Cascade) 13 3.6 AlignedReID 人員再辨識網路 14 3.6.1 網路架構 15 3.6.2 相互學習(mutual learning) 18 3.6.3 AlignedReID網路實現細節 19 3.7 應用於魚眼攝影機之處理 20 3.7.1 魚眼影像之半徑設置 21 3.7.2 人員再辨識網路之魚眼影像處理 22 第四章資料訓練 25 4.1 YOLOv2偵測網路之訓練資料 25 4.2 AlignedReID網路之訓練資料 25 第五章實驗結果 27 5.1 YOLOv2偵測網路驗證 27 5.1.1 驗證結果 28 5.2 Deep SORT追蹤驗證 30 5.2.1 驗證結果討論 33 5.3 人員追蹤辨識結果 33 第六章結論與未來展望 38 6.1 結論 38 6.2 未來展望 38 第七章參考文獻 40
參考文獻	[1] J. Bromley, J. W. Bentz, L. Bottou, I. Guyon, Y. LeCun, C. Moore, E. Sackinger, and R. Shah, "Signature verification using a “Siamese” time delay neural network," International Journal of Pattern Recognition and Artificial Intelligence, 1993. [2] S. Zagoruyko and N. Komodakis, "Learning to compare image patches via convolutional neural networks," 2015 IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, 2015, pp. 4353-4361. [3] E. Ahmed, M. Jones and T. K. Marks, "An improved deep learning architecture for person re-identification," 2015 IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, 2015, pp. 3908-3916. [4] L. Zhao, X. Li, Y. Zhuang and J. Wang, "Deeply-learned part-aligned representations for person re-identification," 2017 IEEE International Conference on Computer Vision, Venice, 2017, pp. 3239-3248. [5] H. Liu, J. Feng, M. Qi, J. Jiang and S. Yan, "End-to-end comparative attention networks for person re-Identification," in IEEE Transactions on Image Processing, vol. 26, no. 7, pp. 3492-3506, July 2017. [6] A. Hermans, L. Beyer, and B. Leibe, "In defense of the triplet loss for person re-identification," arXiv preprint arXiv:1703.07737, 2017. [7] O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, et al, "Imagenet large scale visual recognition challenge," International Journal of Computer Vision, 115(3):211–252, 2015. [8] M. Geng, Y. Wang, T. Xiang, and Y. Tian, "Deep transfer learning for person re-identification," arXiv preprint arXiv:1611.05244, 2016. [9] R. Girshick, J. Donahue, T. Darrell and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, 2014, pp. 580-587. [10] R. Girshick, "Fast r-cnn," 2015 IEEE International Conference on Computer Vision , Santiago, 2015, pp. 1440-1448. [11] S. Ren, K. He, R. Girshick and J. Sun, "Faster r-cnn: Towards real-time object detection with region proposal networks," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 6, pp. 1137-1149, June 1 2017. [12] J. Redmon, S. Divvala, R. Girshick and A. Farhadi, "You only look once: Unified, real-time object detection," 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, 2016, pp. 779-788. [13] M. Everingham, S. A. Eslami, L. Van Gool, C. K. Williams, J. Winn, and A. Zisserman, "The pascal visual object classes challenge: A retrospective," International journal of computer vision, 2015. [14] J. Redmon and A. Farhadi, "Yolo9000: Better, faster, stronger," 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, 2017, pp. 6517-6525. [15] A. Bewley, Z. Ge, L. Ott, F. Ramos and B. Upcroft, "Simple online and realtime tracking," 2016 IEEE International Conference on Image Processing, Phoenix, AZ, 2016, pp. 3464-3468. [16] R. Kalman, "A new approach to linear filtering and prediction problems," Journal of Basic Engineering, vol. 82, no. Series D, pp. 35-45, 1960. [17] N. Wojke, A. Bewley and D. Paulus, "Simple online and realtime tracking with a deep association metric," 2017 IEEE International Conference on Image Processing, Beijing, 2017, pp. 3645-3649. [18] A. Milan, L. Leal-Taixe, I. Reid, S. Roth, and K. Schindler, "Mot16: A benchmark for multi-object tracking," arXiv preprint arXiv:1603.00831, 2016. [19] X. Zhang, H. Luo, X. Fan, W. Xiang, Y. Sun, Q. Xiao W. Jiang, C. Zhang, and J. Sun, "Alignedreid: Surpassing human-level performance in person re-identification," arXiv preprint arXiv:1711.08184, 2017. [20] Tensorflow官方網站,https://www.tensorflow.org/ ,2015年11月 [21] H. W. Kuhn, "The hungarian method for the assignment problem", Naval Research Logistics Quarterly, vol. 2, pp. 83-97, 1955. [22] Y. Zhang, T. Xiang, T. M. Hospedales, and H. Lu, "Deep mutual learning," arXiv preprint arXiv:1706.00384, 2017. [23] K. Bernardin and R. Stiefelhagen, "Evaluating multiple object tracking performance: The clear mot metrics", Image and Video Processing, May 2008. [24] N. Wojke and A. Bewley, "Deep cosine metric learning for person re-identification," 2018 IEEE Winter Conference on Applications of Computer Vision, Lake Tahoe, NV, 2018, pp. 748-756. [25] F. Schroff, D. Kalenichenko and J. Philbin, "Facenet: A unified embedding for face recognition and clustering," 2015 IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, 2015, pp. 815-823. [26] 黃冠穎，「基於深度學習之贓車偵測系統」，國立中央大學電機研究所，碩士論文，2017年。 [27] 林宜臻，「基於深度學習之戶外導航機器人」，國立中央大學電機研究所，碩士論文，2018年。
指導教授	王文俊陳翔傑(Wen-June Wang Hsiang-Chieh Chen)	審核日期	2018-8-2
推文	facebook plurk twitter funp google live udn HD myshare reddit netvibes friend youpush delicious baidu
網路書籤	Google bookmarks del.icio.us hemidemi myshare

博碩士論文 105521064 詳細資訊