English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 78937/78937 (100%)
造訪人次 : 39611572      線上人數 : 238
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/85108


    題名: 深度學習應用在不同情況條碼定位之研究;An Application of a Deep Learning Medthod for Barcode Localization under different environment
    作者: 黃千祐;Huang, QIAN-YOU
    貢獻者: 電機工程學系
    關鍵詞: 深度學習;條碼;物體定位;Deep Learning;Barcode;Object Localization
    日期: 2021-01-26
    上傳時間: 2021-03-18 17:41:19 (UTC+8)
    出版者: 國立中央大學
    摘要: 隨著貿易越來越頻繁,每一個商品在製造、買賣的過程中都需要透過條碼來確保過程正確無誤,但條碼容易受光照不均、雜訊等因素影響,再加上條碼的種類十分多元,也讓條碼的定位更加困難。由於傳統方法只能找單一種類條碼且易受到雜訊的影響而使結果有誤,因此為了有效解決這些問題,本研究採用以深度學習為基礎的方法來研究多種類條碼的定位。本研究分為三個階段。第一階段訓練深度學習網路來找出條碼。考慮到FasterRCNN網路相較於RCNN等較舊的神經網路,其在物體辨識的領域中取得了更快更準的辨識結果,因此本研究採用FasterRCNN的架構進行神經網路的訓練。訓練所需資料採用CipherLab之條碼掃描器拍攝,總共包含10008張圖片,並以8:1:1比例分為訓練資料、驗證資料及測試資料。第二階段為提出2個有效的傳統影像處理演算法來辨識條碼。本研究先利用神經網路剪枝的方式試圖找出條碼的特徵,然而效果不佳,因此改採用Gradcam將神經網路的權重可視化方法,以獲得每層神經網路權重分別對圖片的哪些區域影響最大。藉此來幫助傳統影像處理演算法的優化。結果評估所採用的影像以CipherLab公司所提供之測試圖像作為依據,評估方法以辨識結果之precision及recall為主,其中又特別著重在recall值,本研究最終神經網路結果recall為0.89、precision為0.65,最終2種傳統影像方法模擬較佳者可達到recall數值為0.88、precision數值為0.52,顯示本研究對條碼辨識能夠有不錯的效果。第三階段會將傳統影像演算法較佳的結果實現在PDA模擬器上,其辨識速度約為1200ms。雖然在執行速度上有所不足,但本方法能夠成功的定位多種不同的條碼,並且擁有不錯的定位結果。;Abstract
    As trade grows more and more frequently, every product needs to use a barcode to ensure that the process of manufacturing and shipping is correct. However, the recognition of barcodes can easily be affected by factors such as uneven lighting and noise. In addition, the numerous categories of barcodes also make the barcode localization difficult. Traditional methods are limited for the detection capability with different types of barcodes and for the detection accuracy under noisy conditions. In order to solve these problems, we apply deep learning to localize various types of barcodes.

    This research consists of three parts. Firstly, a deep learning neural network was trained to find out barcodes. FasterRCNN was adopted in this research because of its faster and more accurate detection results compared to the older neural network such as RCNN. The training data used were captured by the barcode scanner made by CipherLab. A total of 10008 images were divided into the training data, the verification data and the test data by a ratio of 8:1:1 with the verification and the test date in a ratio.

    Secondly, this research proposed two algorithms based on the results of FasterRCNN. Neural network pruning was used to find the feature of barcodes. However, the result was unsatisfactory. Hence, a weight visualization method, Gradcam, was used to get the saliency maps at some layers of the neural network. By using the result of Gradcam, two traditional image processing approaches was proposed for barcode detection. Method 1 used the image entropy to enhance the region of barcodes. Method 2 detected barcodes by morphology. Testing barcode images provided by CipherLab were used to validate the performance of the two methods. We used recall and precision for our evaluation method, and focused on the performance of recall.

    The recall value of FasterRCNN was 0.89 and precision value was 0.65. Method 1 obtained the recall value of 0.86 and the precision value of 0.53, while Method 2 was with the recall value of 0.88 and the precision value of 0.52. The result showed that our method can detection barcode well.

    Finally, we choose method 2 and implemented it on the PDA simulator. On PDA simulator we can detect all the barcodes of an image within 1.2 seconds. Although we do not have very good barcode detection speed, our method can successfully detect various kind of barcodes and achieve good detection results.
    顯示於類別:[電機工程研究所] 博碩士論文

    文件中的檔案:

    檔案 描述 大小格式瀏覽次數
    index.html0KbHTML254檢視/開啟


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明