中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/83959
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 78852/78852 (100%)
Visitors : 38692768      Online Users : 589
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version


    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/83959


    Title: 結合語義分割特徵與注意力模型之室內場景分類系統;Indoor Scene Image Classification System combining Semantic Segmentation Features and Attention Module
    Authors: 黃健銘;Huang, Jian-Ming
    Contributors: 資訊工程學系
    Keywords: 場景辨識;語義分割;注意力模型;特徵融合
    Date: 2020-07-21
    Issue Date: 2020-09-02 17:46:55 (UTC+8)
    Publisher: 國立中央大學
    Abstract: 場景辨識是電腦視覺中重要的一個環節,現今機器學習的方法效能遠遠高於傳統處理的方式,然而,直接使用神經網路進行分類往往會遺失物體、空間佈局、和背景之間關聯的資訊,導致分類效果不佳。因此抓取出物體、空間佈局、和背景之間關聯的資訊,並使用有效的方式將這些資訊、特徵與原圖結合進行分類,是目前場景分類中重要的挑戰。
    本論文提出的方法,對影像做語義分割,並將語義分割影像與原圖影像分別使用神經網路模型提取特徵,將語義分割特徵使用注意力模型與原圖特徵進行特徵融合,最後進行分類、辨識。
    實驗結果證明,在我們收集的旅館室內場景資料集中,準確率能達到最好的效果。在公開15-Scene資料集中,比較其他論文方法,我們方法的效果可以取得更好的分類準確度。因此,透過使用語義分割的方式,能夠抓取到物體、空間佈局和背景之間關聯的資訊,並使用注意力模型進行特徵融合,能在場景辨識中取得更好的辨識效果。
    ;Scene recognition is an important part of computer vision. The efficiency of current machine learning methods is much better than traditional processing methods. However, using neural networks directly for classification often loses more information of objects, spatial layout, and background. Resulting in poor classification. Therefore, it is an important challenge in scene classification to capture the information of objects, spatial layout, and background, and use an effective method to merge these features to classify scene.
    The method proposed in this paper performs semantic segmentation on the image. Use Neural network model to extract the features of the semantic segmentation image and original image respectively. And then, use the attention module to fuse the semantic segmentation features with original image features. Finally, according to these fused features to classify images.
    The experiment results show that our method can achieve the best result on the Hotel Indoor Scene dataset. Furthermore, in the public 15-Scene dataset, our method can outperform existing methods. Therefore, by using semantic segmentation, the information of objects, spatial layout and background can be captured. Using the attention module to do feature fusion can achieve better accuracy in scene recognition.
    Appears in Collections:[Graduate Institute of Computer Science and Information Engineering] Electronic Thesis & Dissertation

    Files in This Item:

    File Description SizeFormat
    index.html0KbHTML128View/Open


    All items in NCUIR are protected by copyright, with all rights reserved.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明