中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/92454
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 78937/78937 (100%)
Visitors : 39425469      Online Users : 475
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version


    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/92454


    Title: 應用於物件偵測之動態注意力圖神經網路區塊;Dynamic Graph Attention Blocks on Object Detection
    Authors: 鄭皓中;Cheng, Hao-Chung
    Contributors: 資訊工程學系
    Keywords: 物件偵測;圖神經網路;可變卷積;圖注意力網路
    Date: 2023-05-25
    Issue Date: 2023-10-04 16:01:54 (UTC+8)
    Publisher: 國立中央大學
    Abstract: 在深度學習電腦視覺領域裡,物件偵測任務一直是廣泛受到討論與重視的研究領域,在現實世界中存在廣大的應用場景,使其成為不可撼動的重要研究項目。相關的模型方法也不斷推陳出新,不管是基於卷積神經網路或是基於 Transformer 架構的模型都在持續發展中,但圖神經網路在這方面的應用卻不多,尤其是二維影像的應用,使我們想探討圖神經網路在二維影像物件偵測任務上的可能性。
    圖神經網路近期逐漸受到重視,歸功於其對圖資料結構良好的表示能力,使其能夠
    探索不規則鄰居節點的關係。先前有部分研究工作將圖神經網路架設在卷積神經網路上,並探索此方法帶來的性能提升,但其存在實驗比較對象不夠適當、其圖結構的鄰居與邊是依據固定空間範圍建立,如此可能使感受野及探索能力受到限制,甚至可能降低圖神經網路探索能力等問題。
    我們針對上述問題提出了可模組化的動態注意力圖神經網路區塊(Dynamic Graph Attention Blocks),引入可變卷積來增加圖神經網路的探索能力,使其建邊的方式由固定改為動態建立,讓模型可以自己學習找到更好的特徵做卷積,同時將模組架設在 state-of-the-art 物件偵測器上進行實驗。經由實驗顯示我們的方法可以得到匹配或稍加的表現。;In the realm of deep learning for computer vision, the object detection tasks have always been a widely discussed and emphasized research area. There are many application scenarios in the real world, making it an unshakable research area. Various models are constantly being evolved, whether based on convolutional neural networks or Transformer architectures, both of which are in continuous development. However, the application of graph neural networks in this area is not common, especially in the case of 2D images, which prompts us explore the potential of graph neural networks in 2D image object detection tasks.
    Graph neural networks have recently gained attention due to their strong representation ability for graph data structures, enabling them to explore the relationship of irregular
    neighborhood nodes. Some previous research efforts have combined graph neural networks
    with convolutional neural networks and explored the performance improvements brought by this method. But the experimental comparisons are not suitable enough, and the neighbors and edges of the graph structure are established based on a fixed spatial range, which may limit the receptive field and exploration capabilities, and may even reduce the exploration ability of graph neural networks.
    To address these issues, we propose a modular Dynamic Graph Attention Blocks, which introduces deformable convolutions to enhance the exploration capabilities of graph neural networks. This change allows the edges to be dynamically established rather than fixed, enabling the model to learn to find better features for convolution. Simultaneously, we integrate the module into state-of-the-art object detector for experiments. Our experiments show that our method can achieve comparable or slightly improved performance.
    Appears in Collections:[Graduate Institute of Computer Science and Information Engineering] Electronic Thesis & Dissertation

    Files in This Item:

    File Description SizeFormat
    index.html0KbHTML38View/Open


    All items in NCUIR are protected by copyright, with all rights reserved.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明