中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/90018
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 80990/80990 (100%)
Visitors : 41641138      Online Users : 1382
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version


    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/90018


    Title: 基於知識蒸餾的動作識別模型;Knowledge Distillation-based Models for Action Recognition
    Authors: 武德光;Quang, Vu Duc
    Contributors: 資訊工程學系
    Keywords: 深度學習;動作識別;卷積神經網絡;視頻分類;3D CNN;Deep learning;Action recognition;Convolutional Neural Network;Video Classification;3D CNN
    Date: 2022-09-19
    Issue Date: 2022-10-04 12:07:55 (UTC+8)
    Publisher: 國立中央大學
    Abstract: 過去這幾年,我們看到多種電腦視覺應用有顯著的進步,尤其是在人類動作識別這個領域。人類動作辨識的目的在自動檢查和識別影片中的發聲的動作,且已經廣泛地在多種應用中使用。本論文對基於深度學習的人類動作識別的方法和技術進行了全面概述,並特別聚焦在三種主要的學習策略:監督學習、自監督式學習和半監督學習。針對每個學習機制,我們引入了有效的方法來解決基於知識蒸餾的動作辨識和知識蒸餾的優化。具體來說,對於監督式學習,我們提出了一個輕量化的網路架構,也就是(2+1)DShuffleNet,此外我們也引入了兩個基於知識蒸餾的方法來優化學生網路的泛化能力和性能,而不需要龐大且昂貴的教師網路;至於自監督式學習,我們提出一個新的對基於自監督式學習的動作辨識的委託任務;最後,我們提供了一種基於相互學習的半監督式動作辨識的有效方法。所有的實驗結果顯示,這些方法不僅實現最先進的性能,更在模型大小、運算成本、訓練時間、運行時間等不同指標都有所提升。;Over the past several years, we have witnessed remarkable progress in numerous computer vision applications, particularly in human activity analysis. Human action recognition, which aims to automatically examine and recognize the actions taking place in the video, has been widely applied in many applications. This thesis presents a comprehensive survey of approaches and techniques in deep learning-based human activity analysis. In particular, the thesis focuses on three main strategies of learning including supervised learning, self-supervised learning, and semi-supervised learning. In each learning mechanism, we introduce efficient approaches to address action recognition based on knowledge distillation and improvements for knowledge distillation. Specifically, for supervised learning, we proposed a lightweight network architecture i.e., (2+1)D ShuffleNet. Besides, we also introduce two self-knowledge distillation-based approaches to improve the generalization and performance of the student network without the large and expensive teacher network. For self-supervised learning, we present a novel pretext task for self-supervised learning-based action recognition. Finally, we propose an efficient approach based on mutual learning to semi-supervised action recognition. All experiment results have shown that these approaches not only achieve state-of-the-art performance but also improve in terms of many different metrics such as model size, computational cost, training time, running time, etc.
    Appears in Collections:[Graduate Institute of Computer Science and Information Engineering] Electronic Thesis & Dissertation

    Files in This Item:

    File Description SizeFormat
    index.html0KbHTML62View/Open


    All items in NCUIR are protected by copyright, with all rights reserved.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明