以多特徵神經網路實現連續手語識別

DC 欄位	值	語言
DC.contributor	資訊工程學系	zh_TW
DC.creator	費群安	zh_TW
DC.creator	Arda Satata Fitriajie	en_US
dc.date.accessioned	2022-7-25T07:39:07Z
dc.date.available	2022-7-25T07:39:07Z
dc.date.issued	2022
dc.identifier.uri	http://ir.lib.ncu.edu.tw:88/thesis/view_etd.asp?URN=109522601
dc.contributor.department	資訊工程學系	zh_TW
DC.description	國立中央大學	zh_TW
DC.description	National Central University	en_US
dc.description.abstract	若有 RGB 視頻串流，我們的目標是正確識別與連續手語識別 (CSLR) 相關的手語。儘管該領域提出的深度學習方法逐漸增加，但大多數主要集中在僅使用 RGB 特徵，無論是全幀圖像還是手部和臉部的細節。 CSLR 訓練過程信息的不足嚴重限制了他們學習視頻輸入幀中多個特徵的能力。目前，多特徵網路變得相當普遍，因為當前的計算能力不再限制我們擴大網路規模。因此，在本文中，我們將研究深度學習網路並應用多特徵技術，以期增加和改進當前的連續手語識別任務，詳細說明我們將包括的另一個特徵在這項研究中，如果我們將它們做比較，關鍵點特徵沒有圖像特徵那麼沉重。這項研究的結果表明，在 Phoenix2014 和中國手語這兩個最流行的 CSLR 數據集上，添加關鍵點特徵作為一種多特徵模態可以提高識別率，或者通常會降低單詞錯誤率 (WER)。	zh_TW
dc.description.abstract	Given the RGB video streams, we aim to recognize signs related to continuous sign language recognition (CSLR) correctly. Despite there are increasing of proposed deep learning methods in this area, most of them mainly focus on only using an RGB feature, either the fullframe image or the detail of hands and face. The scarcity of information for the CSLR training process heavily constrains their capability to learn the multiple features within the video input frames. Currently, Multi-feature networks became something quite common since the current computing power is something that is not limiting us from scaling the network size anymore. Thus, in this thesis, we’re going to work deep learning network and apply a multi-feature technique with the hope to increase & improve the current state of the art of continuous sign language recognition tasks, in detail another feature that we would include in this research is the key-point feature which is not as heavy as the image feature if we are comparing them. The result of this research shows that adding a key-point feature as a multi-feature modality could increase the recognition rate or commonly, decrease the word error rate (WER) on the two most popular CSLR datasets: Phoernix2014 and Chinese Sign Language.	en_US
DC.subject	圖像處理	zh_TW
DC.subject	視頻處理	zh_TW
DC.subject	連續手語識別	zh_TW
DC.subject	手勢識別	zh_TW
DC.subject	關鍵點	zh_TW
DC.subject	Image Processing	en_US
DC.subject	Video Processing	en_US
DC.subject	Continuous Sign Language Recognition	en_US
DC.subject	Gesture Recognition	en_US
DC.subject	Keypoint	en_US
DC.title	以多特徵神經網路實現連續手語識別	zh_TW
dc.language.iso	zh-TW	zh-TW
DC.title	Realizing Sign Language Recognition using Multi-Feature Neural Network	en_US
DC.type	博碩士論文	zh_TW
DC.type	thesis	en_US
DC.publisher	National Central University	en_US

博碩士論文 109522601 完整後設資料紀錄