博碩士論文 104523038 完整後設資料紀錄

DC 欄位 語言
DC.contributor通訊工程學系zh_TW
DC.creator伍聿旂zh_TW
DC.creatorYu-Chi Wuen_US
dc.date.accessioned2017-7-26T07:39:07Z
dc.date.available2017-7-26T07:39:07Z
dc.date.issued2017
dc.identifier.urihttp://ir.lib.ncu.edu.tw:88/thesis/view_etd.asp?URN=104523038
dc.contributor.department通訊工程學系zh_TW
DC.description國立中央大學zh_TW
DC.descriptionNational Central Universityen_US
dc.description.abstract隨著人類追求便利性,我們使用電腦使其學習並了解人類所熟知的事物,我們希望通過分析聲音使電腦認識自己的環境,自2013年首次舉辦IEEE Audio and Acoustic Signal Processing (AASP) 聲音場景與事件辨識(Detection and Classification of Acoustic Scenes and Events, DCASE) 競賽,掀起了聲音場景分類 (Acoustic scene classification, ASC)的風波,邁向統一ASC的資料庫與評估方法的第一步,更於2016年舉辦第二屆 DCASE2016競賽。 本論文利用深度學習中的摺積神經網路 (Convolutional Neural Net-work, CNN) 作為ASC的方法。由於CNN之輸入資料為頻譜,而頻譜包含時域資訊與頻域資訊,因此我們假設時域資訊與頻域資訊的資料變化量不一,因此使用長形的摺積核 (kernel) ,也就是本論文提出之非對稱摺積核 (Asymmetric Kernel) (相對於以往的方形的對稱摺積核),並在訓練期間做資料正規化 (Normalization)加速訓練。我們發現即使現在多以寬又深的網路作為趨勢,發展更佳的資料分類方法,但其實本論文所提出的架構,兩層不用預訓練 (Pre-train)的CNN即可達到相較DCASE2016排名第五名更佳的效果。zh_TW
dc.description.abstract Detection and Classification of Acoustic Scenes and Events (DCASE) Challenge have held in three times. The first DCASE Challenge was held in 2013. Then, DCASE2016 Challenge was the 2nd times of DCASE Challenge. The result why IEEE Audio and Acoustic Signal Processing (AASP) held the 2nd challenge after 3 years is to reset a brand new dataset and united the rule of ASC. In this work, we use the dataset of ASC from DCASE2016 to propose an Asymmetric Kernel Convolutional Neural Network (AKCNN), whose kernel shape is very different from the traditionally squared kernel. The width and height of the kernel are asymmetric which means that the shape of the kernel is a rectangular kernel. Also, the proposed uses weight normalization (WN) to accelerate the training time because it can early converge the training loss and testing accuracy during training. The best of all, WN can help increase the accuracy of ASC. The result shows that AKCNN achieves accuracy 86.7%. If we rank the score in DCASE2016 ASC Challenge, it would show that we have a better score than the 5th place.en_US
DC.subject計算聽覺場景分析zh_TW
DC.subject聲音場景辨分類zh_TW
DC.subject深度學習zh_TW
DC.subject摺積神經網路zh_TW
DC.subjectComputational Auditory Scene Analysisen_US
DC.subjectAcoustic scenes classificationen_US
DC.subjectDeep learningen_US
DC.subjectConvolutional neural networken_US
DC.title非對稱摺積神經網路之聲音場景分類zh_TW
dc.language.isozh-TWzh-TW
DC.titleAsymmetric Kernel Convolutional Neural Network for Acoustic Scenes Classificationen_US
DC.type博碩士論文zh_TW
DC.typethesisen_US
DC.publisherNational Central Universityen_US

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明