博碩士論文 104522023 完整後設資料紀錄

DC 欄位 語言
DC.contributor資訊工程學系zh_TW
DC.creator謝旻哲zh_TW
DC.creatorMin-Che Hsiehen_US
dc.date.accessioned2017-8-18T07:39:07Z
dc.date.available2017-8-18T07:39:07Z
dc.date.issued2017
dc.identifier.urihttp://ir.lib.ncu.edu.tw:88/thesis/view_etd.asp?URN=104522023
dc.contributor.department資訊工程學系zh_TW
DC.description國立中央大學zh_TW
DC.descriptionNational Central Universityen_US
dc.description.abstract聽覺在人們生活中站了一大部分的地位,在擁有聽覺情況下,聲音讓人們更加清楚周遭的狀況,並且使生活多了一點色彩。在各式各樣的聲音種類下,若經由強健其特徵與自動化的分類方法,有助於迅速瞭解各式的緊急狀況或增進學習效果,因此以環境聲音、樂器聲為類別來進行分類與強健,也逐漸受到重視。 在傳統的自編碼器上,主要是經由神經網路[29]去做重建,並有利於連接各式分類器提升其辨識效果;而變異型自編碼器(Variational Auto-encoder , VAE)引入隨機變分推理[25],運用隨機梯度法使重新參數化的變分下界可以達到最佳優化的結果,進而使用識別模型(Recognition model)近似較難處理之後驗分佈(Posterior distribution)。基於高斯程序回歸模型(Gaussian Process Regression Model)也須經由訓練其參數得出下界值,並加以結合變異型自編碼器與高斯程序回歸模型之下界,使其同時訓練其參數以便減少各別訓練之時間,達到最佳優化之效果。 在實驗部分,為了顯示出此模型之強健性,我們藉此比較有噪聲與無噪聲之辨識效果,而我們也將討論不同的初始參數設定的差異,了解其收斂速度與辨識效果。zh_TW
dc.description.abstract The sense of hearing plays an important role in human’s daily life. In the case of hearing circumstances, sense of hearing not only enables people to understand the situation more clearly, but also enrich people’s life more colorful. Within all various of sound types, if we apply robust features and automated classification methods can assist us to understand different types of emergencies more quickly or enhance the effect of learning. Therefore, the classification of categories and robustness through ambient sound and musical instruments has gradually been taken more seriously. In the traditional auto-encoder, photos and audios are mainly reconstructed through the neural network [29], and it is conducive to connect all kinds of classifiers to enhance its recognition effect. On the other side, variational auto-encoder introduced random variational inference [25]. It uses the stochastic gradient method to re-parameterize the variational lower bound to achieve the best optimization results. Afterwards, they use the recognition model to estimate the more difficult the Posterior distribution. The Gaussian Process Regression Model is also required to derive the lower bound by training its parameters, and then we combine the lower bound of the variational auto-encoder and Gaussian process regression model. Finally, we train these parameters which including (Gaussian process regression model and the variational auto-encoder) will achieve the best optimize effect, by reducing the cost of time. In the experimental part, in order to show the robustness of this model, we compare the differences between noise and clean identification effect. And we will also discuss the differences between the initial parameters of different, to discover its speed of convergence and identification effect.en_US
DC.subject高斯程序回歸模型zh_TW
DC.subject變異型自編碼器zh_TW
DC.subject變異推理zh_TW
DC.subjectGaussian Process Regression Modelen_US
DC.subjectVariational Auto-encoderen_US
DC.subjectvariational inferenceen_US
DC.title基於高斯程序回歸模型與變異型自編碼器之強健性聲音辨識方法zh_TW
dc.language.isozh-TWzh-TW
DC.titleRobust Audio Recognition Based on Gaussian Process Regression Model and Variational Auto-encoderen_US
DC.type博碩士論文zh_TW
DC.typethesisen_US
DC.publisherNational Central Universityen_US

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明