基於卷積神經網路之語音辨識;Speech Recognition by Using Convolutional Neural Network

NCU Institutional Repository > 資訊電機學院 > 電機工程研究所 > 博碩士論文 > Item 987654321/81386

請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/81386

題名:	基於卷積神經網路之語音辨識;Speech Recognition by Using Convolutional Neural Network
作者:	楊恕先;Yang, Shu-Sian
貢獻者:	電機工程學系
關鍵詞:	語音辨識;深度學習;神經網路;speech recognition;deep learning;neural network
日期:	2019-06-27
上傳時間:	2019-09-03 15:49:49 (UTC+8)
出版者:	國立中央大學
摘要:	本論文在探討如何利用深度學習來進行語音辨識，而使用的辨識方法是先透過梅爾倒頻譜係數((Mel frequency cepstral coefficients, MFCCs)取得語音特徵參數，並輸入卷積神經網路(Convolutional Neural Network, CNN)進行語音辨識。此法與傳統語音辨識方法最大不同是在於不需要建立聲學模型，以中文為例就省去建立大量聲母(consonant)、韻母(vowel)比對的時間。藉由透過MFCCs取得特徵參數後就可以透過卷積神經網路實現語音辨識，並且不會受到語言種類的限制。 ;The thesis developed a speech recognition method for automatic speech recognition. In this speech recognition method, we obtained the speech feature parameters through Mel frequency cepstral coefficients and input a Convolutional Neural Network. The main difference between this Convolutional Neural Network speech recognition method and traditional speech recognition method is that it does not need to establish an acoustic model. For example, in Chinese, it saved a lot of time without establishing a large number of consonant and vowel models. After obtaining the speech feature parameters through the MFCCs, speech recognition is finished through Convolutional Neural Network.
顯示於類別:	[電機工程研究所] 博碩士論文

文件中的檔案:

檔案	描述	大小	格式	瀏覽次數
index.html		0Kb	HTML	100	檢視/開啟

在NCUIR中所有的資料項目都受到原著作權保護.

社群 sharing

資料載入中.....