NCU Institutional Repository (theses and dissertations, past exam papers, journal articles, and research projects available for download): Item 987654321/9735


    Please use this permanent URL to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/9735


    Title: The Analysis and Recognition of Emotional Speech Using Artificial Neural Networks (類神經網路應用於語音情緒的分析與辨識)
    Author: Zheng-lun Chen (陳正倫)
    Contributor: Graduate Institute of Computer Science and Information Engineering
    Keywords: emotional speech recognition; FHRCNN; MBLPCC; pitch; Fisher's ratio; artificial neural networks
    Date: 2009-07-03
    Uploaded: 2009-09-22 11:54:49 (UTC+8)
    Publisher: National Central University Library
    Abstract: This thesis presents a multi-band linear predictive cepstral coefficient (MBLPCC) feature for emotional speech recognition. Using the discrete wavelet transform (DWT), the emotional speech signal is decomposed into several frequency subbands, and linear predictive cepstral coefficients (LPCC) are computed for the lower-frequency subband at each decomposition level. After analyzing different MBLPCC parameter settings, we settle on two decomposition levels, 10 LPCC coefficients, and a downsampling ratio of eight. We combine MBLPCC with pitch and energy-curve features for a total of 52 features, and use Fisher's ratio to select 32 of them as the features of a seven-emotion speech recognition system, which achieves a recognition rate of 68% (the Chinese abstract gives the overall recognition rate as 90%). Finally, we compare three artificial neural network (ANN) recognizers: multilayer perceptrons (MLP), radial basis function networks (RBFN), and fuzzy hyperrectangular composite neural networks (FHRCNN). On the overall data set, MLP achieves the best recognition rate, over 90%; FHRCNN reaches up to 100% on the training data set; and RBFN achieves 68% on the testing data set.
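The Fisher's-ratio selection step mentioned in the abstract (32 features kept out of 52) can be illustrated with a minimal sketch. The abstract does not give the thesis's exact formula, so this assumes one common multi-class definition: for each feature, the variance of the class means divided by the mean within-class variance. The function names `fisher_ratios` and `select_top_k` and the toy data are hypothetical, not taken from the thesis.

```python
from statistics import fmean, pvariance


def fisher_ratios(samples, labels):
    """Per-feature Fisher ratio (assumed definition):
    variance of the class means / mean within-class variance."""
    classes = sorted(set(labels))
    n_features = len(samples[0])
    ratios = []
    for j in range(n_features):
        # Group the values of feature j by class label.
        by_class = {c: [] for c in classes}
        for row, label in zip(samples, labels):
            by_class[label].append(row[j])
        class_means = [fmean(values) for values in by_class.values()]
        between = pvariance(class_means)
        within = fmean([pvariance(values) for values in by_class.values()])
        # A feature with zero within-class spread is treated as maximally separable.
        ratios.append(between / within if within > 0 else float("inf"))
    return ratios


def select_top_k(samples, labels, k):
    """Indices of the k features with the largest Fisher ratio, best first."""
    ratios = fisher_ratios(samples, labels)
    order = sorted(range(len(ratios)), key=lambda j: ratios[j], reverse=True)
    return order[:k]


# Toy data: feature 0 separates the two classes, feature 1 does not.
toy_x = [[0.0, 5.0], [0.2, 1.0], [1.0, 4.0], [1.2, 2.0]]
toy_y = ["angry", "angry", "sad", "sad"]
print(select_top_k(toy_x, toy_y, k=1))  # [0]
```

With the thesis's setup, `samples` would hold one 52-dimensional feature vector per utterance and `k` would be 32; the ranking itself is independent of which classifier is trained afterwards.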
    Appears in collections: [Graduate Institute of Computer Science and Information Engineering] Theses and Dissertations

    All items in NCUIR are protected by their original copyright.
