中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/10385
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 78852/78852 (100%)
Visitors : 38063227      Online Users : 811
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version


    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/10385


    Title: 整合高斯混合與具性能指標支撐向量機模型之語者確認研究;A Hybrid Model of GMM and SVM with Representative Labels for Speaker Verification
    Authors: 游智翔;Chih-hsiang Yu
    Contributors: 電機工程研究所
    Keywords: 高斯混合模型;支撐向量機;語者確認;support vector machine;speaker verification;gaussian mixture model
    Date: 2008-06-13
    Issue Date: 2009-09-22 12:14:51 (UTC+8)
    Publisher: 國立中央大學圖書館
    Abstract:   本論文主要針對語者確認系統上,提出新的辨識流程,使得系統效能得到提升,此架構包含了高斯混合模型和具性能指標支撐向量機模型的整合應用。   其中,具性能指標支撐向量機,主要是在原始特徵向量中,加入所定義的性能指標,使得向量維度增高,讓整個系統更具鑑別力。而在提出的系統架構中,測試句與所有註冊模型算分數,以決定類別標籤,依據Top1減Top2的分數,並觀察是否大於或等於臨界值,若大於或等於,則使用Top1的類別標籤,使測試句的特徵向量增維,並和含類別標籤的支撐向量機算距離值,反之,則進入原本傳統的語者確認系統。   從實驗結果顯示,在提出的架構中,高斯混合模型選定為128-mixture並定臨界值為0.3時,系統性能可達最好的相等錯誤率及決策成本函數為14.43%和0.1743,比起支撐向量機語者確認系統的效能17.86%和0.2175,改善了3.43%和0.0414,而比起傳統的語者確認系統的效能15.87%和0.1912,改善了1.44%和0.0169。 This thesis proposes a new recognition system to improve performance for speaker verification. The proposed system combines the Gaussian Mixture Model (GMM) and Support Vector Machine (SVM) with representative labels. The SVM with representative labels is built by adding the defined class labels to the original feature vectors to increase the dimension of feature vectors and make the system more discriminative. In the proposed system, each input segment is sent to compute the log-likelihood ratio with all the enrolled models to decide the class labels. Accordingly, if the difference of the scores between Top1 and Top2 is greater than a chosen threshold, the class labels for the top1 speaker will be added as extra features to the original feature vectors. Then the augmented feature vectors are applied to the SVM classifier. Otherwise, we verify the speaker using the GMM-UBM baseline system. The experimental result shows that with a 128-mixture GMM and a 0.3 threshold, the proposed system obtains a 3.43% EER and 4.14% DCF improvement over the SVM speaker verification system, and a 1.44% EER and 1.69% DCF improvement over the baseline system.
    Appears in Collections:[Graduate Institute of Electrical Engineering] Electronic Thesis & Dissertation

    Files in This Item:

    File SizeFormat


    All items in NCUIR are protected by copyright, with all rights reserved.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明