

    Please use this permanent URL to cite or link to this item: https://ir.lib.ncu.edu.tw/handle/987654321/106007


    Title: A generative data augmentation model for enhancing Chinese dialect pronunciation prediction
    Authors: 蔡宗翰; Lin, Chu-Cheng; Tsai, R. T-H
    Contributors: College of Electrical Engineering and Computer Science, Department of Computer Science and Information Engineering
    Keywords: Applied sciences; Chinese dialects; data augmentation; Data models; Dictionaries; Exact sciences and technology; generative model; Information, signal and communications theory; pronunciation database; Signal and communications theory; Signal processing; Signal representation. Spectral analysis; Signal, noise; Speech; Speech processing; Support vector machines; Telecommunications and information theory
    Date: 2012-05-01
    Upload time: 2026-04-23 13:03:37 (UTC+8)
    Publisher: Institute of Electrical and Electronics Engineers Inc.; Piscataway, NJ: IEEE
    Abstract: Most spoken Chinese dialects lack comprehensive digital pronunciation databases, which are crucial for speech processing tasks. Given complete pronunciation databases for related dialects, one can use supervised learning techniques to predict a Chinese character's pronunciation in a target dialect based on the character's features and its pronunciation in other related dialects. Unfortunately, Chinese dialect pronunciation databases are far from complete. We propose a novel generative model that uses both existing dialect pronunciation data and medieval rime books to discover patterns that exist in multiple dialects. The proposed model can augment missing dialectal pronunciations based on existing dialect pronunciation tables (even if incomplete) and the pronunciation data in rime books. The augmented pronunciation database can then be used in supervised learning settings. We evaluate the prediction accuracy in terms of phonological features, such as tone, initial phoneme, and final phoneme. For each character, the features are also evaluated as a whole, giving the overall pronunciation feature accuracy (OPFA). The results of our first experiment show that adding features from dialectal pronunciation data to our baseline rime-book model dramatically improves OPFA using the support vector machine (SVM) model. In the second experiment, we compare the performance of the SVM model using phonological features from closely related dialects with that of the model using phonological features from non-closely related dialects. The experimental results show that using features from closely related dialects results in higher accuracy. In the third experiment, we show that using our proposed data augmentation model to fill in missing data can increase the SVM model's OPFA by up to 7.6%.
    Other title: TASL
    Publisher: Piscataway, NJ: IEEE
    Publication date: 2012-05-01
    Source: IEEE Transactions on Audio, Speech, and Language Processing, 2012-05, Vol. 20 (4), pp. 1109-1117
    Resource source: IEEE Electronic Library (IEL)
    Rights: 2015 INIST-CNRS
    Identifier: ISSN: 1558-7916
    Identifier: EISSN: 1558-7924
    Identifier: DOI: 10.1109/TASL.2011.2172424
    Identifier: CODEN: ITASD8
    Appears in collections: [Department of Computer Science and Information Engineering] Journal Articles
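
The abstract reports accuracy per phonological feature (tone, initial, final) and also for each character as a whole (OPFA), but it does not give the exact formula. The following is a minimal sketch under one possible reading, in which a character counts as correct only when every feature is predicted correctly. The feature names, the function names, and the toy data are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Union

Pronunciation = Dict[str, Union[str, int]]

# Hypothetical feature inventory; the paper may use more or different features.
FEATURES = ("initial", "final", "tone")


def _check(predicted: List[Pronunciation], gold: List[Pronunciation]) -> None:
    """Reject empty or mismatched inputs before computing any ratio."""
    if not gold or len(predicted) != len(gold):
        raise ValueError("predicted and gold must be non-empty and equally long")


def per_feature_accuracy(
    predicted: List[Pronunciation], gold: List[Pronunciation]
) -> Dict[str, float]:
    """Fraction of characters whose value for each single feature is correct."""
    _check(predicted, gold)
    return {
        f: sum(p[f] == g[f] for p, g in zip(predicted, gold)) / len(gold)
        for f in FEATURES
    }


def whole_character_accuracy(
    predicted: List[Pronunciation], gold: List[Pronunciation]
) -> float:
    """Assumed reading of OPFA: a character is correct only if all features match."""
    _check(predicted, gold)
    correct = sum(
        all(p[f] == g[f] for f in FEATURES) for p, g in zip(predicted, gold)
    )
    return correct / len(gold)


# Toy data for illustration only; not real dialect pronunciations.
gold = [
    {"initial": "k", "final": "a", "tone": 1},
    {"initial": "t", "final": "i", "tone": 2},
    {"initial": "p", "final": "u", "tone": 3},
    {"initial": "m", "final": "a", "tone": 4},
]
predicted = [
    {"initial": "k", "final": "a", "tone": 1},
    {"initial": "t", "final": "i", "tone": 3},  # tone wrong
    {"initial": "p", "final": "u", "tone": 3},
    {"initial": "n", "final": "a", "tone": 4},  # initial wrong
]

feature_scores = per_feature_accuracy(predicted, gold)
overall_score = whole_character_accuracy(predicted, gold)
```

On this toy data the whole-character score is lower than any single-feature score, since one wrong feature is enough to mark the character as incorrect. If the paper instead averages over features, only `whole_character_accuracy` would need to change.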

    Files in this item:

    File         Description   Size   Format   Views
    index.html                 0Kb    HTML     14


    All items in NCUIR are protected by the original copyright.
