中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/81960
English  |  正體中文  |  简体中文  |  全文笔数/总笔数 : 80990/80990 (100%)
造访人次 : 42409350      在线人数 : 895
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜寻范围 查询小技巧:
  • 您可在西文检索词汇前后加上"双引号",以获取较精准的检索结果
  • 若欲以作者姓名搜寻,建议至进阶搜寻限定作者字段,可获得较完整数据
  • 进阶搜寻


    jsp.display-item.identifier=請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/81960


    题名: 漢語之端到端語音合成研究;Mandarin End-To-End Text-To-Speech Research
    作者: 王紹全;WANG, SHAO-CHUAN
    贡献者: 資訊工程學系在職專班
    关键词: 端到端;語音合成;深度學習;End-To-End;speech synthesis;deep learning
    日期: 2019-09-25
    上传时间: 2020-01-07 15:10:34 (UTC+8)
    出版者: 國立中央大學
    摘要: 語音合成即指將文本合成語音的技術,在過去一個語音合成系統
    通常分多個階段處理,並涉及了語音學、聲學等相關領域知識,因
    此造就了高的技術門檻,由於近年來硬體技術的提升,以往基於神
    經網絡架構的深度學習方法在近期廣為研究者使用,本論文亦將深
    度學習技術應用到文字轉語音(TTS)系統上,利用端到端語音合成架
    構,透過訓練用語音訓練出單一神經網路模型,捨棄傳統由時間模
    型、聲學特徵等多個模型生成語音的架構,只使用一個端到端模
    型, 輸入文字即可生成目標語音。

    目前常見的端到端語音合成研究以英語語系為主,然而,只要找到
    文字和語音的對應關係,我們也可將其應用在其他非英語語系合
    成, 本論文利用漢語拼音方案的字母音標取代中文注音,以此取代
    中文文字作為訓練的資料,以實現中文的語音合成,未來也希望能
    以此概念將端到端語音合成推廣到其他非英文語系的使用。;Speech synthesis refers to the technique of synthesizing text into speech,In the past a speech synthesis system usually has multiple stages of processing, and it
    also related to phonetics, acoustics or other related domain knowledge, which creates high technical threshold. Due to the advancement of hardware technology in recent
    years, the deep learning methods based on neural network architecture have been widely used by researchers recently. This paper also applies deep learning technology to text-to-speech. (TTS) system , by using End-To-End speech synthesis architecture, training a single neural network model through audio training data, and abandoning the traditional architecture of generating speech from multiple models such as time models and acoustic features, use only an end-to-end model to enter the text to generate the target speech .

    Current End-To-End speech synthesis research is mainly in English, however, as long as we find the correspondence between text and speech, we can also apply it to other non-English language synthesis. This thesis replaces Chinese phonetic transcription with the phonetic symbols from Scheme of the Chinese Phonetic Alphabet, which replaces Chinese characters as training materials to achieve Chinese speech synthesis. And I hope that this concept can be used to implement other non-English languages end-to-end speech synthesis too.
    显示于类别:[資訊工程學系碩士在職專班 ] 博碩士論文

    文件中的档案:

    档案 描述 大小格式浏览次数
    index.html0KbHTML248检视/开启


    在NCUIR中所有的数据项都受到原著作权保护.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明