    Please use this permanent URL to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/81225


    Title: 端到端輕量化音樂源分離深度學習模型; Lightweight End-to-End Deep Learning Model for Music Source Separation
    Author: 王耀霆; Wang, Yao-Ting
    Contributor: Department of Computer Science and Information Engineering
    Keywords: Deep Learning; Speech Enhancement; Audio Source Separation
    Date: 2019-07-31
    Upload time: 2019-09-03 15:39:52 (UTC+8)
    Publisher: National Central University
    Abstract: Deep neural networks (DNNs) have advanced rapidly in the field of audio processing. Most earlier approaches operate on spectral information obtained with the short-time Fourier transform (STFT), but many of them process only the real part. To avoid the information loss caused by ignoring the complex-valued information, recent work has proposed deep learning models for audio source separation that operate end to end directly on the time-domain signal. These models have two drawbacks. First, they are large, with many parameters, which makes them difficult to use on devices with limited computing performance. Second, they generally need a long input to separate sources well, which implies high latency and makes them less useful for applications that require low latency.
    Building on previous research, this thesis proposes a lightweight end-to-end deep learning model for music source separation that reduces the number of parameters and speeds up computation. It also proposes a novel decoder that further improves separation quality when the input context length is limited. Experimental results show that the proposed method achieves better separation than previous models while using 10% or less of their parameters.
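    The information loss mentioned in the abstract can be illustrated with a small sketch. It is not taken from the thesis: it assumes a single frame and a naive DFT as a stand-in for one STFT frame (no windowing or overlap), and the test signal and the names `dft`, `idft`, and `max_abs_error` are hypothetical. Keeping the full complex spectrum recovers the frame, while keeping only the real part recovers only its even-symmetric component.

```python
import cmath
import math


def dft(frame):
    """Naive discrete Fourier transform of one frame (O(N^2), illustration only)."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Inverse of dft(); returns the real part of each reconstructed sample."""
    n = len(spectrum)
    return [
        (sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n).real
        for t in range(n)
    ]


def max_abs_error(a, b):
    """Largest absolute sample-wise difference between two equal-length sequences."""
    return max(abs(x - y) for x, y in zip(a, b))


# Hypothetical test frame: a sinusoid with a phase offset (an assumption, not thesis data).
N = 64
frame = [math.sin(2 * math.pi * 3 * t / N + 0.7) for t in range(N)]

spectrum = dft(frame)

# Case 1: keep the full complex spectrum, then invert.
full_error = max_abs_error(frame, idft(spectrum))

# Case 2: discard the imaginary part of every coefficient, then invert.
real_only = [complex(c.real, 0.0) for c in spectrum]
real_only_recon = idft(real_only)
real_only_error = max_abs_error(frame, real_only_recon)

# For a real-valued frame, inverting the real part alone yields its even component.
even_part = [(frame[t] + frame[(N - t) % N]) / 2 for t in range(N)]
even_part_error = max_abs_error(real_only_recon, even_part)

print("reconstruction error, complex spectrum:", full_error)
print("reconstruction error, real part only:  ", real_only_error)
```

    Time-domain end-to-end models of the kind the abstract describes avoid this step altogether by taking the waveform samples as input.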
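    Appears in Collections: [Graduate Institute of Computer Science and Information Engineering] Master's and Doctoral Theses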

    Files in this item:

    File        Description  Size  Format  Views
    index.html               0Kb   HTML    116


    All items in NCUIR are protected by original copyright.
