中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/72263
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 80990/80990 (100%)
Visitors : 41967635      Online Users : 835
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version


    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/72263


    Title: 基於複數深層類神經網路之單通道訊號源分離;Monaural Source Separation Based on Complex-valued Deep Neural Network
    Authors: 王書凡;Wang,Shu-Fan
    Contributors: 資訊工程學系
    Keywords: 深層學習;盲訊號源分離;相位;Deep Learning;Blind Source Separation;Phase
    Date: 2016-08-25
    Issue Date: 2016-10-13 14:35:26 (UTC+8)
    Publisher: 國立中央大學
    Abstract: 深層類神經網路(Deep neural network, DNN)目前已成為處理訊號源分離問題之熱門方法。其中,幾乎所有以DNN為基礎之分離方法皆只用混合訊號頻譜之能量(Magnitude)做為網路訓練資料,而忽略了相位(Phase)這個隱含在短時傅立葉轉換(STFT)係數中之重要資訊。然而,最近的研究表明,加入相位資訊可以提升分離訊號的聽覺品質。故而在本論文中,我們在進行分離時保留頻譜之相位資訊,從輸入混合訊號中估算目標來源訊號之STFT係數,並視其為一複數域回歸問題。我們發展複數深層類神經網路(Complex-valued Deep neural network),來學習混合訊號之STFT係數到來源訊號之STFT係數間的非線性映射,做法是利用STFT將混合訊號轉至時頻域後,將其複數STFT係數輸入複數深層類神經網路中,藉此同時考慮能量與相位資訊。此外本論文也提出在成本函數部分加入具有重建性及稀疏性限制式,以提升訊號分離效果。在實驗上,我們將所提出的方法分別應用於語音分離和歌唱分離中。;Deep neural networks (DNNs) have become a popular means of separating a target source from a mixed signal. Almost all DNN-based methods modify only the magnitude spectrum of the mixture. The phase spectrum is left unchanged, which is inherent in the short-time Fourier transform (STFT) coefficients of the input signal. However, recent studies have revealed that incorporating phase information can improve the perceptual quality of separated sources. Accordingly, in this paper, estimating the STFT coefficients of target sources from an input mixture is regarded a regression problem. A fully complex-valued deep neural network is developed herein to learn the nonlinear mapping from complex-valued STFT coefficients of a mixture to sources. The proposed method is applied to speech separation and singing separation.
    Appears in Collections:[Graduate Institute of Computer Science and Information Engineering] Electronic Thesis & Dissertation

    Files in This Item:

    File Description SizeFormat
    index.html0KbHTML300View/Open


    All items in NCUIR are protected by copyright, with all rights reserved.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明