博碩士論文 106522015 完整後設資料紀錄

DC 欄位 語言
DC.contributor資訊工程學系zh_TW
DC.creator王耀霆zh_TW
DC.creatorYao-Ting Wangen_US
dc.date.accessioned2019-7-31T07:39:07Z
dc.date.available2019-7-31T07:39:07Z
dc.date.issued2019
dc.identifier.urihttp://ir.lib.ncu.edu.tw:88/thesis/view_etd.asp?URN=106522015
dc.contributor.department資訊工程學系zh_TW
DC.description國立中央大學zh_TW
DC.descriptionNational Central Universityen_US
dc.description.abstract深度類神經網路(DNN)在音訊處理的領域中進展快速,過去大多利用經由短時傅立葉轉換(STFT)出來的頻譜資訊進行處理,但其中許多作法都是僅處理實數部分,近年來為了避免複數資訊未被考慮而造成的資訊損失,陸續提出了基於時域資訊直接進行端到端處理的音源分離深度學習模型。不過這些方法一來模型龐大,參數量多,在設備運算效能受限的狀態下難以利用;另一方面,一般都需要較長時間的輸入才能獲得良好的分離效果,這代表著高延遲,對於需要低延遲的應用而言較無助益。 本論文基於前人之研究提出端到端輕量化音樂源分離深度學習模型,減少模型參數量並加速運算,並提出新穎的解碼器來進一步提升在輸入時間長度受限的狀態下的分離效果。實驗結果表明,本論文提出的方法,只需過去10%以下或是更少的參數量,就能獲得優於之前的分離結果。 zh_TW
dc.description.abstractDNNs(Deep neural networks) have made rapid progress in the field of audio processing. In the past, most of them used spectrum information via STFT (Short Term Fourier Transform), but them usually only deal with real parts. In recent years, in order to avoid the information loss caused by the lack of consideration of complex value, deep learning models have gradually been proposed for audio source separation based on time domain for end-to-end processing. However, those models are huge, i.e., the number of parameters is very large. Therefore, it is difficult to use them where the computing resources of the device is limited. On the other hand, it generally takes a long term input to obtain a good result for separation, which represents high delay. It is less helpful for some applications that require low latency. Based on the previous research, this thesis proposes a lightweight end-to-end music source separation deep learning model. To reduce the number of parameters and accelerate the computation, and then propose a novel decoder that can further enhance the result of separation while the input context length is limited. The experimental results show that the method proposed in this paper can obtain better than the previous results by only uses 10% or less parameters. en_US
DC.subject深度學習zh_TW
DC.subject語音增強zh_TW
DC.subject音源分離zh_TW
DC.subjectDeep Learningen_US
DC.subjectSpeech Enhancementen_US
DC.subjectAudio Source Separationen_US
DC.title端到端輕量化音樂源分離深度學習模型zh_TW
dc.language.isozh-TWzh-TW
DC.titleLightweight End-to-End Deep Learning Model for Music Source Separationen_US
DC.type博碩士論文zh_TW
DC.typethesisen_US
DC.publisherNational Central Universityen_US

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明