中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/93214
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 81570/81570 (100%)
Visitors : 47019412      Online Users : 105
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version


    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/93214


    Title: 具有基於神經網路的音訊瞬態/穩態分離和多頻 段壓縮器之虛擬低音系統;Virtual bass system with neural network based transient/stationary audio separation and multiband compressor
    Authors: 周軒宇;Chou, Xuan-Yu
    Contributors: 電機工程學系
    Keywords: 虛擬低音系統;音訊瞬態/穩態分離;深度神經網路;非線性元件;相位聲碼器;多頻段壓縮器;Virtual bass system;Stationary-transient source separation;Deep neural networks;Nonlinear device;Phase vocoder;Multiband compressor
    Date: 2023-04-21
    Issue Date: 2024-09-19 16:48:46 (UTC+8)
    Publisher: 國立中央大學
    Abstract: 移動多媒體電子設備的尺寸和厚度減小,嚴格限制了可配置揚聲器的尺
    寸。因此,很容易犧牲播放的音質,尤其是低音。在本文中,我們提出了一
    種基於神經網路的虛擬低音增強系統來解決這項問題。此外,虛擬低音增強
    任務中產生的額外諧波可能會導致算術溢出而發生削波失真。因此,我們在
    系統末端添加了一個多頻段壓縮器,以減少由於虛擬低音增強而導致的削
    波失真。虛擬低音增強可以分為兩種主要的方法,一種是非線性元件(Non linear Device, NLD),另一種則是相位聲碼器(Phase Vocoder, PV)。NLD 通
    過非線性元件(如乘法迴圈)直接在時域中產生諧波來實現虛擬低音; 而 PV
    首先將訊號轉換至頻域,並使用頻譜偏移產生更高次的諧波。相較之下,由
    於其設計特性,NLD 更適合使用於鼓和打擊樂等瞬態訊號(transient signal),
    而 PV 更適合人聲等穩態訊號(stationary signal)。因此,我們首先使用神經
    網路將輸入音訊訊號分離成瞬態和穩態分量,並分別對它們應用虛擬低音
    增強的方法,我們使用這些技術提出了一個完整的虛擬低音增強系統。最後,
    通過與其他虛擬低音系統相比的主觀聽覺測試,可以驗證我們的虛擬低音
    系統具有更高的低音感知和更低的失真。
    ;The reduced size and thickness of mobile multimedia electronics strictly limit
    the size of the configurable loudspeakers. As a result, it is easy to sacrifice the
    sound quality of playback, especially the bass. In this paper, we propose a neural
    network-based virtual bass system to solve this problem. In addition, the
    additional harmonics generated in the virtual bass enhancement may lead to
    arithmetic overflow and distortion due to clipping. Therefore, we add a multiband
    compressor at the end of the system to reduce clipping due to virtual bass
    enhancement. Virtual bass enhancement can be divided into two main approaches,
    one is the non-linear device (NLD), and the other is the phase vocoder (PV). NLD
    achieves virtual bass by generating harmonics directly in the time domain through
    a non-linear device such as a multiplication loop. The PV first converts the signal
    to the frequency domain and uses spectrum shifting to generate higher harmonics.
    In contrast, due to their design characteristics, NLDs are more suitable for
    transient signals such as drums and percussion, while PVs are more suitable for
    stationary signals such as vocals. Therefore, we first use a neural network to split
    the input audio signal into transient and stationary components and apply the
    virtual bass enhancement to them separately. We use these techniques to propose
    a complete virtual bass enhancement system. Finally, using subjective listening
    tests compared with other virtual bass systems, we can see that our virtual bass
    system has higher bass perception and lower distortion.
    Appears in Collections:[Graduate Institute of Electrical Engineering] Electronic Thesis & Dissertation

    Files in This Item:

    File Description SizeFormat
    index.html0KbHTML53View/Open


    All items in NCUIR are protected by copyright, with all rights reserved.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明