NCU Institutional Repository (theses and dissertations, past exam papers, journal articles, and research projects available for download): Item 987654321/65821


    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/65821


    Title: 應用以傅立葉轉換為基礎之動態性局部二值模式於自動化聲音訊號辨識;Automatic Recognition of Audio Signal Using Dynamic Local Binary Patterns Based on Fourier Transform
    Authors: 溫偉森;Gunawan,David
    Contributors: Department of Computer Science and Information Engineering
    Keywords: Fourier Transform;Audio Signal Recognition;Local Binary Patterns;Automatic Recognition;Audio Signal
    Date: 2014-08-27
    Issue Date: 2014-10-15 17:11:04 (UTC+8)
    Publisher: National Central University
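    Abstract: Sound recognition has long been an important research topic because its development makes daily life more convenient, and in recent years the technology has been widely adopted on mobile devices such as smartphones and tablets. Developing an audio recognition system that performs well is therefore important. Sound can be divided into many types; in this thesis, we study environmental sound events.
    The proposed recognition system is based on the Fourier transform. It combines the proposed dynamic Local Binary Pattern (LBP) Uniform method with smoothing filters, and uses the Variance Measure (VAR) as preprocessing to enhance the edge texture and contrast of the spectrogram.
    In the proposed system, a box filter and a Gaussian filter smooth the spectrogram obtained from the Fourier transform. In addition, taking into account how the energy distribution differs across the spectrogram, we propose a dynamic LBP Uniform method: the spectrogram is divided into frequency-band regions, and the resolution for each frequency range is adjusted dynamically by reducing the dimension of the LBP histogram. The resulting feature parameters are classified with a Support Vector Machine (SVM) to recognize environmental sounds.;Sound recognition has become an important application on some devices. The sounds to be recognized vary, e.g., musical instrument sounds, environmental sounds, and speech. In this study, we use environmental sounds for our experiments.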
    A time-frequency representation (spectrogram) of an audio signal is a form of texture image, so image classification techniques can be applied to it. In this paper, we introduce a simple image classification method that uses local binary patterns (LBP), with image smoothing applied prior to feature extraction to reduce noise in the spectrogram image.
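
As a rough illustration of treating a spectrogram as a texture image, the sketch below computes a uniform LBP histogram over a 2-D grid of magnitudes. The abstract does not give the LBP parameters, so the 8-neighbor, radius-1 layout, the 59-bin convention (58 uniform codes plus one shared bin for all other codes), and every function name here are assumptions made for illustration, not the thesis's implementation.

```python
def transitions(code, bits=8):
    """Count 0/1 changes around the circular bit string of `code`."""
    rotated = (code >> 1) | ((code & 1) << (bits - 1))
    return bin(code ^ rotated).count("1")

# A code is "uniform" if it has at most two circular transitions.
# Each uniform code gets its own bin; all other codes share one extra bin.
UNIFORM_CODES = [c for c in range(256) if transitions(c) <= 2]
BIN_OF = {code: index for index, code in enumerate(UNIFORM_CODES)}
NON_UNIFORM_BIN = len(UNIFORM_CODES)

# Neighbor offsets (row, col), clockwise starting at the top-left pixel.
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]

def lbp_code(img, r, c):
    """Threshold the 8 neighbors of (r, c) against the center value."""
    center = img[r][c]
    code = 0
    for bit, (dr, dc) in enumerate(OFFSETS):
        if img[r + dr][c + dc] >= center:
            code |= 1 << bit
    return code

def uniform_lbp_histogram(img):
    """Normalized uniform-LBP histogram over the interior pixels of `img`."""
    hist = [0] * (NON_UNIFORM_BIN + 1)
    for r in range(1, len(img) - 1):
        for c in range(1, len(img[0]) - 1):
            hist[BIN_OF.get(lbp_code(img, r, c), NON_UNIFORM_BIN)] += 1
    total = sum(hist) or 1
    return [count / total for count in hist]
```

Here `img` is assumed to be a list of rows of spectrogram magnitudes; border pixels are skipped because they lack a full 3x3 neighborhood.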
    In this thesis, we combine spectrograms and uniform LBP with an image filter and a variance measure (VAR) for contrast enhancement. We also introduce a dynamic LBP method that reduces the feature dimension by a different amount for each sub-band (high, middle, and low frequency). After applying the image filter as pretreatment and VAR for contrast enhancement, we concatenate all of these features.
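
The per-sub-band dimension reduction can be pictured with the sketch below. The abstract does not say how the histogram dimension is reduced or which sizes are used, so merging adjacent bins, splitting into equal-sized bands, and the target sizes in the usage note are all hypothetical choices made only to show the shape of the idea.

```python
def split_bands(spectrogram, n_bands=3):
    """Split rows (assumed to be frequency bins, lowest first) into contiguous bands."""
    n = len(spectrogram)
    bounds = [i * n // n_bands for i in range(n_bands + 1)]
    return [spectrogram[bounds[i]:bounds[i + 1]] for i in range(n_bands)]

def reduce_bins(hist, n_out):
    """Shrink a histogram to n_out bins by summing groups of adjacent bins.

    This merging rule is an assumption; the total mass is preserved.
    """
    out = [0.0] * n_out
    for i, value in enumerate(hist):
        out[i * n_out // len(hist)] += value
    return out

def dynamic_feature(band_hists, band_dims):
    """Reduce each band's histogram to its own size, then concatenate."""
    feature = []
    for hist, dim in zip(band_hists, band_dims):
        feature.extend(reduce_bins(hist, dim))
    return feature
```

For example, three 59-bin band histograms reduced to the hypothetical sizes `(59, 30, 10)` give a concatenated feature vector of length 99, which would then be passed to the classifier.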
    To remove image noise, we use two types of smoothing filter: a box filter (mean filter) and a Gaussian filter. Filtering is applied as pretreatment prior to feature extraction to improve recognition. To enhance local image texture contrast, such as object edges and corners, we use a VAR function. We use a support vector machine as the classifier.
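
A minimal sketch of the two smoothing filters and a local variance measure follows. The abstract does not state kernel sizes or the exact VAR definition, so the 3x3 kernels, the edge-replication border handling, and the variance taken over the 8 neighbors of a pixel are assumptions; the function names are illustrative only.

```python
# 3x3 mean kernel and a common 3x3 Gaussian approximation (both sum to 1).
BOX = [[1 / 9] * 3 for _ in range(3)]
GAUSS = [[1 / 16, 2 / 16, 1 / 16],
         [2 / 16, 4 / 16, 2 / 16],
         [1 / 16, 2 / 16, 1 / 16]]

def convolve3x3(img, kernel):
    """Apply a 3x3 kernel to `img`, replicating edge pixels at the border."""
    rows, cols = len(img), len(img[0])
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            acc = 0.0
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    rr = min(max(r + dr, 0), rows - 1)
                    cc = min(max(c + dc, 0), cols - 1)
                    acc += kernel[dr + 1][dc + 1] * img[rr][cc]
            out[r][c] = acc
    return out

def local_variance(img, r, c):
    """Variance of the 8 neighbors of interior pixel (r, c).

    Flat regions give 0; edges and corners give larger values.
    """
    neighbors = [img[r + dr][c + dc]
                 for dr in (-1, 0, 1) for dc in (-1, 0, 1)
                 if (dr, dc) != (0, 0)]
    mean = sum(neighbors) / 8
    return sum((g - mean) ** 2 for g in neighbors) / 8
```

Under these assumptions, `convolve3x3(spec, BOX)` or `convolve3x3(spec, GAUSS)` would be applied to the spectrogram before feature extraction, and `local_variance` would be evaluated at each interior pixel to emphasize high-contrast texture.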
    Appears in Collections:[Graduate Institute of Computer Science and Information Engineering] Electronic Thesis & Dissertation

    Files in This Item:

    File         Description   Size   Format
    index.html                 0Kb    HTML     462


    All items in NCUIR are protected by copyright, with all rights reserved.
