中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/72278
English  |  正體中文  |  简体中文  |  全文笔数/总笔数 : 78728/78728 (100%)
造访人次 : 34344981      在线人数 : 986
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜寻范围 查询小技巧:
  • 您可在西文检索词汇前后加上"双引号",以获取较精准的检索结果
  • 若欲以作者姓名搜寻,建议至进阶搜寻限定作者字段,可获得较完整数据
  • 进阶搜寻


    jsp.display-item.identifier=請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/72278


    题名: 聲學場景分類運用卷積神經網絡;Acoustic scene classification using self-determination convolutional neural network
    作者: 沈安迪;Santoso,Andri
    贡献者: 資訊工程學系
    关键词: 聲學場景分類運用卷積神經網絡;acoustic scene classification;deep learning;audio processing
    日期: 2016-08-26
    上传时间: 2016-10-13 14:36:38 (UTC+8)
    出版者: 國立中央大學
    摘要: 自動場景分類在機器學習研究領域中是個熱門的議題。許多研究專注於以視覺為基礎做自動場景分類,而使用聲音為基礎做場景分類的研究則相對較少。以聲音為基礎的場景分類系統,或稱為聲學場景分類,分析輸入的聲音資料,並自動分類紀錄聲音的環境場景。當視覺資訊無法取得時,聲學場景分類可視為以視覺為基礎的場景分類的延伸。當聲音資訊被取得,聲學場景分類系統可以分類場景,因此可被稱為機器聽覺。此領域有數種針對聲學場景分類提出的方法。近年來,使用電腦視覺技術分析聲學事件的研究愈來愈多。此外,深層學習的研究也受到許多注意。深層學習在許多領域都展現傑出的效果。本篇論文中,針對聲學場景分類問題提出了以深層學習為基礎的方法。;Automatic scene classification is an active issue in the machine learning research field. While many works put a lot of focus on visual based approach, relatively little attention has been put to solve the problem of automatic scene classification using audio-based approach. The audio-based scene classification, or is known as acoustic scene classification (ASC), analyzes the input of audio data to automatically identify the scene of environment where the sound was recorded. Furthermore, the works in ASC can be seen as an alternative to visual-based approach when the performance of visual-based classifier is compromised. The audio-based approach has benefit, that as long as the sound can be listened, the practical ASC system will be able to perform scene classification, thus the obscuring object problem that exists in visual-based approach can be alternatively addressed. In this field, there have been a number of proposed approach to address the problem of audio-based scene classification. In recent years, there is an increasing interest of adopting the approach from computer vision research field to address the problem in audio analysis. Moreover, the research works of deep learning have attracted many attention. The deep learning based system has presented a promising result in many fields. In this thesis, the problem of ASC is solved using deep learning based approach. Several ASC systems, including the proposed system, have been implemented and discussed in the experiments. The results show the superiority of proposed system versus another systems that have been discussed in this thesis.
    显示于类别:[資訊工程研究所] 博碩士論文

    文件中的档案:

    档案 描述 大小格式浏览次数
    index.html0KbHTML551检视/开启


    在NCUIR中所有的数据项都受到原著作权保护.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明