中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/71964
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 78852/78852 (100%)
Visitors : 38571579      Online Users : 554
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version


    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/71964


    Title: 基於機器學習方法之巨量音樂檢索系統;Large-Scale Music Retrieval System Using Machine Learning Approaches
    Authors: 黃梓翔;Huang,Tzu-Hsiang
    Contributors: 通訊工程學系
    Keywords: 音樂資訊檢索;翻唱歌曲辨識;二維傅立葉轉換;機器學習;Music information retrieval;Cover song identification;2D-Fourier transform;Machine learning
    Date: 2016-07-26
    Issue Date: 2016-10-13 14:09:19 (UTC+8)
    Publisher: 國立中央大學
    Abstract: 在大數據的時代中,網際網路上的多媒體資訊量以指數性成長,如何正確地尋找特定多媒體資訊成為一個重要的研究議題。
    本系統參考翻唱歌曲辨識的理論架構,利用歌曲的音樂內涵式特徵,消除不同樂器、語言、歌手等等演奏時的音色、調性與些微結構差異,尋找資料庫中與輸入歌曲俱有相似旋律特徵的歌曲。
    在內涵式音樂檢索領域中,由於不同歌曲的時間長度不一,先前的研究以輸入歌曲對整個資料庫的歌曲進行高複雜度的比對來計算兩首歌曲的相似度,最後輸出資料庫中相似度最高的歌曲清單,這種方法雖然盡可能提升辨識正確率,但是消耗過多的運算資源,在大規模的資料庫並不可行。本研究提出在大規模資料庫中快速檢索特定相似歌曲的系統,系統擷取音樂的頻譜特徵並以二維傅立葉轉換壓縮資料,接著合併成固定長度的向量,再以K-Means、主成份分析、線性判別分析等機器學習的方式強化向量的模式特徵,藉此將資料庫的全部歌曲投影到一個向量空間,系統直接比對查詢歌曲與資料庫歌曲的向量距離,將相似度最高的音樂作為回饋歌單。本系統不僅大幅度地提升內涵式音樂檢索的效率,更探討音樂檢索結合機器學習的潛力。
    ;In this work, we proposed a music retrieval system which can search the similar music in large-scale database.
    Large-scale similar music recognition should calculate song-to-song simi-larity that can accommodate differences in timing, key and tempo. Simple vector distance measure is not powerful enough to perform the similar music recogni-tion task, but expensive solutions such as dynamic time warping do not scale to millions of instances, making the similar music recognition inappropriate for commercial-scale application. In this work, we used the content-based music features of songs as input and transformed them into semantic vectors by 2D-Fourier transform. We even explored different machine learning approaches to learn and reinforce the pattern of these semantic vector. By projecting the songs into the sematic vector space, we can use the efficient nearest neighbor algorithm to compare the similarity of songs and retrieve the most similar songs in the large-scale database.
    The proposed system is not only efficient enough to perform scalable con-tent-based music retrieval, but also develop the potential of machine learning approaches, making the similar music recognition application more fast and accurate.
    Appears in Collections:[Graduate Institute of Communication Engineering] Electronic Thesis & Dissertation

    Files in This Item:

    File Description SizeFormat
    index.html0KbHTML400View/Open


    All items in NCUIR are protected by copyright, with all rights reserved.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明