博碩士論文 92521002 詳細資訊




以作者查詢圖書館館藏 以作者查詢臺灣博碩士 以作者查詢全國書目 勘誤回報 、線上人數:31 、訪客IP:3.145.69.255
姓名 王永宗(Yung-Tsung Wang)  查詢紙本館藏   畢業系所 電機工程學系
論文名稱 壓縮域中利用以歌找歌方式之內涵式搜尋應用於嵌入式系統設計
(Compression Domain Content-Based Retrieval of Audio Example for Embedded System Design)
相關論文
★ 即時的SIFT特徵點擷取之低記憶體硬體設計★ 即時的人臉偵測與人臉辨識之門禁系統
★ 具即時自動跟隨功能之自走車★ 應用於多導程心電訊號之無損壓縮演算法與實現
★ 離線自定義語音語者喚醒詞系統與嵌入式開發實現★ 晶圓圖缺陷分類與嵌入式系統實現
★ 語音密集連接卷積網路應用於小尺寸關鍵詞偵測★ G2LGAN: 對不平衡資料集進行資料擴增應用於晶圓圖缺陷分類
★ 補償無乘法數位濾波器有限精準度之演算法設計技巧★ 可規劃式維特比解碼器之設計與實現
★ 以擴展基本角度CORDIC為基礎之低成本向量旋轉器矽智產設計★ JPEG2000靜態影像編碼系統之分析與架構設計
★ 適用於通訊系統之低功率渦輪碼解碼器★ 應用於多媒體通訊之平台式設計
★ 適用MPEG 編碼器之數位浮水印系統設計與實現★ 適用於視訊錯誤隱藏之演算法開發及其資料重複使用考量
檔案 [Endnote RIS 格式]    [Bibtex 格式]    [相關文章]   [文章引用]   [完整記錄]   [館藏目錄]   [檢視]  [下載]
  1. 本電子論文使用權限為同意立即開放。
  2. 已達開放權限電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。
  3. 請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。

摘要(中) 近幾年來,多媒體資料的搜尋與索引的技術受到愈來愈多的注意,尤其是在內涵式搜尋這塊領域中。在音訊這個範圍裡,許多的研究皆專注在MIDI或WAV這些未壓縮的音樂格式上,很少會針對壓縮過的音樂格式這個領域來探討。然而,MP3在目前已是一個重要且受歡迎的音樂格式,許多網路或是個人電腦中所流通的都是以MP3為主。在本篇論文中,我們提出了一個利用以歌找歌方式並且以MP3為主的內涵式音樂搜尋系統。我們將MP3中的次頻帶值取出來當成特徵值,並且利用量化樹索引方法及旋律線擷取方法將所取出的特徵值建成我們要的索引資料,最後再利用這些索引資料作為輸入歌曲片段與資料庫歌曲之相似度比對的依據。
此外,我們將整個內涵式音樂搜尋系統移植到以ARM為主的嵌入式系統平台上來做實現,整個系統的評估實驗分別進行在PC及嵌入式平台上,實驗結果顯示,在整個搜尋的準確性方面,在第一首找到的機率大約在70%左右,大約有90%的機率可以在前三首歌中找到。
摘要(英) In recent years, the searching and indexing techniques for multimedia data are getting more attention in the area of multimedia databases. As many research works were done on the content-based retrieval of MIDI or waveform format music, less attention was received on the compressed domain audio data. As the explosive growth of internet, MP3 audio has become one of the most important and popular media. Most of the audio accessed from database of Internet are all compressed on MP3 domain, not raw data or even low quality MIDI audio. In this thesis, the content-based retrieval of audio example on MP3-based (MPEG 1 layer III) digital music archive is considered. In the proposed approach, the sub-band coefficients (SBC) in a MP3 frame are used. These values are extracted from the MP3 decoder to compute the MP3 features for indexing the MP3 objects. A quantization-tree (QT) and the melody-line tracking (MLT) method are also proposed for indexing MP3 objects. These indexes are used to measure the similarity between MP3 objects. Evaluations on a content-based MP3 retrieval system are performed on the pc version and platform-based embedded system version which indicate the proposed approach can achieve a good performance.
關鍵字(中) ★ 內涵式搜尋
★ 嵌入式系統
★ 壓縮域音樂
★ 索引法
★ 相似度比對
關鍵字(英) ★ audio retrieval
★ compression domain
★ MP3
★ embedded system
★ content-based
★ indexing
論文目次 ABSTRACT
LIST OF FIGURES
LIST OF TABLES
CHAPTER 1 INTRODUCTION………………………………………………….1
1.1 Background………………………………………………………………...1
1.2 Motivation…………………………………………………………………4
1.3 Thesis Organization………………………………………………………..5
CHAPTER 2 RELATED WORKS……………………………………………….7
2.1 Audio Feature Analysis……………………………………………………7
2.1.1 Physical Features………………………………………………………...8
2.1.2 Perceptual Features……………………………………………………..10
2.2 Audio Representations……………………………………………………10
2.2.1 Digital Audio……………………………………………………………11
2.2.2 Meta-Data……………………………………………………………….12
2.2.3 Melodic Contours……………………………………………………….13
2.2.4 The Fourier Transform………………………………………………….14
2.3 Overview of MPEG-7 Audio……………………………………………..14
2.4 A General Content-Based Retrieval System……………………………...18
CHAPTER 3 PROPOSED APPROACH FOR CONTENT-BASED RETRIEVAL OF QUERY-BY-EXAMPLE..........……………....22
3.1 Algorithm Development………………………………………………….22
3.1.1 Feature Extraction………………………………………………………24
3.1.2 Tone Classification……………………………………………………...26
3.1.3 Slot Processing………………………………………………………….27
3.1.4 Index Construction……………………………………………………...29
3.1.5 Similarity Measurement………………………………………………...33
3.2 Platform-Based Design…………………………………………………...37
3.2.1 Introduction of ARM Integrator Family………………………………...38
3.2.2 Implementation with ARM Embedded System…………………………41
3.2.3 System Integration………………………………………………………44
3.3 Results of Implementation………………………………………………..45
3.3.1 Presentation of Practical System………………………………………..45
3.3.2 Experiment Results……………………………………………………..50
CHAPTER 4 CONCLUSIONS…………………………………………………..54
REFERENCES………………………………………………………………………56
參考文獻 [1] Foote, Jonathan, “Content-Based Retrieval of Music and Audio” in Proc. SPIE, Multimedia Storage and Archiving systems II, Vol.3229, Pages: 138-147.
[2] Wold, E., T. Blum, D. Keislar, and J. Wheaton, “Content-based Classification, Search, and Retrieval of audio”, IEEE Multimedia, Vol. 3, No. 3, Fall 1996, Pages: 27-36.
[3] Zhu Liu; Qian Huang, ”Content-based indexing and retrieval-by-example in audio”, IEEE International Conference on Multimedia and Expo, 2000. ICME 2000. Volume: 2,July-2 Aug. 2000, Pages: 877 -880.
[4] N. Kosugi, Y. Nishihara, S. Kon'ya, M. Yamamuro, and K. Kushima, “Music Retrieval by Humming”, In Proceedings of PACRIM'99,. IEEE, August 1999, Pages: 404-407.
[5] Tomonari Sonoda, Masataka Goto, Yoichi Muraoka,”A WWWbased Melody Retrieval System”, ICME’98 Proc.98, Pages: 349- 352.
[6] Cheng Yang, “Efficient acoustic index for music retrieval with various degrees of similarity”, Proceedings of the tenth ACM international conference on Multimedia, December 2002, Pages: 584 – 591.
[7] Chih-Chin Liu , Po-Jun Tsai, “Content-based retrieval of MP3 music objects”, Proceedings of the tenth international conference on Information and knowledge management, October 05-10, 2001, Atlanta, Georgia, USA, Pages: 506-511.
[8] Wen-Nung Lie, Chen-Kang Su, “Content-based of MP3 songs based on query by singing”, IEEE International Converence on Acoustics Speech and Signal Processing, 2004, vol 5, Pages: 929 – 932.
[9] Tong Zhang and C.-C. Jay Kuo, “Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing”.
[10] Faisal Bashir, Shashank Khanvilkar, Ashfaq Khokhar, and Dan Schonfeld, “Content Based Indexing and Retrieval”.
[11] Karydis, I.; Nanopoulos, A.; Papadopoulos, A.N.; Manolopoulos, Y.; “Audio Indexing for Efficient Music Information Retrieval”, Multimedia Modelling Conference, 2005. MMM 2005. Proceedings of the 11th International, 12-14 Jan. 2005, Pages: 22 – 29.
[12] Steven George Blackburn, “Content Based Retrieval and Navigation of Music Using Melodic Pitch Contours”.
[13] Rui Cai, Lie Lu, Hong-Jiang Zhang, “Using structure patterns of temporal and spectral feature in audio similarity measure”, Proceedings of the eleventh ACM international conference on Multimedia, Berkeley, CA, USA, Pages: 219 – 222.
[14] Quackenbush, S.; Lindsay, A.; “Overview of MPEG-7 audio”, IEEE Transactions on Circuits and Systems for Video Technology, Volume 11, Issue 6, June 2001, Pages: 725 – 729.
[15] Cheng Yang, “Peer-to-peer architecture for content-based music retrieval on acoustic data”, Proceedings of the 12th international conference on World Wide Web, Budapest, Hungary, Pages: 376 – 383.
[16] Piamsa-Nga, P.; Alexandridis, N.A.; Srakaew, S.; Blankenship, G.C., Jr.; Subramanya, S.R.; “In-clip search algorithm for content-based audio retrieval”, Third International Conference on Computational Intelligence and Multimedia Applications, 1999. ICCIMA '99. 23-26 Sept. 1999, Pages: 263 – 267.
[17] Dongmoon Park; Eenjun Hwang; “Popularity-adaptive index scheme for fast music retrieval”, IEEE International Conference on Multimedia and Expo, 2002. ICME '02. Proceedings. 2002, Volume 1, 26-29 Aug. 2002, Pages: 121 – 124.
[18] Ning Hu, Roger B. Dannenberg, “A comparison of melodic database retrieval techniques using sung queries”, Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries, Portland, Oregon, USA, Pages: 301 – 307.
[19] Shih-Fu Chang; Sikora, T.; Purl, A.; “Overview of the MPEG-7 standard”, IEEE Transactions on Circuits and Systems for Video Technology, Volume 11, Issue 6, June 2001, Pages: 688 – 695.
[20] Jungmin Song; So-Young Bae; Kyoungro Yoon; “Query by humming: matching humming query to polyphonic audio”, IEEE International Conference on Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 Volume 1, 26-29 Aug. 2002, Pages: 329 – 332.
[21] Asif Ghias, Jonathan Logan, David Chamberlin, Brian C. Smith, “Query by humming: musical information retrieval in an audio database”, Proceedings of the third ACM international conference on Multimedia, San Francisco, California, United States, Pages: 231 – 236.
[22] Yongwei Zhu; Kankanhalli, M.; Qi Tian; “Similarity matching of continuous melody contours for humming querying of melody databases”, IEEE Workshop on Multimedia Signal Processing, 2002, 9-11 Dec. 2002, Pages: 249 – 252.
[23] Zhu, Y.; Kankanhalli, M.S.; “A robust music retrieval method for query-by-humming”, Proceedings. ITRE2003. International Conference on Information Technology: Research and Education, 2003, 11-13 Aug. 2003, Pages: 89 – 93.
[24] Ya-Dong Wu; Yang Li; Bao-Long Liu; “A new method for approximate melody matching”, International Conference on Machine Learning and Cybernetics, 2003, Volume 5, 2-5 Nov. 2003, Pages: 2687 – 2691.
[25] Hsuan-Huei Shih; Narayanan, S.S.; Kuo, C.-C.Jay.; “Multidimensional humming transcription using a statistical approach for query by humming systems”, International Conference on Multimedia and Expo, 2003. ICME '03. Proceedings. 2003, Volume 3, 6-9 July 2003, Pages: III - 385-8.
[26] Parker, C.; “A tree-based method for fast melodic retrieval”, Conference on Digital Libraries, 2004. Proceedings of the 2004 Joint ACM/IEEE, 7-11 June 2004, Pages: 254 – 255.
[27] Yoshiharu Suga, Naoko Kosugi, Masashi Morimoto, “Real-time background music monitoring based on content-based retrieval”, Proceedings of the 12th annual ACM international conference on Multimedia, New York, NY, USA, Pages: 120 – 127.
[28] Chih-Chin Liu, Chuan-Sung Huang, “A singer identification technique for content-based classification of MP3 music objects”, Proceedings of the eleventh international conference on Information and knowledge management, McLean, Virginia, USA, Pages: 438 – 445.
[29] Jun Gao; Tzanetakis, G.; Steenkiste, P.; “Content-based retrieval of music in scalable peer-to-peer networks”, International Conference on Multimedia and Expo, 2003. ICME '03. Proceedings. 2003, Volume 1, 6-9 July 2003, Pages: I - 309-12.
[30] Keiichiro Hoashi, Kazunori Matsumoto, Naomi Inoue, “Personalization of user profiles for content-based music retrieval based on relevance feedback”, Proceedings of the eleventh ACM international conference on Multimedia, Berkeley, CA, USA, Pages: 110 – 119.
[31] Jyh-Shing Roger Jang, Hong-Ru Lee, “Hierarchical filtering method for content-based music retrieval via acoustic input”, Proceedings of the ninth ACM international conference on Multimedia, Ottawa, Canada, Pages: 401 – 410.
[32] Yuen-Hsien Tseng, “Content-based retrieval for music collections”, Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, Berkeley, California, United States, Pages: 176 – 182.
[33] Tao Li, Mitsunori Ogihara, Qi Li, “A comparative study on content-based music genre classification”, Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, Toronto, Canada, Pages: 282 – 289.
[34] Pierre-Yves Rolland, Gailius Raškinis, Jean-Gabriel Ganascia, “Musical content-based retrieval: an overview of the Melodiscov approach and system”, Proceedings of the seventh ACM international conference on Multimedia (Part 1), Orlando, Florida, United States, Pages: 81 – 84.
[35] Hsuan-Huei Shih; Narayanan, S.S.; Kuo, C.-C.Jay.; “A statistical multidimensional humming transcription using phone level hidden Markov models for query by humming systems”, International Conference on Multimedia and Expo, 2003. ICME '03. Proceedings. 2003, Volume 1, 6-9 July 2003, Pages: I - 61-4.
[36] ARM Integrator/LM-XCV600E+ Logic Module User Guide.
[37] ARM Integrator/CM946E-S Core Module User Guide.
[38] ARM Integrator/AP User Guide.
[39] Johnson, S.E.; Woodland, P.C.; “A method for direct audio search with applications to indexing and retrieval”, IEEE International Conference on Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000, Volume 3, 5-9 June 2000, Pages: 1427 – 1430.
指導教授 蔡宗漢(Tsung-Han Tsai) 審核日期 2005-7-19
推文 facebook   plurk   twitter   funp   google   live   udn   HD   myshare   reddit   netvibes   friend   youpush   delicious   baidu   
網路書籤 Google bookmarks   del.icio.us   hemidemi   myshare   

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明