參考文獻 |
[1] M. A. Casey, R. Veltkamp, M. Goto, M. Leman, C. Rhodes, and M. Slaney, “Content-Based Music Information Retrieval:Current Directions and Feature Challenges,” in Proc. of the IEEE, vol. 96 no. 4, pp. 668-696, April 2008.
[2] 侯志欽,聲學原理與多媒體音訊科技,初版,台灣商務印書館,台北市,民國九十六年。
[3] 陳仁寬,樂理入門與指導,初版,五洲出版有限公司,台北市,民國八十五年。
[4] Music Information Retrieval Evaluation eXchange. http://www.music-ir.org/mirex/wiki/2006:Main_Page
[5] J. Serra, E. Gomez, and P. Herrera, “Audio cover song identification and similarity: background, approaches, evaluation, and beyond,” Advances in Music Information Retrieval, vol. 274, ch. 14, pp. 307-332, March 2010.
[6] D. P. W. Ellis, and G.E. Poliner, “Identifying ‘Cover Songs’with Chroma Features and Dynamic Programming Beat Tracking,” in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Honolulu, Hawaii, U.S.A., pp. 1429-1432, April 15-20, 2007.
[7] J. Serra, and E. Gomez, “Audio cover song identification based on tonal sequence alignment,” in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Las Vegas, Nevada, U.S.A., pp.61-64, March 30- April 4, 2008.
[8] S. Ravuri and D. P. W. Ellis, “Cover song detection: From high scores to general classification,” in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Dallas, Texas, U.S.A., pp. 65-68, March 14-19, 2010.
[9] E. Ravelli, G. Richard, and L. Daudet, “Audio Signal Representations for Indexing in the Transform Domain,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 3, pp. 434-446, March. 2010.
[10] H. Wang, A. Divakaran, A. Vetro, S. Chang, and H. Sun,“Survey of Compressed-Domain Features Used in Audio-Visual Indexing and Analysis,” Journal of Visual Communication and Image Representation, vol. 14, no. 2, pp. 150-183, June 2003.
[11] T. H. Tsai and Y. T. Wang, “Content-Based Retrieval of Audio Example on MP3 Compression Domain,” in Proc. IEEE 6th Workshop on Multimedia Signal Processing, pp.123-126, September 2004.
[12] T. H. Tsai and W. C. Chang, “Two-Stage Method for Specific Audio Retrieval based on MP3 Compression Domain,” in Proc. IEEE International Symposium on Circuits and Systems, pp. 713-716, May 2009.
[13] C. C. Liu and C. S. Huang, “A singer identification technique for content-based classification of MP3 music objects,” in Proc. Int. Conf. on Information and Knowledge Management, McLean, Virginia, U.S.A., pp. 438-445, November 4-9, 2002.
[14] D. Pan, “A Tutorial on MPEG/Audio Compression,” IEEE Multimedia Magazine, summer 1995, pp. 60-74.
[15] International Organization for Standardization, “Information Technology - Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5 Mbit/s - Part 3:Audio,” ISO/IEC 11172-3, March 1999.
[16] International Organization for Standardization, “Information Technology - Generic coding of moving pictures and associated audio information - Part 7:Advanced Audio Coding (AAC), ”ISO/IEC 13818-7, 1997.
[17] International Organization for Standardization, “Information Technology - Coding of audio-visual objects - Part 3: Audio,”ISO/IEC DIS 14496-3, 1998.
[18] M. Muller, D. P. W. Ellis, A. Klapuri, and G. Richard, “Signal Processing for Music Analysis,” IEEE Journal of Selected Topics in Signal Processing, vol. 5, no.6, pp.1088-1110, October 2011.
[19] The musical instrument dynamic ranges and names:http://en.wikipedia.org/wiki/Range_(music)#cite_note-M29-0
[20] Instrument frequency dynamic ranges poster:http://www.independentrecording.net/irn/resources/freqchart/main_display.htm
[21] J. Serra, E. Gomez, P. Herrera, and X. Serra, “Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 16, no. 6, pp. 1138-1151, August 2008.
[22] The Cover 80 cover song data set : http://labrosa.ee.columbia.edu/projects/coversongs/covers80/
[23] T. H. Tsai and C. Liu, “A Configurable Common Filterbank Processpr for Multi-Standard Audio Decoder,” IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, vol. 90, no.9, pp. 1913-1923, September 2007.
[24] T. Bertin-Mahieux and D. P. W. Ellis, “Large-scale cover song recongnition using hashed chroma landmarks,” in Proc. IEEE Workshop on Application of Signal Processing to Audio and Acoustics, New Paltz, NY, U.S.A., pp.117-120, October 16-19, 2011.
|