參考文獻 |
[1] Serra, Joan, Emilia Gómez, and Perfecto Herrera. "Audio cover song iden-tification and similarity: background, approaches, evaluation, and beyond", Advances in Music Information Retrieval, pp. 307-332, Springer Berlin Heidelberg, 2010.
[2] Tzanetakis, George, Andrey Ermolinskyi, and Perry Cook, "Pitch histograms in audio and symbolic music information retrieval", Journal of New Music Research, pp. 143-152, 2003.
[3] T. Bertin-Mahieux and D. P. W. Ellis, "Large-scale cover song recognition using hashed chroma landmarks", 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 117-120, New Paltz, NY, 2011.
[4] Bertin-Mahieux, Thierry, and Daniel PW Ellis, "Large-Scale Cover Song Recognition Using the 2D Fourier Transform Magnitude.", International So-ciety for Music Information Retrieval Conference (ISMIR), 2012.
[5] Khadkevich, Maksim, and Maurizio Omologo, "Large-Scale Cover Song Identification Using Chord Profiles.", International Society for Music In-formation Retrieval Conference (ISMIR), 2013.
[6] M. Marolt, "A Mid-Level Representation for Melody-Based Retrieval in Audio Collections," in IEEE Transactions on Multimedia, vol. 10, no. 8, pp. 1617-1625, Dec. 2008.
[7] Schmidt, Erik, and Youngmoo Kim, "Learning Rhythm And Melody Features With Deep Belief Networks", International Society for Music In-formation Retrieval Conference (ISMIR), 2013.
[8] O. Nieto and J. P. Bello, "Music segment similarity using 2D-Fourier Magni-tude Coefficients," 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 664-668, Florence, 2014.
[9] 林銀議,信號與系統,二版,五南圖書出版股份有限公司,台北市,2009年。
[10] Slaney, Malcolm, Kilian Weinberger, and William White, "Learning a met-ric for music similarity." International Symposium on Music Information Retrieval (ISMIR). 2008.
[11] J. Schluter and C. Osendorfer, "Music Similarity Estimation with the Mean-Covariance Restricted Boltzmann Machine", Machine Learning and Applications and Workshops (ICMLA), pp. 118-123, 2011 10th International Conference on, Honolulu, HI, 2011.
[12] J. Stephen Downie: MIREX 2006:Audio Cover Song. 2006, from http://www.music-ir.org/mirex/wiki/2006:Audio_Cover_Song
[13] Ranjani, S. Sri, et al. "Application of SHAZAM-Based Audio Finger-printing for Multilingual Indian Song Retrieval", Advances in Communi-cation and Computing, pp. 81-92, Springer India, 2015.
[14] Bertin-Mahieux, Thierry, et al. "The million song dataset", International Society for Music Information Retrieval Conference (ISMIR). Vol. 2. No. 9. 2011.
[15] Pedregosa, Fabian, et al, "Scikit-learn: Machine learning in Python", Journal of Machine Learning Research, pp. 2825-2830, 12, Oct, 2011.
[16] E. J. Humphrey, J. P. Bello, and Y. LeCun, “Moving beyond feature design: Deep architectures and automatic feature learning in music informatics”, International Society for Music Information Retrieval Conference (ISMIR), Porto, Portugal, October 2012.
[17] Honglak Lee, Peter Pham, Yan Largman, and Andrew Ng, “Unsupervised feature learning for audio classification using convolutional deep belief networks”, Advances in Neural Information Processing Systems, 22. 2009.
[18] Hamel, Philippe, and Douglas Eck, "Learning Features from Music Audio with Deep Belief Networks", International Society for Music Information Retrieval Conference (ISMIR), 2010.
[19] Humphrey, Eric J., Juan P. Bello, and Yann LeCun, "Feature learning and deep architectures: new directions for music informatics", Journal of Intel-ligent Information Systems, pp. 461-4814. 2013.
[20] Y. Kim, H. Lee and E. M. Provost, "Deep learning for robust feature generation in audiovisual emotion recognition", 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 3687-3691, Vancouver, BC, 2013.
[21] Dieleman, Sander, and Benjamin Schrauwen, "Multiscale approaches to music audio feature learning", International Society for Music Information Retrieval Conference (ISMIR), Pontifícia Universidade Católica do Paraná, 2013.
[22] S. Dieleman and B. Schrauwen, "End-to-end learning for music audio" ,2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6964-6968, Florence, 2014..
[23] Coates, Adam, Honglak Lee, and Andrew Y. Ng, "An analysis of sin-gle-layer networks in unsupervised feature learning", Ann Arbor, 2010. |