摘要: With the explosive growth in the number of music albums produced, retrieving music information has become a critical aspect of managing music data. Extracting frequency parameters directly from the compressed files to represent music greatly benefits processing speed when working on a large database. In this study, we focused on advanced audio coding (AAC) files and analyzed the disparity in frequency expression between discrete Fourier transform and discrete cosine transform, considered the frequency resolution to select the appropriate frequency range, and developed a direct chroma feature-transformation method in the AAC transform domain. An added challenge to using AAC files directly is long/short window switching, ignoring which may result in inaccurate frequency mapping and inefficient information retrieval. For a short window in particular, we propose a peak-competition method to enhance the pitch information that does not include ambiguous frequency components when combining eight subframes. Moreover, for chroma feature segmentation, we propose a simple dynamic-segmentation method to replace the complex computation of beat tracking. Our experimental results show that the proposed method increased the accuracy rate by approximately 7 % in Top-1 search results over transform-domain methods described previously and performed nearly as effectively as state-of-the-art waveform-domain approaches did. 其他題名: Multimed Tools Appl 出版者: New York: Springer US 出版日期: 2015-09-01 出處: Multimedia tools and applications, 2015-09, Vol.74 (18), p.7921-7942 資源來源: ABI/INFORM Collection 版權: Springer Science+Business Media New York 2014 版權: Springer Science+Business Media New York 2015 識別號: ISSN: 1380-7501 識別號: EISSN: 1573-7721 識別號: DOI: 10.1007/s11042-014-2031-1