摘要: In this paper, we proposed a singer identification approach to automatically identify the singer of an unknown MP3 audio data. Differing from previous researches for singer identification in MP3 compressed domain, we use Mel-Frequency Cepstral Coefficients (MFCC) as the feature instead of MDCT (modified discrete cosine transform) coefficients. Although MFCC is often used in music classification and speaker recognition, it cannot be directly obtained from compressed music data such as MP3 format. We introduce a modified method for calculating MFCC vector in MP3 compressed domain. For describing the distribution of MFCC vector, the Gaussian mixture model (GMM) is applied. To find the nearest singer, we use maximum likelihood classification (MLC) to allot each input MFCC vector to its nearest group. The experimental result verifies the feasibility of the proposed approach. 其他題名: Multimed Tools Appl 出版者: Boston: Springer US 出版日期: 2015-02-01 出處: Multimedia tools and applications, 2015-02, Vol.74 (4), p.1489-1509 資源來源: ABI/INFORM Collection 版權: Springer Science+Business Media New York 2014 版權: Springer Science+Business Media New York 2015 識別號: ISSN: 1380-7501 識別號: EISSN: 1573-7721 識別號: DOI: 10.1007/s11042-014-2189-6