參考文獻 |
[1] Rabiner , L. R. and Juang, B. H., Fundamentals of Speech Recognition,
Prentice Hall, New Jersey, 1993.
[2] Huang, X., Acero, A. and Hon, H. W., Spoken Language Processing, Prentice Hall, 2001.
[3] J. T. Tou, R. C. Gonzalez, Pattern Recognition Principles, Addison Wesley, 1974.
[4] L. S. Lee, Y. Lee, “Voice Access of Global Information for Broad-Band Wireless: Technologies of Today and Challenges of Tomorrow,” Proceedings of the IEEE, vol. 89, no. 1, pp. 41-57, January 2001.
[5] Johan A.K. Suykens, Tony Van Gestel, Jos De Brabanter, Bart De Moor and Joos Vandewalle, Least Squares Support Vector Machines, World Scientific, 2002
[6] Reynolds, D. A. and Rose, R. C., “Robust Text-Independent Speaker Identification Using Gaussian Mixture Models,” IEEE Trans. Speech and Audio Processing, vol. 3, no. 1, pp. 72-83, January 1995.
[7] Alex Solomonoff, W. M. Campbell, and I. Boardman, “Advances in channel compensation for SVM speaker recognition,” in Proceedings of ICASSP, 2005.
[8] Vergin , R. and O’Shaughnessy, D., and Farhat, A., “Generalized Mel Frequency Coefficients for Large-Vocabulary Speaker-Independent Continuous-Speech Recognition,” IEEE Trans. Speech and Audio Processing, vol. 7, no. 5, pp. 525-532, September 1999.
[9] Rosenberg, A. E. and Parthasarathy, S.”Speaker background models for connected digit password speaker verification”. In Proceedings of the International Conference on Acoustics,Speech, and Signal Processing, May 1996, pp. 81–84.
[10] Isobe, T. and Takahashi, J., “Text-independent speaker verification using virtual speaker based cohort normalization”. In Proceedings of the European Conference on Speech Communication and Technology, 1999, pp. 987–990.
[11] Reynolds, D. and Quatieri, T., “Speaker Verification Using Adapted Gaussian Mixture Models,” Digital Signal Processing 10, PP. 19-41, 2000.
[12] Dempster, A., Laird, N., and Rubin, D.,” Maximum likelihood from incomplete data via the EM algorithm”, J. Roy. Stat. Soc. 39 (1977), 1–38.
[13] Reynolds, D. A.,” A Gaussian Mixture Modeling Approach to Text-Independent Speaker Identification”. Ph.D. thesis, Georgia Institute of Technology, September 1992.
[14] Reynolds, D. A. and Rose, R. C., “Robust text-independent speaker identification using Gaussian mixture speaker models”, IEEE Trans. Speech Audio Process. 3 (1995), 72–83.
[15] Moon, T. K., “The Expectation-Maximization Algorithm,” IEEE Signal Processing Magazine, vol. 13, no. 6, pp. 47-60, November 1996.
[16] Duda, R. O. andHart, P. E., “Pattern Classification and Scene Analysis”. Wiley, New York, 1973.
[17] Gauvain, J. L. and Lee, C.-H., “Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains”, IEEE Trans. Speech Audio Process. 2 (1994), 291–298.
[18] Vuuren, S., “Speaker Verification in a Time-Feature Space”. Ph.D. thesis, Oregon Graduate Institute, March 1999.
[19] Dunn, R. B., Reynolds, D. A., and Quatieri, T. F., “Approaches to speaker detection and tracking in conversational speech”, Digital Signal Process. 10 (2000), 93–112.
[20] Higgins, A., Bahler, L., and Porter, J.,“ Speaker verification using randomized phrase prompting”,Digital Signal Process. 1 (1991), 89–106.
[21] Rosenberg, A. E., DeLong, J., Lee, C. H., Juang, B. H., and Soong, F. K., “The use of cohort normalized scores for speaker verification”.In International Conference on Speech and Language Processing, November 1992, pp. 599–602.
[22] Reynolds, D. A.,“ Speaker identification and verification using Gaussian mixture speaker models”,Speech Commun. 17 (1995), 91–108.
[23] Matsui, T. and Furui, S.,“ Similarity normalization methods for speaker verification based on a posteriori probability”, In Proceedings of the ESCA Workshop on Automatic Speaker Recognition, Identification and Verification, 1994, pp. 59–62.
[24] Carey, M., Parris, E., and Bridle, J., “A speaker verification system using alphanets”. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, May 1991, pp. 397–400.
[25] Reynolds, D. A., “Comparison of background normalization methods for text-independent speaker verification”. In Proceedings of the European Conference on Speech Communication and Technology, September 1997, pp. 963–966.
[26] Matsui, T. and Furui, S., “Likelihood normalization for speaker verification using a phonemeand speaker-independent model”, Speech Commun. 17 (1995), 109–116.
[27] Rosenberg, A. E. and Parthasarathy, S.,” Speaker background models for connected digit password speaker verification”. In Proceedings of the International Conference on Acoustics,Speech, and Signal Processing, May 1996, pp. 81–84.
[28] Heck, L. P. and Weintraub, M., “Handset-dependent background models for robust textindependent speaker recognition”. In Proceedings of the International Conference on Acoustics,Speech, and Signal Processing, April 1997, pp. 1071–1073.
[29] Martin, A., Doddington, G., Kamn, T., Ordowski, M., and Przybocki, M., “The DET curve in assessment of detection task performance,” in Proceedings of European Conference on Speech Communication and Technology, pp. 1895-1898, 1997.
[30] V. Wan and W. M. Campbell, “Support vector machines for speaker verification and identification,” in Proc. Neural Networks for Signal Processing X, pp. 775–784, 2000.
[31] Wan, V. and Renals, S., “SVMSVM: Support Vector Machine speaker verification methodology,” in Proc. IEEE ICASSP, 2003.
[32] Kong-Aik Lee, Changhuai You1, Haizhou Li1, Tomi Kinnunen2, and Donglai Zhu1“ Characterizing Speech Utterances for Speaker Verification with Sequence Kernel SVM”
[33] Haykin, S., Neural Network: A Comprehensive Foundation. NJ:
Prentice-Hall, 1999.
[34] N. Cristianini and J. Shawe-Taylor, “An Introduction to Support
Vector Machines”. Cambridge: Cambridge University Press,2000.
[35] Campbell, W.M., “Generalized linear discriminant sequence kernels for speaker recognition,” in Proceedings of ICASSP, 2002, pp. 161–164.
[36] Cristianini, Nello and John Shawe-Taylor, Support Vector Machines,
Cambridge University Press, Cambridge, 2000.
[37] Pedro J. Moreno, Purdy P. Ho, and Nuno Vasconcelos, “A Kullback-
Leibler divergence based kernel for SVM classification in multimedia
applications,” in Adv. in Neural Inf. Proc. Systems 16, S. Thrun, L. Saul,
and B. Sch‥olkopf, Eds. MIT Press, Cambridge, MA, 2004.
[38] Minh N. Do, “Fast approximation of Kullback-Leibler distance for
dependence trees and hidden Markov models,” IEEE Signal Processing
Letters, vol. 10, no. 4, pp. 115–118, 2003.
[39] Mathieu Ben,Michel Bester, Frederic Bimbot, and Guillaume Gravier,“Speaker diarization using bottom-up clustering based on a parameterderived distance between adapted GMMs,” in Proc. of ICSLP, 2004.
[40] Campbell, W.M., “Generalized linear discriminant sequence kernels for speaker recognition,” in Proceedings of ICASSP, 2002, pp. 161–164.
[41] Mikhail Belkin and Partha Niyogi, “Laplacian eigenmaps and spectral techniques for embedding and clustering,”in Advances in Neural Information Processing 14, T. G.Deitterich, S. Beck, and Z. Ghahramani, Eds., 2003.
[42] Layton, M., “Augmented statistical models for classifying sequence
data,” Ph.D. dissertation, Cambridge Univ., Cambridge, U.K., 2006.
[43] Jaakkola , T. and Haussler, D., “Exploiting generative models in discriminative classifiers,” in Proc. NIPS, 1999, pp. 487–493.
[44] Gales, M. J. F. and Layton, M., “Training augmented models using
SVMs,” IEICE Special Iss. Statist. Models Speech Recognition, 2006.
[45] Wan, V. and Renals, S., “Speaker verification using sequence discriminant support vector machines,” IEEE Trans. Speech Audio Process.,vol. 13, no. 2, pp. 203–210, Mar. 2004.
[46] Raghavan, S., Lazarou, G.. and Picone, J., “Speaker Verification Using Support Vector Machines,” in Proc. IEEE, 2006.
[47] “The NIST Year 2001 Speaker Recognition Evaluation Plan”, http://www.nist.gov/speech/tests/spk/2001/
[48] Wei Huang, Jianshu Chao, Yaxin Zhang,” Combination of Pitch and MFCC GMM Supervectors for Speaker Verification,” ISSC 2008. IET Irish Publication,pp. 32 – 36. 2008
[49] Chen, C. P. and Bilmes, J.,“MVA Processing of Speech Features” , Audio, Speech and Language Processing, vol. 15,pp257-270, 2007.
|