參考文獻 |
[1]J. P. Campbell and JR., “Speaker recognition: a tutorial,” Proceedings of the IEEE , vol. 85, no. 9, pp. 1437-1462, 1997.
[2]呂易宸,「語音門禁系統」,桃園:國立中央大學碩士論文,2011。
[3]B. H. Juang and S. Furui, “Automatic recognition and understanding of spoken language - a first step toward natural human-machine communication,” Proceedings of the IEEE , vol. 88, no. 8, pp. 1142-1165, 2000.
[4]林品宏,「關鍵詞萃取系統及語音聲控車之應用」,桃園:國立中央大學碩士論文,2012。
[5]J. Bradbury, “Linear predictive coding,” Online PDF, pp. 1-23, 2000.
http://my.fit.edu/~vkepuska/ece5525/lpc_paper.pdf
[6]R. Vergin, D. O′Shaughnessy and A. Farhat, “Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition,” IEEE Transactions on Speech and Audio Processing, vol.7, no. 5, pp. 525-532, 1999.
[7]B. J. Shannon and K. K. Paliwal, “Feature extraction from higher-lag autocorrelation coefficients for robust speech recognition,” Science Direct Speech Communication, vol. 48, pp. 1458-1485, 2006.
[8]J. Wu and J. Yu, “An improved arithmetic of MFCC in speech recognition system,” International Conference on Electronics, Communications and Control (ICECC), pp. 719-722, 2011.
[9]J. G. Wilpon, L. Rabiner, C. H. Lee and E. R. Goldman, “Automatic recognition of keywords in unconstrained speech using hidden Markov models,” IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 38, no. 11, pp. 1870-1878, 1990.
[10]L. Rabiner, “A tutorial on hidden Markov models and selected applications in speech recognition,” Proceedings of the IEEE , vol. 77, no. 2, pp. 257-286, 1989.
[11]R. P. Lippmann, “An introduction to computing with neural nets,” ASSP Magazine, IEEE , vol. 4, no. 2, pp. 4-22, 1987.
[12]A. E. Rosenberg, C. H. Lee and F. K. Soong, “Cepstral channel normalization techniques for HMM-based speaker verification,” International Conference on Spoken Language Processing (ICSLP), pp. 1835-1838, 1994.
[13]O. Viikki and K. Laurila, “Cepstral domain segmental feature vector normalization for noise robust speech recognition,” ScienceDirect Speech Communication, vol. 25, pp. 133-147, 1998.
[14]C. W. Hsu and L. S. Lee, “Higher order cepstral moment normalization for Improved robust speech recognition,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 17, no. 2, pp. 205-220, 2009.
[15]N. V. Prasad and S. Umesh, “Improved cepstral mean and variance normalization using Bayesian framework,” IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp. 156-161, 2013.
[16]F. Hilger and H. Ney, “Quantile based histogram equalization for noise robust large vocabulary speech recognition,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 3, pp. 845-854, 2006.
[17]王小川,「語音訊號處理」,修訂二版,台北:全華圖書股份有限公司,2009。
[18]杜文祥,「組合式倒頻譜統計正規化法於強健性語音辨識之研究」,南投:暨南國際大學,2009。
[19]何冠旻, “併合式倒頻譜統計正規化技術於強健性語音辨識之研究,” 南投:暨南國際大學. 2009
[20]R. C. Rose and D. B. Paul, “A hidden Markov model based keyword recognition system,” IEEE Acoustics, Speech, and Signal Processing, pp. 129-132, 1990.
[21]黃國彰,「關鍵詞萃取與確認之研究」,桃園:國立中央大學碩士論文,1996。
[22]蔡炎興,「關鍵詞萃取即語者辨識系統之研製」,桃園:國立中央大學碩士論文,2003。
[23]「國音學」,台北:國立臺灣師範大學國音教編輯委員會,2001。
[24]「大五碼」,台北:台灣財團法人資訊工業策進會,1983。
[25]R. W. Schafer and L. R. Rabiner, “Digital representations of speech signals,” Proceedings of the IEEE , vol. 63, no. 4, pp. 662-677, 1975.
[26]R. M. Nickel, “Feature - Automatic speech character identification,” IEEE Circuits and Systems Magazine, vol. 6, no. 4, pp. 10-31, 2006.
[27]H. Hermansky, “Perceptual linear predictive analysis of speech,” J Acoustic. SOC. Am, vol. 87, no. 4, pp. 1738-1752, 1990.
[28]X. Zhu, Y. Chen, J. Liu and R. Liu, “Feature selection in Mandarin large vocabulary continuous speech recognition,” IEEE International Conference on Signal Processing, vol. 1, pp. 508-511, 2002.
[29]張志豪,「強健性和鑑別力語音特徵擷取技術於大詞彙連續語音辨識之研究」,台北:國立師範大學碩士論文,2005。
[30]謝宗學,「加成性雜訊環境下運用特徵參數統計補償法於強健性語音辨識」,南投:國立暨南國際大學碩士論文,2006。
[31]林銘駿,「環境中低頻噪音之量測及管制策略研究」,桃園:國立中央大學碩士論文,2008。
[32]許時懷,「語音特徵值擷取濾波器之改良」,桃園:國立中央大學碩士論文,2015。
[33]張智傑,「多種語音特徵的合併及其在智慧型手機上之應用」,桃園:國立中央大學碩士論文,2014。
[34]J. Junkawitsch, L. Neubauer, H. Hoge and G. Ruske, “A new keyword spotting algorithm with pre-calculated optimal thresholds,” Proceedings of Fourth International Conference on Spoken Language, vol. 4, pp. 2067-2070, 1996.
[35]M. W. Koo, C. H. Lee and B. H. Juang, “Speech recognition and utterance verification based on a generalized confidence score,” IEEE Transactions on Speech and Audio Processing, vol. 9, no. 8, pp. 821-832, 2001.
[36]H. Ney, “The use of a one-stage dynamic programming algorithm for connected word recognition,” IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 32, no. 2, pp. 263-271, 1984.
[37]郭又偵,「改良式梅爾倒頻譜參數應用於關鍵字萃取」,桃園:國立中央大學碩士論文,2014。 |