參考文獻 |
[1] J. Makhoul, "Linear Prediction: a tutorial review," Proceedings of the IEEE, vol. 63, pp. 561-580, 1975.
[2] G. Fant, Acoustic Theory of Speech Production, Morton, S-Gravenhage, 1960.
[3] Dongbing Wei and C. C. Goodyear, "Experiments in Female Voice Speech Synthesis Using A Parametric Articulatory Model," ICASSP, vol. 3, pp. 1631 –1634, April 1997.
[4] L. R. Rabiner and R. W. Schafer, Digital Prediction of Speech Signals, Prentice Hall, 1978.
[5] F. F. Lee, "Time Compression and Expansion of Speech by the Sampling method, " J. Audio Eng. Soc. 20: 738-742, 1972.
[6] R. J. Scott and S. E. Gerber, "Pitch Synchronous Time Compression of Speech, " in Proc. Conf. Speech Commun. Process., Newton, Mass., 63-65, 1972.
[7] F. Charpentier and M. Stella, "Diphone synthesis using an overlap-add technique for speech waveform concatenation," Proc. ICASSP Tokyo, pp. 2015-2018, 1986.
[8] H. Valbret, E. Moulines, and J.P. Tubach, "Voice transformation using PSOLA technique," ICASSP, vol. 1, pp. 145 –148, 1992.
[9] 王鴻彬, 國語聲訊處理, 碩士論文, 國立交通大學, 1995
[10] G. J. Lin, S. G. Chen, and T. Wu, "High Quality and Low Complexity Pitch Modification of Acoustic Signals," ICASSP, vol. 5, pp. 2987-2990, 1995.
[11] B. Gold and N. Morgan, Speech and Audio Signal Processing, Wiley, 2000
[12] F. Thomas and J. Robert, "Shape Invariant Time-Scale and Pitch Modification of Speech," IEEE Transactions on Signal Processing, vol. 40, No. 3, March 1992.
[13] J. L. Flanagan and R. M. Golden, "Phase Vocoder, " Bell Syst. Tech. J. 45: 1493-1509, 1966.
[14] M. R. Portnoff, "Time-Scale Modification of Speech Based on Short-Time Fourier Analysis, " IEEE Trans. ASSP, pp. 374-390, 1986.
[15] A. S. Spanias, "Speech Coding: A Tutorial Review," Proceedings of the IEEE, vol. 82, no. 10, pp. 1541-82, October 1994.
[16] A. M. Kondoz, Digital Speech Coding for Low Bit Rate Communications Systems, Wiley, 1994.
[17] H. Kobayashi and T. Shimamura, "A Weighted Autocorrelation Method for Pitch Extraction of Noisy Speech," Proc. ICASSP, vol.3, pp. 1307 –1310, June 2000.
[18] M. J. Ross et al, "Average Magnitude Difference Function Pitch Extractor," IEEE Trans. ASSP, vol.22, pp. 353-362, 1974.
[19] W. Hess, Pitch Determination of Speech Signals, Springer-Verlag, New York, 1983.
[20] L. Gu and R. Liu, "High-Performance Mandarin Pitch Estimation," Journal of Electronics, vol.27, pp. 8-11, 1999.
[21] D. G. Childers, Speech processing and Synthesis toolboxes, Wiley, New York, 2000.
[22] W. Zhang, G. Xu, and Y. Wang, "Pitch Estimation Based on Circular AMDF," ICASSP, pp. I-341-344, 2002.
[23] F. M. Gimenez de los Galanes, M. Savoji, and J. M. Pardo, "Speech synthesis system based on a variable decimation/interpolation factor," ICASSP, pp. 636-639, 1995.
[24] R. Vergin, D. O’Shaughnessy, and A. Farhat, "Time Domain Technique for Pitch Modification and Robust Voice Transformation," Proc. ICASSP, pp. 947-950, April 1997.
[25] R. Muralishankar, A. G. Ramakrishnan, and P. Prathibha, "Modification of pitch using DCT in the source domain," Speech Communication, vol. 42, pp.143-154, 2004.
[26] Ahmed, N., Rao, K.R., Orthogonal Transforms for Digital Signal Processing, Springer, New York, 1975.
[27] H. Fujisaki and T. Kawashima, "The Roles of Pitch and Higher Formants in the Perception of Vowels," IEEE Trans. on Audio and Electroacoustics, vol. AU-16, No. 1, pp. 73-77, March 1968.
[28] A. Acero, "Formant Analysis and Synthesis using Hidden Markov Models," Proc. of the Eurospeech Conference, Budapest, 1999.
[29] C. H. Ho, D. Rentzos, and S. Vaseghi, "Formant Model estimation and transformation for Voice Morphing," Proc. ICSLP, pp. 2149-5151, 2002.
[30] E. Turajlic, D. Rentzos, S. Vaseghi, and C. H. Ho, "Evaluation of methods for parametric formant transformation in voice conversion," ICASSP, pp. I-724-727, 2003.
[31] J. Slifka and T. R. Anderson, "Speaker Modification with LPC Pole Analysis," ICASSP, vol. 1, pp. 644 –647, May 1995.
[32] 楊東敏, 基於線性預測編碼及音框週期同步之高品質語音變換技術, 碩士論文, 國立中央大學, 2003
[33] H. Wakita, "Normalization of Vowels by Vocal-Tract Length and Its Application to Vowel Identification," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 25, pp. 183 –192, April 1977.
[34] R. Ansari, D. Kahn, and M. Macchi, "Pitch Modification of Speech Using a Low-Sensitivity Inverse Filter Approach," IEEE Signal Processing Letters, pp. 60-62, vol. 5, no. 3, March 1998.
[35] B. Atal and L. Rabiner, "A Pattern Recognition Approach to Voiced-Unvoiced-Silence Classification with Applications to Speech Recognition," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 24, pp. 201 –212, Jun 1976.
[36] ITU-T Recommendation G.729, "Coding of speech at 8 kbit/s using Conjugate-Structure Algebraic-Coded-Excited Linear-Prediction (CS-ACEP)," March 1996.
[37] Jr. A. Gray and J. Markel, "A Spectral-Flatness Measure for Studying the Autocorrelation Method of Linear Prediction of Speech Analysis," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 22, pp. 207 –217, Jun 1974.
[38] B. Yegnanarayana and P. Satyanarayana Murthy, "Enhancement of Reverberant Speech Using LP Residual Signal," IEEE Transactions on Speech and Audio Processing, vol. 8, no.3, pp. 267-281, May 2000. |