參考文獻 |
REFERENCES
[1] Prajna Kunche and K.V.V.S. Reddy, “Metaheuristic Applications to Speech Enhancement,” SpringerBriefs in Electrical and Computer Engineering., p. 3, 2016.
[2] Tassadaq Hussain, K. Cho, Sabato Marco Siniscalchi, Chi-Chun Lee, Syu-Siang Wang and Yu Tsao, “Experimental Study on Extreme Learning Machine Applications for Speech Enhancement,”IEEE Access vol. 5, 2017.
[3] Jeremy Chiaming Yang et al., “Speech Enhancement via Ensamble Modeling NMF Adaptation ,” International Conference on Consumer Electronics-Taiwan, 2016.
[4] A. L. Maas, Q. V. Le, T. M. O’Neil, O. Vinyals, P. Nguyen, and A. Y. Ng, “Recurrent neural networks for noise reduction in robust ASR,” in Proc. Interspeech, 2012, pp. 22–25.
[5] X. Lu, Y. Tsao, S. Matsuda, and C. Hori, “Speech enhancement based on deep denoising autoencoder.” in Proc. Interspeech, 2013, pp. 436–440.
[6] F. Weninger, F. Eyben, and B. Schuller, “Single-channel speech separation with memory-enhanced recurrent neural networks,” in Proc. IEEE Intl. Conf. on Acoustic, Speech and Signal Processing, 2014.
[7] X. Feng, Y. Zhang, and J. Glass, “Speech feature denoising and dereverberation via deep autoencoders for noisy reverberant speech recognition,” in Proc. IEEE Intl. Conf. on Acoustic, Speech and Signal Processing, 2014..
[8] J. Du, L. Dai, and Q. Huo, “Synthesized stereo mapping via deep neural networks for noisy speech recognition,” in Proc. IEEE Intl. Conf. on Acoustic, Speech and Signal Processing, 2014, pp. 1764–1768.
[9] A. Narayanan and D. Wang, “Ideal ratio mask estimation using deep neural networks,” in Proc. IEEE Intl. Conf. on Acoustic, Speech and Signal Processing, 2013, pp. 7092–7096.
[10] Y. Wang, A. Narayanan, and D. Wang, “On training targets for supervised speech separation,” IEEE/ACM Trans. on Audio, Speech and Language Processing, vol. 22, no. 12, pp. 1849–1858, 2014..
[11] Zhuo Chen, Yan Huang, Jinyu Li, and Yifan Gong, “Improving Mask Learning Based Speech Enhancement system with Restoration Layer and Residual Connection,” Conference: Interspeech 2017.
[12] Xugang Lu, Yu Tsao, Shigeki Matsuda, and Chiori Hori, “Speech Enhancement on Deep Denoising Autoencoder,” Conference: Interspeech 2013.
[13] Ryandhimas Edo, Jia-Ching Wang, and Yu Tsao, “Study of Robustness of DNN Acoustic Modeling Based on Multi-style Training with Speech Enhancement,” Master Thesis NCU Taiwan. pp. 18–23, June 2017.
[14] Y. Xu, J. Du, L. R. Dai, and C. H. Lee, ‘‘A regression approach to speech enhancement based on deep neural networks,’’ IEEE/ACM Trans. Audio, Speech, Language Process., vol. 23, no. 1, pp. 7–19, Jan. 2015.
[15] G.-B. Huang, Q.-Y. Zhu, and C.-K. Siew, ‘‘Extreme learning machine: Theory and applications,’’ Neurocomputing, vol. 70, nos. 1–3, pp. 489–501, 2006.
[16] Ryandhimas E. Zezario et al, ‘‘Deep Denoising Autoencoder Based Post Filter for Speech Enhancement,’’ Proceedings, APSIPA Annual Summit and Conference. 2018.
[17] A. A. Mohammed, R. Minhas, Q. M. J. Wu, and M. A. Sid-Ahmed, “Human face recognition based on multidimensional PCA and extreme learning machine,” Pattern Recognit., vol. 44, nos. 10–11, pp. 2588–2597, 2011.
[18] C. Pan, D. S. Park, Y. Yang, and H. M. Yoo, “Leukocyte image segmentation by visual attention and extreme learning machine,” Neural Comput. Appl., vol. 21, no. 6, pp. 1217–1227, 2012
[19] R. Minhas, A. Baradarani, S. Seifzadeh, and Q. M. J. Wu, “Human action recognition using extreme learning machine based on visual vocabularies,” Neurocomputing, vol. 73, nos. 10–12, pp. 1906–1917, 2010.
[20] G.-B. Huang, L. Chen, and C.-K. Siew, “Universal approximation using incremental constructive feedforward networks with random hidden nodes,” IEEE Trans. Neural Netw., vol. 17, no. 4, pp. 879–892, Jul. 2006
[21] G.-B. Huang, M.-B. Li, L. Chen, and C.-K. Siew, “Incremental extreme learning machine with fully complex hidden nodes,” Neurocomputing, vol. 71, nos. 4–6, pp. 576 -583, 2008
[22] Jiexiong Tang, Chenwei Deng, and G.-B. Huang, “Extreem Learning Machine for Multilayer Perceptron,” IEEE Transactions on Neural Networks and Learning Systems, vol. 27, no. 4 , 2016.
[23] L. L. C. Kasun, H. Zhou, G.-B. Huang, and C. M. Vong, “Representational learning with extreme learning machine for big data,” IEEE Intell. Syst., vol. 28, no. 6, pp. 31–34, Nov. 2013.
[24] Y. Xu, J. Du, L. R. Dai, and C. H. Lee, “A regression approach to speech enhancement based on deep neural networks,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23, no. 1, pp. 7–19, 2014.
[25] Xiao-Lei Zhang, and DeLiang Wang, “A Deep Ensemble Learning Method for Monaural Speech Separation,” IEEE/ACM Trans Audio Speech Lang Process. 2016 Mar; 24(5): 967–977.
[26] Kun Han et al, “Deep Neural Network Based Spectral Feature Mapping for Robust Speech Recognition,” Conference: Interspeech 2015.
[27] DeLiang Wang, and Jitong Chen, “Supervised Speeh Separation Based on Deep Learning: An Overview,” IEEE/ACM Trans Audio Speech Lang Process. 2018 Oct; 26(10): 1702–1726.
[28] C. H. Taal, R. C. Hendriks, and R. Heusdens, “Matching pursuit for channel selection in cochlear implants based on an intelligibility metric,” in Proc. EUSIPCO, 2012, pp. 504–508.
[28] C. H. Taal, R. C. Hendriks, and R. Heusdens, “Matching pursuit for channel selection in cochlear implants based on an intelligibility metric,” in Proc. EUSIPCO, 2012, pp. 504–508.
[29] A. H. Andersen, J. M. d. Haan, Z. H. Tan, and J. Jensen, “Predicting the Intelligibility of Noisy and Nonlinearly Processed Binaural Speech,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, no. 11, pp. 1908–1920, 2016.
[30] S. M. Kay, Fundamentals of Statistical Signal Processing: Estimation Theory. Prentice Hall, 2010.
[31] Philipos C. Loizou and Gibak Kim,” Reasons why Current Speech-Enhancement Algorithms do not Improve Speech Intelligibility and Suggested Solutions”, IEEE transactions on audio, speech, and language processing, vol. 19, no. 1, January 2011
[32] N. Parihar, J. Picone, D. Pearce, and H.-G. Hirsch, ‘‘Performance analysis of the Aurora large vocabulary baseline system,’’ in Proc. 12th Eur. Signal Process. Conf., 2004, pp. 553–556.
[33] Yong Xu, Jun Du, Li-Rong Dai, and Chin-Hui Lee, ‘‘Cross-language Transfer Learning for Deep Neural Network Based Speech Enhancement,’’ 9th International Symposium on Chinese Spoken LanguageProcessing (ISCSLP). 2014.
[34] S. Quackenbush, T. Barnwell, and M. Clements, Objective Measures of Speech Quality. Englewood Cliffs, NJ, USA: Prentice-Hall, 1988. |