博碩士論文 107421022 詳細資訊




以作者查詢圖書館館藏 以作者查詢臺灣博碩士 以作者查詢全國書目 勘誤回報 、線上人數:32 、訪客IP:3.149.254.35
姓名 鄭雅容(Ya-Rong Cheng)  查詢紙本館藏   畢業系所 企業管理學系
論文名稱 以 LSTM 模型判 別嬰兒哭泣原因之研究
(Classifying causes of infant crying with LSTM)
相關論文
★ 在社群網站上作互動推薦及研究使用者行為對其效果之影響★ 以AHP法探討伺服器品牌大廠的供應商遴選指標的權重決定分析
★ 以AHP法探討智慧型手機產業營運中心區位選擇考量關鍵因素之研究★ 太陽能光電產業經營績效評估-應用資料包絡分析法
★ 建構國家太陽能電池產業競爭力比較模式之研究★ 以序列採礦方法探討景氣指標與進出口值的關聯
★ ERP專案成員組合對績效影響之研究★ 推薦期刊文章至適合學科類別之研究
★ 品牌故事分析與比較-以古早味美食產業為例★ 以方法目的鏈比較Starbucks與Cama吸引消費者購買因素
★ 探討創意店家創業價值之研究- 以赤峰街、民生社區為例★ 以領先指標預測企業長短期借款變化之研究
★ 應用層級分析法遴選電競筆記型電腦鍵盤供應商之關鍵因子探討★ 以互惠及利他行為探討信任關係對知識分享之影響
★ 結合人格特質與海報主色以類神經網路推薦電影之研究★ 資料視覺化圖表與議題之關聯
檔案 [Endnote RIS 格式]    [Bibtex 格式]    [相關文章]   [文章引用]   [完整記錄]   [館藏目錄]   至系統瀏覽論文 ( 永不開放)
摘要(中) 哭泣為嬰兒最初與外界交流的語言,亦為主要表達其需求與情緒之溝通手法,透過發出哭聲訊號引起照顧者之回應,以滿足其需求。不同類型的哭聲模式之間存有差異,因此哭聲成為辨識嬰兒不同需求與狀態之重要訊息來源。然而,新手父母或缺乏經驗的照護人員難以單憑哭聲即能了解嬰兒哭泣之原因,嬰兒哭泣亦成為照顧嬰兒時需面對的主要問題。嬰兒哭泣原因不計其數,根據依附理論可得知嬰兒會因缺乏安全感而有尋求安全感之需求,但尚未有研究辨識此哭聲類型,因此,相較其它研究之哭聲辨識模型常見的哭泣原因外,本研究還多考慮缺乏安全感之哭聲類型。
本研究旨在建立哭聲自動判斷模型並採用真實哭聲數據庫,研究流程使用梅爾倒頻譜係數(Mel-Frequency Cepstral Coefficients, MFCC)作為特徵提取,並以長短期記憶網絡(Long Short Term Memory networks,LSTM)作為分類模型,最終分為五種哭聲類型,即肚子餓、想睡覺、大小便、生氣與缺乏安全感。實驗結果顯示模型 之 精度
(Precision)為 48% 有 鑑於此,嬰兒哭泣問題有望自動解決。
摘要(英) Crying is the first language of an infant to communicate with the external world. It’s also the primary means of communication that expresses infant’s needs and emotions. Infant crying is a signal that elicits caregivers to respond to meet his/her needs. There are variations between different types of crying patterns. Therefore, crying becomes an important source of information to distinguish different needs and conditions of infants. However, it is challenging for novice parents or inexperienced caregivers to understand the reason of infant crying. Infant crying is also the main trouble when caring for infants. According to the attachment theory, it can be known that infants have a need to seek security from the main caregivers due to lack of security. But there is no research to recognize this type of crying. Consequently, compared with other studies, this study also considered the cause of insecurity.
This study aims to construct an automatic infant cry recognition model and use a real crying database. The acoustic features are extracted using Mel-Frequency Cepstral Coefficients (MFCC), and Long Short Term Memory networks (LSTM) is used to classify crying signal into five categories including want to sleep, poopee, anger and insecurity. The experimental results show that the model achieves 48% precision, indicating the infant crying problem is expected to be resolved automatically.
關鍵字(中) ★ 嬰兒哭泣
★ 哭聲檢測
★ 聲學分析
★ 長短期記憶網絡(LSTM)
關鍵字(英) ★ infant cry
★ cry detection,
★ acoustic analysis
★ LSTM
論文目次 中文摘要 ...............................................ⅰ
Abstract...............................................ⅱ
圖目錄..................................................ⅴ
表目錄..................................................ⅵ
第一章 緒論.................................................1
1-1研究背景....................................................1
1-2研究動機....................................................2
1-3研究目的....................................................4
1-4研究架構....................................................5
第二章 文獻探討.............................................6
2-1嬰兒哭聲分析................................................6
2-2依附理論....................................................8
2-3照顧行為....................................................9
第三章 研究方法............................................11
3-1研究流程...................................................11
3-2哭聲訊號分析與處理..........................................12
3-2-1 音檔切割與預處理.........................................14
3-2-2 特徵提取................................................14
3-3分類模型...................................................19
第四章 研究實驗............................................21
4-1 資料來源..................................................21
4-2蒐集過程與標記工作..........................................23
4-3 模型參數設定..............................................24
4-4 實驗結果分析..............................................25
第五章 結論與未來研究建議...................................27
5-1 研究結論..................................................27
5-2研究限制與未來研究建議......................................28
參考文獻......................................................29
參考文獻 [1] Balandong, R. P. J. B. E. U. o. M. (2013). "Acoustic Analysis of Baby Cry."

[2] Barajas-Montiel, S. E. and C. A. Reyes-Garcia (2005). Identifying pain and hunger in infant cry with classifiers ensembles. International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC′06), IEEE.

[3] Barnow, S., et al. (2005). "Correlates of aggressive and delinquent conduct problems in adolescence." 31(1): 24-39.

[4] Beck, J. E., et al. (2005). "The influence of perinatal complications and environmental adversity on boys’ antisocial behavior." 46(1): 35-46.

[5] Beeney, J. E., et al. (2017). "Disorganized attachment and personality functioning in adults: A latent class analysis." 8(3): 206.

[6] Bell, S. M. and M. D. S. J. C. d. Ainsworth (1972). "Infant crying and maternal responsiveness." 1171-1190.

[7] Bhattacharya, S. (2016). Speech processing system and method for recognizing speech samples from a speaker with an oriyan accent when speaking english, Google Patents.

[8] Bornstein, M. H. and D. L. J. C. d. Putnick (2012). "Cognitive and socioemotional caregiving in developing countries." 83(1): 46-61.

[9] Bowlby, J. J. A. j. o. O. (1982). "Attachment and loss: retrospect and prospect." 52(4): 664.

[10] Bowlby, J. J. I. j. o. p.-a. (1958). "The nature of the child′s tie to his mother." 39: 350-373.

[11] Bowlby, J. J. R. H. F., W.,, et al. (1969). "Attachment and loss v. 3 (Vol. 1)." 33: 470-478.

[12] Cecchini, M., et al. (2007). "Communication and crying in newborns." 30(4): 655-665.

[13] Chang, C.-Y. and L.-Y. Tsai (2019). A CNN-Based Method for Infant Cry Detection and Recognition. Workshops of the International Conference on Advanced Information Networking and Applications, Springer.

[14] Chittora, A. and H. A. Patil (2015). Classification of normal and pathological infant cries using bispectrum features. 2015 23rd European Signal Processing Conference (EUSIPCO), IEEE.

[15] Cicchetti, D., et al. (1981). "Developmental perspectives on the etiology, intergenerational transmission, and sequelae of child maltreatment." 1981(11): 31-55.

[16] Dewi, S. P., et al. (2019). Analysis of LFCC Feature Extraction in Baby Crying Classification using KNN. 2019 IEEE International Conference on Internet of Things and Intelligence System (IoTaIS), IEEE.

[17] Garcia, J. O. and C. R. Garcia (2003). Mel-frequency cepstrum coefficients extraction from infant cry for classification of normal and pathological cry with feed-forward neural networks. Proceedings of the International Joint Conference on Neural Networks, 2003., IEEE.

[18] Graves, A., et al. (2013). Hybrid speech recognition with deep bidirectional LSTM. 2013 IEEE workshop on automatic speech recognition and understanding, IEEE.

[19] Han, S., et al. (2017). Ese: Efficient speech recognition engine with sparse lstm on fpga. Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays.

[20] Hazan, C. and P. R. J. P. i. Shaver (1994). "Attachment as an organizational framework for research on close relationships." 5(1): 1-22.

[21] Howes, C., et al. (1998). "Child care caregiver sensitivity and attachment." 7(1): 25-36.

[22] Jong, J.-T., et al. (2010). "Can temperament be understood at birth? The relationship between neonatal pain cry and their temperament: A preliminary study." 33(3): 266-272.

[23] Kadushin, A., et al. (1981). Child abuse--An interactional event, Columbia University Press.

[24] LaGasse, L. L., et al. (2005). "Assessment of infant cry: acoustic cry analysis and parental perception." 11(1): 83-93.

[25] Messaoud, A. and C. Tadj (2010). A cry-based babies identification system. International Conference on Image and Signal Processing, Springer.

[26] Michelsson, K. and O. J. I. j. o. p. o. Michelsson (1999). "Phonation in the newborn, infant cry." 49: S297-S301.

[27] Newman, J. D. (1985). The infant cry of primates. Infant crying, Springer: 307-323.

[28] Newman, L. K., et al. (2007). "Borderline personality disorder, mother–infant interaction and parenting perceptions: preliminary findings." 41(7): 598-605.

[29] Pal, P., et al. (2006). Emotion detection from infant facial expressions and cries. 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, IEEE.

[30] Price, J., et al. (2005). "Design of an Automatic Speech Recognition System Using MATLAB."

[31] Ruvolo, P. and J. Movellan (2008). Automatic cry detection in early childhood education settings. 2008 7th IEEE International Conference on Development and Learning, IEEE.

[32] Saha, B., et al. (2013). An embedded system for automatic classification of neonatal cry. 2013 IEEE Point-of-Care Healthcare Technologies (PHT), IEEE.

[33] Silva, G. and D. Wickramasinghe (2017). "Infant cry detection system with automatic soothing and video monitoring functions."

[34] Singh, N., et al. (2012). "Mfcc and prosodic feature extraction techniques: A comparative study." 54(1).
[35] Soltis, J. J. B. and b. sciences (2004). "The signal functions of early infant crying." 27(4): 443-458.

[36] Yong, B. F., et al. (2019). Baby cry recognition using deep neural networks. World Congress on Medical Physics and Biomedical Engineering 2018, Springer.

[37] Zeifman, D. M. J. D. P. T. J. o. t. I. S. f. D. P. (2001). "An ethological analysis of human infant crying: answering Tinbergen′s four questions." 39(4): 265-285.

[38] Zeifman, D. M. J. I. M. H. J. O. P. o. T. W. A. f. I. M. H. (2003). "Predicting adult responses to infant distress: Adult characteristics associated with perceptions, emotional reactions, and timing of intervention." 24(6): 597-612.

[39] Zeskind, P. S. and B. M. Lester (2001). "Analysis of infant crying."

[40] Zeyer, A., et al. (2017). A comprehensive study of deep bidirectional LSTM RNNs for acoustic modeling in speech recognition. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE.

[41] Zhang, Y.-F., et al. (2020). "Predicting the Trend of Dissolved Oxygen Based on the kPCA-RNN Model." 12(2): 585.

[42] Swarnalexmi Nagarajan ,Rajeswari Rengarajan ,Nivethitha Manoharan ,Kanniga Devi Baskaran,”Infant Cry Analysis For Emotion Detection By Using Feature Extraction Methods, International Journal Of Industrial Electronics And Electrical Engineering,Volume-5,Issue5,May-2017

[43] Bănică, I. A., Cucu, H., Buzo, A., Burileanu, D., & Burileanu, C. (2016, June). Automatic methods for infant cry classification. In 2016 International Conference on Communications (COMM) (pp. 51-54). IEEE.

[44] Chittora, A., & Patil, H. A. (2015, August). Classification of normal and pathological infant cries using bispectrum features. In 2015 23rd European Signal Processing Conference (EUSIPCO) (pp. 639-643). IEEE.

[45] Cohen, R., & Lavner, Y. (2012, November). Infant cry analysis and detection. In 2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel (pp. 1-5). IEEE.

[46] Dixit, A. A., & Dharwadkar, N. V. (2018, April). A Survey on Detection of Reasons behind Infant Cry using Speech Processing. In 2018 International Conference on Communication and Signal Processing (ICCSP) (pp. 190-194). IEEE.
[47] Gustafson, G. E., & Harris, K. L. (1990). Women′s responses to young infants′ cries. Developmental Psychology, 26(1), 144.

[48] Lederman, D. (2002). Automatic classification of infants′ cry (pp. 1-11). Ben-Gurion University of the Negev.

[49] Liu, L., Li, Y., & Kuo, K. (2018, March). Infant cry signal detection, pattern extraction and recognition. In 2018 International Conference on Information and Computer Technologies (ICICT) (pp. 159-163). IEEE.

[50] Matsunaga, S., Sakaguchi, S., Yamashita, M., Miyahara, S., Nishitani, S., & Shinohara, K. (2006). Emotion detection in infants′ cries based on a maximum likelihood approach. In Ninth International Conference on Spoken Language Processing.

[51] Myakala, P. R., Nalumachu, R., Sharma, S., & Mittal, V. K. (2017, November). A low cost intelligent smart system for real time infant monitoring and cry detection. In TENCON 2017-2017 IEEE Region 10 Conference (pp. 2795-2800). IEEE.

[52] Zeifman, D. M. (2001). An ethological analysis of human infant crying: answering Tinbergen′s four questions. Developmental Psychobiology: The Journal of the International Society for Developmental Psychobiology, 39(4), 265-285.

[53] Graves, A., Mohamed, A. R., & Hinton, G. (2013, May). Speech recognition with deep recurrent neural networks. In 2013 IEEE international conference on acoustics, speech and signal processing (pp. 6645-6649). IEEE.

[54] Sak, H., Senior, A., & Beaufays, F. (2014). Long short-term memory based recurrent neural network architectures for large vocabulary speech recognition. arXiv preprint arXiv:1402.1128.

[55] Biloop Technologic,S.L., (2009). Cry Translator. Web. 05 Mar. 2011.
甲、 <http://www.crytranslator.com>.

[56] Christopher Olah. (2015). Understanding LSTM Networks
甲、 <https://colah.github.io/posts/2015-08-Understanding-LSTMs/>
[57] 沈依. (2020). 利用LSTM建立聲音滿意度辨識模型
指導教授 許秉瑜 審核日期 2020-12-25
推文 facebook   plurk   twitter   funp   google   live   udn   HD   myshare   reddit   netvibes   friend   youpush   delicious   baidu   
網路書籤 Google bookmarks   del.icio.us   hemidemi   myshare   

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明