博碩士論文 108521601 完整後設資料紀錄

DC 欄位 語言
DC.contributor電機工程學系zh_TW
DC.creator普蒂薇zh_TW
DC.creatorEpri Wahyu Pratiwien_US
dc.date.accessioned2021-8-30T07:39:07Z
dc.date.available2021-8-30T07:39:07Z
dc.date.issued2021
dc.identifier.urihttp://ir.lib.ncu.edu.tw:88/thesis/view_etd.asp?URN=108521601
dc.contributor.department電機工程學系zh_TW
DC.description國立中央大學zh_TW
DC.descriptionNational Central Universityen_US
dc.description.abstract音樂的特徵囊括了不同的頻譜線索和時域線索,這些線索在音樂感知上扮演了重要的角色。人工電子耳(Cochlear Implant, CI)編碼策略主要用於語音傳遞,但此法仍會造成音訊失真。本研究檢驗了音調(pitch)與節律(rhythm)對於旋律辨識(melody recognition)的相對貢獻,同時也評估了三種人工電子耳編碼策略對音樂品質的影響,此三種策略分別為:進階聯合編碼(Advanced Combinational Encoder, ACE) 、基本頻率調變(Fundamental Frequency Modulation, F0mod) ,以及包絡增強(Envelope Enhancement, EE) 。 本研究的旋律資料庫內有30首流行於台灣的童歌,每首童歌都有兩種音樂線索,分別為時域線索(音調)以及頻域線索(節律),這些童歌的旋律皆會經由中央大學的人工電子耳模擬器搭配三種人工耳電子耳的編碼策略所處理。接下來,藉由一個主觀的聆聽測試,此測試是從熟悉旋律辨識(Familiar Melody Identification, FMI)測試中處理後的刺激訊號來測量旋律感知(melody perception),藉此收集作答正確率及反應時間。共有5名 正常聽力 參與者參加熟悉旋律辨識測試。開始測試時要先從30首童歌中選出15首,參與者需要使用這15首歌,且這15首歌都會經由不同的人工電子耳編碼策略處理,處理後的歌曲會有不同的音樂特徵(音調及節律)。每名參與者會聽到90種刺激訊號 (15首歌,乘以2種音樂特徵,乘以3種人工電子耳編碼策略),這些參與者共選出了23首童歌旋律。熟悉旋律辨認的結果顯示,當旋律辨識中留存了節律線索,則熟悉旋律辨識測試的表現顯著較佳(p < 0.05),其聽者有較高的作答正確率以及較快的反應時間。此外,旋律伴隨節律線索經由進階聯合編碼策略有最好的分數,其分數為86.80%。 以此23首童歌旋律作為基礎,並使用包絡差值指標(Envelope Difference Index, EDI) 、音強錯配型態(Intensity Mismatch Pattern)和對數頻譜距離(Log Spectral Distance, LSD),三種方法來進行客觀分析,來評估原始音樂訊號,以及根據時域特徵及頻率特徵處理後的音樂訊號。原始訊號與經由進階聯合編碼、基本頻率調變、以及包絡增強三種訊號處理方式所處理後的訊號,其之間的平均音強錯配型態(Intensity Mismatch Pattern)分別為5.9、6.4,以及6.0,錯配型態越低,振幅旋律保留得越好;除此之外,原始訊號與處理後的訊號,其間的包絡差值指標數值在進階聯合編碼、基本頻率調變、以及包絡增強三種訊號處理方式的數值分別為0.11、0.11,以及0.15。包絡差值指標越高,頻域包絡保留得越好;原始訊號與處理後的訊號在頻域品質的差異,透過三種訓號處理方式的表現分別為2.10、2.16,以及2.19,對數頻譜距離越低,頻域品質越佳。 綜合 主觀與客觀分析,進階聯合編碼策略在頻域及時域品質的保留上有最好的效果,另外進階聯合編碼策略和節律線索合併使用時,在旋律辨識上有最高的準確性。zh_TW
dc.description.abstractAcoustic music features include various spectral and temporal cues, which play a critical role in music perception. The cochlear implant (CI) coding strategy designs primarily to convey speech, but music distortion remains. This study examined the relative contribution of pitch and rhythm to melody recognition, as well as the music quality from three CI coding strategies, Advanced Combinational Encoder (ACE), Fundamental Frequency Modulation (F0mod), and Envelope Enhancement (EE). The database of melody children′s songs consisted of 30 popular songs in Taiwan. Each melody children song had two music features, temporal (pitch) and spectral (rhythm). Then, the melody was processed with three CI coding strategies using NCU-CI, a cochlear implant simulation software. Then, a pilot subjective listening test was conducted to measure the melody perception from the processed stimuli using the familiar melody identification (FMI) test by collecting the percent correct and response time. There were 5 NH participants who joined the FMI test. The FMI test was begun with selecting 15 of 30 songs by the participants. Then, the participants tested with 15 chosen songs with different music features (pitch and rhythm) that were processed with three CI strategies in each FMI test session. Each participant had 90 tested stimuli (15 songs x 2 music features x 3 CI coding strategies). In total, 23 melody children songs were chosen by the participants. The results indicated that when the rhythm cues were preserved in melody recognition, the FMI performance was significantly better (p<0.05) by having a higher percent correct and faster response time than the pitch cues. Also, the melody with the rhythm cues processed with the ACE strategy achieved the best score, 86.80%. Based on the 23 chosen melody children′s songs, it was further examined using objective analysis. The envelope difference index (EDI), the intensity mismatch pattern, and the log spectral distance (LSD) were used to assess the quality of processed music compared to original music based on temporal and spectral features for the objective tests. The average intensity mismatch pattern between original and processed by the ACE, F0mod, and EE strategy were 5.9, 6.4, and 6.0, respectively. The lower the mismatch pattern, the better the amplitude melody was preserved. Then, the EDI value between original and processed by the ACE, F0mod, and EE strategy were 0.11, 0.11, and 0.15, respectively. The higher the EDI value, the better the temporal envelope was preserved. Then, the spectral quality differences between original and processed by the ACE, F0mod, and EE strategy were 2.10, 2.16, and 2.19, respectively. The lower the LSD, the better the spectral quality. In line with the subjective and objective analysis, the ACE strategy was the most outperforming the CI coding strategy in preserving spectral and temporal quality in our study. The results also revealed that the rhythm cues combined with the ACE strategy performed the highest accuracy in the melody recognition.en_US
DC.subject熟悉旋律辨認(familiar melody identification)zh_TW
DC.subject時域品質(temporal quality)zh_TW
DC.subject頻域品質(spectral quality)zh_TW
DC.subject音樂(music)zh_TW
DC.subject節律(rhythm)zh_TW
DC.subject音調(pitch)zh_TW
DC.subject人工電子耳(cochlear implant)zh_TW
DC.subject人工電子耳模擬(cochlear implant simulation)zh_TW
DC.subject聲碼器(vocoder)zh_TW
DC.subjectfamiliar melody identificationen_US
DC.subjecttemporal qualityen_US
DC.subjectspectral qualityen_US
DC.subjectmusicen_US
DC.subjectrhythmen_US
DC.subjectpitchen_US
DC.subjectcochlear implanten_US
DC.subjectcochlear implant simulationen_US
DC.subjectvocoderen_US
DC.title不同人工電子耳編碼策略之兒歌感知時頻分析zh_TW
dc.language.isozh-TWzh-TW
DC.titleTemporal and Spectral Analysis of Children Song Perception with Different Simulated Cochlear Implant Coding Strategiesen_US
DC.type博碩士論文zh_TW
DC.typethesisen_US
DC.publisherNational Central Universityen_US

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明