Real-Time Speech Recognition Multimedia System


Please use this permanent URL to cite or link to this document: https://ir.lib.ncu.edu.tw/handle/987654321/48463

Title:	Real-Time Speech Recognition Multimedia System
Author:	邱介川 (Chieh-Chuan Chiu)
Contributor:	Graduate Institute of Electrical Engineering
Keywords:	Hidden Markov Model; keyword spotting
Date:	2011-07-19
Upload time:	2012-01-05 14:55:29 (UTC+8)
Abstract:	This thesis develops a real-time speech recognition multimedia system that integrates functions commonly used in a car and provides simple but practical services. Automatic recording detects in real time whether a command has been issued, and keyword spotting then determines which service the command refers to. Recognition uses pre-trained sub-syllable models, so the models do not have to be retrained when the services change, which improves recognition efficiency and system portability. The system adopts a hierarchical structure that progressively guides users as they become familiar with it, and it uses text-to-speech (TTS) synthesis to simulate a human voice when interacting with them. The windowed human-machine interface is implemented with Borland C++ 6.0 and achieves real-time recognition.
Appears in collections:	[Graduate Institute of Electrical Engineering] Theses and Dissertations
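
The hierarchical, keyword-driven dialogue described in the abstract can be illustrated with a minimal sketch. This is not the thesis implementation: the thesis spots keywords acoustically with sub-syllable Hidden Markov Models, whereas the sketch below assumes the utterance has already been turned into text and uses plain substring matching. The menu contents and the names `MENU`, `spot_keyword`, and `HierarchicalDialog` are hypothetical and chosen only for illustration.

```python
from typing import Iterable, Optional

# Hypothetical two-level menu: each top-level service keyword maps to the
# sub-commands accepted once that service has been selected.
MENU = {
    "music": ["play", "stop", "next"],
    "navigation": ["home", "office"],
    "phone": ["call", "hang up"],
}


def spot_keyword(utterance: str, keywords: Iterable[str]) -> Optional[str]:
    """Return the keyword that occurs earliest in the utterance, or None.

    Assumption: text substring matching stands in for acoustic keyword
    spotting, so surrounding filler words are simply ignored.
    """
    text = utterance.lower()
    hits = [(text.find(k), k) for k in keywords if k in text]
    return min(hits)[1] if hits else None


class HierarchicalDialog:
    """Walks the menu one level at a time and prompts for the valid keywords."""

    def __init__(self, menu):
        self.menu = menu
        self.service = None  # None means the dialogue is at the top level

    def handle(self, utterance: str) -> str:
        if self.service is None:
            found = spot_keyword(utterance, self.menu)
            if found is None:
                return "Please say one of: " + ", ".join(self.menu)
            self.service = found
            return f"{found}: say one of " + ", ".join(self.menu[found])
        command = spot_keyword(utterance, self.menu[self.service])
        if command is None:
            return "Please say one of: " + ", ".join(self.menu[self.service])
        # A command was recognized: report it and return to the top level.
        service, self.service = self.service, None
        return f"run {service}/{command}"
```

In a real system each returned prompt would be passed to a TTS engine rather than shown as text, and changing the services would only mean editing the menu, which mirrors the abstract's point that the acoustic models need no retraining.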

Files in this item:

File	Description	Size	Format	Views
index.html		0Kb	HTML	575

All items in NCUIR are protected by the original copyright.
