多媒體應用之語音辨識系統; Multimedia Applications for Speech Recognition System

NCU Institutional Repository > 資訊電機學院 > 電機工程研究所 > 博碩士論文 > Item 987654321/10375

請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/10375

題名:	多媒體應用之語音辨識系統;Multimedia Applications for Speech Recognition System
作者:	溫家誠;Chia-chen Wen
貢獻者:	電機工程研究所
關鍵詞:	語音辨識系統;Speech Recognition System
日期:	2008-06-13
上傳時間:	2009-09-22 12:14:27 (UTC+8)
出版者:	國立中央大學圖書館
摘要:	隨著電子多媒體系統的迅速發展，使得多媒體服務有無限可能。其中藍芽系統已成為無線通訊技術發展的新領域，這代表著所有的應用將可透過藍芽技術整合功能，而能夠讓使用者更便利的利用這項服務，關鍵詞萃取語音辨識系統就成了重要的方式之ㄧ。在本論文中，我們首先將針對語音辨識發展理念規劃一套多媒體應用語音辨識系統，模擬使用者使用多媒體系統的情況。所提出的服務則基於駕駛者在車內最常使用的操控模式，包括聽音樂、打電話及導航系統等等，透過問答方式的人機互動介面讓操作者感到友善，且本系統中將採用語音合成來模擬人聲以作為回應。我們以關鍵詞萃取為主的辨識技術可提升系統的移植性與擴展性，而階層式架構設計可於各種環境下增加語音辨識的可靠度。然而環境噪音以及雜音干擾，我們將進行強健性語音辨識，利用強建語音參數及模型調適等方面的技術來降低測試環境的影響。最後，我們再對系統進一步增建個人化使用的設計，藉由語者辨識技術提供專屬的服務，且再運用語者模型調適技術來強化系統的辨識效能。 Vehicle electronic multimedia system with the rapid development of the car makes the services provide immense possibilities. In which, the Bluetooth wireless technology has become a new area, and then all the applications will be integrated through this technology. However, the crucial role to play in that is speech recognition. In this thesis, we develop a speech recognition system of multimedia applications in car environment to mimic the using of multimedia for the driver and passengers. Our service is based on the most common use of control modes, including listening to music, phone and navigation systems, and so on. The user-friendly interface will be made through the interactive question-and-answer approach. Speech synthesis is adopted in our system to simulate human voices as response. Keyword spotting-based recognition system can improve the portability and system scalability and the design of hierarchical structure can increase speech recognition reliability in car environment. However, the vehicle noise and interference from vehicle environment is still a challenge, so we carry out the robustness speech recognition. Robust features and model adaptation methods are adopted to reduce the environmental impact of testing. Finally, we build a more personalized system for providing exclusive services. By the speaker recognition techniques, we also expect to strengthen the recognition system performance further.
顯示於類別:	[電機工程研究所] 博碩士論文

文件中的檔案:

檔案	大小	格式	瀏覽次數

在NCUIR中所有的資料項目都受到原著作權保護.

社群 sharing

資料載入中.....