Master's/Doctoral Thesis 91522071: Full Metadata Record

DC field  Value  Language
DC.contributor  Department of Computer Science and Information Engineering  zh_TW
DC.creator  楊智宇  zh_TW
DC.creator  Zhi-Yui Yang  en_US
dc.date.accessioned  2004-7-14T07:39:07Z
dc.date.available  2004-7-14T07:39:07Z
dc.date.issued  2004
dc.identifier.uri  http://ir.lib.ncu.edu.tw:88/thesis/view_etd.asp?URN=91522071
dc.contributor.department  Department of Computer Science and Information Engineering  zh_TW
DC.description  National Central University  zh_TW
DC.description  National Central University  en_US
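dc.description.abstract  In today's era of massive information growth and explosion, compounded by the variety and complexity of information types, it is increasingly difficult to find the correct and needed data. With Information Retrieval and Information Extraction methods, we can more easily retrieve and extract important information from large amounts of data. A question answering system combines information retrieval and information extraction: it searches a large collection of documents for passages related to a question and then extracts the answer. Information is usually located with information retrieval techniques, but the retrieved information is too broad and contains too much noise, so information extraction is added to condense it. Simply combining information extraction with information retrieval, however, still does not reveal the parts that are of real interest; named entity recognition is needed to identify and extract them. General information retrieval and information extraction cannot be applied directly to question answering systems, because questions and answers come in many types and involve the forms and methods of natural language, and because the way a sentence is expressed varies with lexical semantics and syntax. Most question answering systems therefore need further question classification and passage retrieval techniques, together with heuristics drawn from human observation, to resolve the relations between questions and answers. The more the human factor is involved, the higher the cost. And because the human knowledge involved spans a very wide range of domains, it is hard for a single general rule to cover every case. The question answering system designed in this thesis uses known information together with a classification algorithm to build system modules that automatically learn how to find the answers to questions. This Machine Learning technique lets the system learn important information from whatever training data becomes available in the future, without complex human intervention that would increase time and labor costs. This is the first attempt at a classification-based question answering system, and the experiments demonstrate its distinctiveness and superiority.  zh_TW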
dc.description.abstract  We live in an age of information explosion. Because information is so varied and complex, accurate data is increasingly difficult to find, and people tend to overlook important information that appears only briefly. Information Retrieval (IR) and Information Extraction (IE) techniques help people fetch accurate and important information from large collections of data more effectively. A Question Answering system (QA system) combines IR and IE techniques to search documents for answers to questions. IR usually relies on document retrieval to find relevant documents, but those documents may contain too much information and a great deal of noise. Hence, most QA systems use question classification and passage retrieval to improve accuracy, and then use named entity tagging to mark the proper nouns of interest. Because QA systems involve linguistic analysis, most of them rely on human observation to establish the relations between questions and answers; the more human effort is involved, the more time and money is spent. The QA system in this research is designed to use information that is already known, namely classified questions and correct answer sentences. By adding Machine Learning techniques, our QA system integrates this information with classification-based methods and can answer questions automatically, without human effort. This is the first time a QA system has used a classification-based architecture, and our experiments demonstrate its distinctiveness and superiority.  en_US
DC.subject  question answering  zh_TW
DC.subject  sentence categorization  zh_TW
DC.subject  answer extraction  zh_TW
DC.subject  feature extraction  zh_TW
DC.subject  question classification  zh_TW
DC.subject  document retrieval  zh_TW
DC.subject  passage retrieval  zh_TW
DC.subject  passage retrieval  en_US
DC.subject  document retrieval  en_US
DC.subject  answer extraction  en_US
DC.subject  question classification  en_US
DC.subject  question answering  en_US
DC.subject  sentence categorization  en_US
DC.title  Design and Study of a Question Answering System Using Ranking by Sentence Categorization  zh_TW
dc.language.iso  zh-TW  zh-TW
DC.title  Ranking by Sentence Categorization for Question Answering Systems  en_US
DC.type  Master's/doctoral thesis  zh_TW
DC.type  thesis  en_US
DC.publisher  National Central University  en_US
