博碩士論文 104423009 完整後設資料紀錄

DC 欄位 語言
DC.contributor資訊管理學系zh_TW
DC.creator張昇暉zh_TW
DC.creatorSheng-Hui Changen_US
dc.date.accessioned2017-7-26T07:39:07Z
dc.date.available2017-7-26T07:39:07Z
dc.date.issued2017
dc.identifier.urihttp://ir.lib.ncu.edu.tw:444/thesis/view_etd.asp?URN=104423009
dc.contributor.department資訊管理學系zh_TW
DC.description國立中央大學zh_TW
DC.descriptionNational Central Universityen_US
dc.description.abstract隨著新聞媒體的蓬勃發展,新聞的產生是一連串的文件串流,過往使用以NGD為基礎之方式,找出和標題關鍵字具高度相關性的主題關鍵字,然而此步驟由於透過Solr全文檢索系統進行查詢,需要耗費相當長的時間,而使用非監督式圖形化摘要方法,其建立文句網路之結果也不如預期,以致於品質仍有提升空間。將過去應用於英文自動摘要之技術直接使用於中文自動摘要,然而其品質與效率皆不如預期。本研究透過增加中文詞性辨別強化中文分詞結果、以TextRank為基礎之關鍵字擷取和鏈結分析法和考慮了文句位置特徵,不僅在單文件摘要得到了較好的品質,且速度也提升了許多。並以單文件摘要方法為基礎,以瀑布式架構結合文句分群進行動態多文件摘要,不但能產生隨時間演進之摘要,也能過濾文件間的冗餘訊息。zh_TW
dc.description.abstractWith the rapid development of news media, and the news is a series of document stream. In the past, the production methods of news summary were based on NGD method, it found the keywords which were highly correlated to the title. However, because that method is through the Solr full text search system, it would take lots of time. In the other way, there are still a lot of improvements in quality for the unsupervised graph-based method, since the result of the sentence network is not as good as expected. Nevertheless, when used the techniques for the English summaries in Chinese summaries directly, the quality and efficiency are still not as good as expected. In this study, I enhance the Chinese word segmentation with increasing the Chinese part of speech recognition. In addition, I take into account the positions of the sentence through adopting the TextRank-based keyword extraction and link-analysis method. Eventually, not only it improves the quality of the single document, but also the speed is well improved. At last, based on the single document summary method, I use the sentence grouping in the waterfall architecture to produce the dynamic multi-document summary. It can produce the summary with the evolution of time, and also filter the redundant message in the documents.en_US
DC.subject動態摘要zh_TW
DC.subject擷取式摘要zh_TW
DC.subject單文件摘要zh_TW
DC.subject多文件摘要zh_TW
DC.subject中文摘要zh_TW
DC.subjectDynamic Summarizationen_US
DC.subjectExtractive Summarizationen_US
DC.subjectGeneral Summarizationen_US
DC.subjectChinese Summarizationen_US
DC.title中文文件串流之摘要擷取研究zh_TW
dc.language.isozh-TWzh-TW
DC.type博碩士論文zh_TW
DC.typethesisen_US
DC.publisherNational Central Universityen_US

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明