博碩士論文 100423034 完整後設資料紀錄

DC 欄位 語言
DC.contributor資訊管理學系zh_TW
DC.creator周慧鈴zh_TW
DC.creatorHui-ling Chouen_US
dc.date.accessioned2013-6-21T07:39:07Z
dc.date.available2013-6-21T07:39:07Z
dc.date.issued2013
dc.identifier.urihttp://ir.lib.ncu.edu.tw:444/thesis/view_etd.asp?URN=100423034
dc.contributor.department資訊管理學系zh_TW
DC.description國立中央大學zh_TW
DC.descriptionNational Central Universityen_US
dc.description.abstract本研究為針對飯店評論做摘要之處理。在早期有關文件之摘要主要是以單一文件為主,因此在分析文件時,不太需要注意內文中是否有衝突之意見,也毋須在意文件發布之時間。然而隨著科技的進步與網路論壇的興起,透過網路來發表自身觀點或經驗的人日益漸增,在這些網站上每天都有數以萬計的評論產生,倘若以早期摘要方式替這些評論做摘要,可能會產生突意見與時間、作者差異之問題。因此本研究針對該類型之文件提出一個新的摘要方式,並以最具代表性的K個句子做為摘要輸出。我們主要考量因素有以下四點,分別為不同作者之可信程度、不同時間點之影響程度、評論本身幫助與否,以及衝突意見之分析。 本研究將以Tripadvisor上的飯店評論作為分析之資料,並請二十位受試者比較三種不同方式所挑選出之句子,依據效果好壞排序三種方式。而三種方式分別為僅考慮語句外部特徵(A)、僅考慮內文分析(B),以及本研究方法(C)。倘若實驗結果C大於A與B,即可證明加入作者與時間差異的考量是必要的以及本研究方法是可行且效果良好的。zh_TW
dc.description.abstractThis study focuses on the summarization of hotel comments. In early work of summarization, the document we analyze is single-document. Therefore, we do not need to pay attention to neither conflict opinion nor the time the document posted. However, with the advance of science and technology and the flourishing of online forums, more and more people write their own opinions and experiences and post them by internet. Every day there are tens of thousands of new comments on these websites. If we summarize these comments by early work of summarization, they may cause the problems of conflict opinion and the differences in time and author. Therefore, this study proposes a new method of summarization for this type of documents and uses the most K informative sentences as a summarization. We have four main considerations and they include the credibility of each author, the influence of difference in time, the helpfulness of comment and the analysis of conflict opinion. This study adopts hotels’ reviews on Tripadvisor as our dataset and asks twenty subjects comparing sentences which are chosen by three different methods and sorting the method by effect. The three method respectively are only considering the external features of sentences (A), only considering the analysis of context (B) as well as we propose method (C). If the result of experiment shows that C method is better than A and B method, it can prove that it is necessary to take into account the difference in authors and time and the method we proposed is feasible and effective.en_US
DC.subject評論摘要zh_TW
DC.subjectNGDzh_TW
DC.subjectPMIzh_TW
DC.subjectK-meanszh_TW
DC.subjectComments summarizationen_US
DC.subjectNGDen_US
DC.subjectPMIen_US
DC.subjectK-meansen_US
DC.title從評論中找出最具代表性的K個句子zh_TW
dc.language.isozh-TWzh-TW
DC.titleFind the most K informative sentences from commentsen_US
DC.type博碩士論文zh_TW
DC.typethesisen_US
DC.publisherNational Central Universityen_US

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明