English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 80990/80990 (100%)
造訪人次 : 40251145      線上人數 : 345
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/60102


    題名: 從評論中找出最具代表性的K個句子;Find the most K informative sentences from comments
    作者: 周慧鈴;Chou,Hui-ling
    貢獻者: 資訊管理學系
    關鍵詞: 評論摘要;NGD;PMI;K-means;Comments summarization;NGD;PMI;K-means
    日期: 2013-06-21
    上傳時間: 2013-07-10 12:06:43 (UTC+8)
    出版者: 國立中央大學
    摘要: 本研究為針對飯店評論做摘要之處理。在早期有關文件之摘要主要是以單一文件為主,因此在分析文件時,不太需要注意內文中是否有衝突之意見,也毋須在意文件發布之時間。然而隨著科技的進步與網路論壇的興起,透過網路來發表自身觀點或經驗的人日益漸增,在這些網站上每天都有數以萬計的評論產生,倘若以早期摘要方式替這些評論做摘要,可能會產生突意見與時間、作者差異之問題。因此本研究針對該類型之文件提出一個新的摘要方式,並以最具代表性的K個句子做為摘要輸出。我們主要考量因素有以下四點,分別為不同作者之可信程度、不同時間點之影響程度、評論本身幫助與否,以及衝突意見之分析。
    本研究將以Tripadvisor上的飯店評論作為分析之資料,並請二十位受試者比較三種不同方式所挑選出之句子,依據效果好壞排序三種方式。而三種方式分別為僅考慮語句外部特徵(A)、僅考慮內文分析(B),以及本研究方法(C)。倘若實驗結果C大於A與B,即可證明加入作者與時間差異的考量是必要的以及本研究方法是可行且效果良好的。

    This study focuses on the summarization of hotel comments. In early work of summarization, the document we analyze is single-document. Therefore, we do not need to pay attention to neither conflict opinion nor the time the document posted. However, with the advance of science and technology and the flourishing of online forums, more and more people write their own opinions and experiences and post them by internet. Every day there are tens of thousands of new comments on these websites. If we summarize these comments by early work of summarization, they may cause the problems of conflict opinion and the differences in time and author. Therefore, this study proposes a new method of summarization for this type of documents and uses the most K informative sentences as a summarization. We have four main considerations and they include the credibility of each author, the influence of difference in time, the helpfulness of comment and the analysis of conflict opinion.
    This study adopts hotels’ reviews on Tripadvisor as our dataset and asks twenty subjects comparing sentences which are chosen by three different methods and sorting the method by effect. The three method respectively are only considering the external features of sentences (A), only considering the analysis of context (B) as well as we propose method (C). If the result of experiment shows that C method is better than A and B method, it can prove that it is necessary to take into account the difference in authors and time and the method we proposed is feasible and effective.
    顯示於類別:[資訊管理研究所] 博碩士論文

    文件中的檔案:

    檔案 描述 大小格式瀏覽次數
    index.html0KbHTML554檢視/開啟


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明