博碩士論文 103552021 完整後設資料紀錄

DC 欄位 語言
DC.contributor資訊工程學系在職專班zh_TW
DC.creator翁梓勝zh_TW
DC.creatorTzu-Sheng Wengen_US
dc.date.accessioned2016-7-20T07:39:07Z
dc.date.available2016-7-20T07:39:07Z
dc.date.issued2016
dc.identifier.urihttp://ir.lib.ncu.edu.tw:444/thesis/view_etd.asp?URN=103552021
dc.contributor.department資訊工程學系在職專班zh_TW
DC.description國立中央大學zh_TW
DC.descriptionNational Central Universityen_US
dc.description.abstract最近這幾年來,隨著網際網路 (World Wide Web) 的發展,社群問答的網站在最近這段時間也成長的非常多,大量的問答網站擁有非常多的資訊形成網路線上一個很有價值的知識寶庫,然而有一個現象,這些網站都會遇到的就是會有重複的問題,因此問題檢索的主要任務就是用來協助從存檔裡面找出之前已經被回答過的相關問題,然而詞語上同義詞性質的多樣性是問題檢索的一個極大挑戰,有些研究方法利用計算新的問題以及存檔問題之間相互關係的機率來處理這樣的狀況,另外也有許多研究是著重在字串之間的相似度。 在這篇論文裡,我們提出了一個方法首先利用 CBoW 的模型使用華碩 ROG 論壇的資料庫來做訓練資料,然後利用訓練出來的資料計算輸入的新問題以及存檔的問題之間的相似程度,與其他研究不同的地方在於我們將問題的標題以及問題的完整描述分開來看,將他們當作是兩個不同的特徵來做計算,另外我們也將使用者的榮譽點數拿來當做我們評估的一個要素, 我們的實驗顯示,對 ROG 論壇的資料庫做出來的結果優於其他的方法。zh_TW
dc.description.abstractIn recent years, there has been much development of community based question and answer (cQA) site. The number of large-scale Q&A sites has significantly increased over time, and the information on these sites represents a valuable online knowledge pool. However, one issue with such sites is the problem of duplicate questions. The task of question retrieval aims to find previously answered semantically similar questions in cQA archives. Nevertheless, synonymous lexical variations pose a big challenge for question retrieval. Some approaches address this issue by calculating the probability of correlation between new questions and archived questions. Much recent research has also focused on surface string similarity among questions. In this paper, we propose a method that first builds a continuous bag-of-word (CBoW) model with data from Asus’s Republic of Gamers (ROG) forum and then determines the similarity between a given new question and the Q&As in our database. Unlike most other studies, we calculate the similarity between the given question and the archived questions and descriptions separately with two different features. In addition, we factor user reputation into our ranking model. Our experimental results on ROG forum dataset show that our CBoW model with reputation features outperforms other top methods.en_US
DC.subject社群zh_TW
DC.subject論壇zh_TW
DC.subject問題zh_TW
DC.subject檢索zh_TW
DC.subjectQuestionen_US
DC.subjectRetrievalen_US
DC.subjectCommunityen_US
DC.subjectForumen_US
DC.title社群論壇之問題檢索zh_TW
dc.language.isozh-TWzh-TW
DC.titleQuestion Retrieval of Community Forumen_US
DC.type博碩士論文zh_TW
DC.typethesisen_US
DC.publisherNational Central Universityen_US

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明