English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 78937/78937 (100%)
造訪人次 : 39182037      線上人數 : 394
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/75934


    題名: 基於歷史資訊向量與主題專精程度向量應用於尋找社群問答網站中專家;Finding experts in Community Question Answering websites using History Post Embedding and Topic Expertise Model features
    作者: 陳沛伃;Chen, Pei-Yu
    貢獻者: 資訊工程學系
    關鍵詞: 詞嵌入;社群問答網站;TEM;佩奇排名;主題模型;專家;Word2Vec;CQA;TEM;PageRank;Topic Model;Experts
    日期: 2018-01-17
    上傳時間: 2018-04-13 11:22:51 (UTC+8)
    出版者: 國立中央大學
    摘要: 隨著科技的日新月異,我們隨時都要精進自己以獲取新知,避免被世界淘汰,於是帶動諸如Stack Overflow, Yahoo Answers, Quora, Zhihu (知乎)等社群問答網站(Community Question Answering,CQA)的興起。使用者可以在上面提問、回答問題,作為彼此交流與學習的平台。

    雖然社群問答網站的興起帶給使用者很大的便利,但是由於問題數量眾多,多數問題通常杳無音訊,想要及時得到問題正確的回覆,不可否認需要運氣與時間的等待。我們認為,若可於CQA 網站中正確地找出專家,則可藉由把對的問題推薦給有能力回答的專家,便可提升使用者互動,解決問題之效率。

    本研究首先透過非監督方法 -- Yang, Liu, et al. (2013)所建的TEM (Topic Expertise Model) 模型,擷取使用者對每個主題下專精程度的特徵向量,並利用History post embedding,以詞嵌入(Word Embedding)的特性,擷取語意程度的特徵向量,再利用問題與回答者之相似度作為推薦專家之依據。我們鎖定Stack Overflow (世界前幾大的程式設計領域的問答網站)作為研究目標,並獲得良好之準確率,並期望研究成果可於其他CQA 網站使用。

    本篇論文的貢獻是將TEM模型與詞嵌入的歷史資訊做結合,當在社群網路結構並非那麼完整時有效的把對的問題配對給對這個問題有能力回答的專家以提升社群網路參予度低的問題。


    ;With the ever-changing technology, we humans have to be willing to keep on learning in order to avoid being demoted by the world. Therefore, the reasons above led to the rise of the community question answering websites, such as Stack Overflow, Yahoo Answers, Quora, Zhihu (知乎), and so on and so forth. Users can ask questions, answer questions, exchange and discuss ideas with each other in the above platform.

    Although the rise of community question answering websites can surely bring great convenience to users, there is still room for improvement. Due to the large numbers of questions, most questions usually receive no response or get inappropriate answers. It is without doubt to rely on luck and time to get correct answers in time. Therefore, we believe that if we can find experts precisely in CQA websites, we can improve the efficiency of the participation rate by routing right questions to experts.

    In this study, we firstly utilize TEM (Topic Expertise Model), which is an unsupervised model published by Yang, Liu, et al. (2013), for capturing the degree of expertise of question and answerer under different topic. Furthermore, we utilize History Post Embedding, which is published in this thesis by using word embedding techniques, to extract semantic meanings to enhance the understanding of question sets. Finally, we combine the vector of topical expertise with History Post Embedding and perform a recommendation formula to rank experts. We target Stack Overflow, which is one of the biggest computer programming field CQA websites in the world, as our research goal and obtain good result. Moreover, we expect the research result to be available on other CQA websites.

    The main contribution of this thesis is combining TEM model with distributed representation of user historical information which can solve the problem of low participation rate in CQA websites when social network structure is not so complete.
    顯示於類別:[資訊工程研究所] 博碩士論文

    文件中的檔案:

    檔案 描述 大小格式瀏覽次數
    index.html0KbHTML254檢視/開啟


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明