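Name Ting-Wen Su (蘇鼎文)  Department Department of Information Management
Thesis Title A Study of a User Interest Model Applying the Multi-Store Memory System to Forgetting Factors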
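Abstract (Chinese) As users' reading habits shift from paper to digital and from computers to phones and tablets, users can read anytime and anywhere. This has increased the average amount of reading and has also created an environment in which attention is more easily scattered. Facing these new challenges, a system must not only handle concept drift in user interests but also solve the real-time processing problem caused by the exponential growth of online data.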
Abstract (English) As users' reading channels shift from print to digital and from desktop computers to mobile devices, it becomes easier to read anywhere, at any time. This shift not only increases the average amount of reading but also causes user interests to drift more often. To address these problems, an information filtering system has to adapt to concept drift in user interests and train fast enough to cope with the explosive growth of streaming documents.
This research uses different centrality algorithms to find the core set of keywords in the user profile's graph. By using only the strong keywords instead of all the keywords in the graph, the system builds the user profile faster and even improves filtering performance. In addition, the research designs the user profile's interests on the basis of the Atkinson-Shiffrin multi-store model: the framework divides user interests into long-term and short-term interests. Short-term interests use a dynamic forgetting factor to adapt to concept drift in the user profile. In contrast, long-term interests use a static forgetting factor to retain information for the system to use. The experiments show that the short-term forgetting factor adapts to concept drift more quickly and that the long-term forgetting factor preserves more information in the interests. Overall, the proposed system achieves a better F-measure and higher efficiency than the research it is compared with.
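The two mechanisms named in the abstract, core-keyword selection by centrality and interest decay by a forgetting factor, can be illustrated with a minimal Python sketch. This is not the thesis's implementation. The function names (`degree_centrality`, `core_keywords`, `decay`), the toy keyword graph, and the factor values 0.5 and 0.9 are all assumptions made for illustration. Degree centrality stands in for whichever centrality algorithms the thesis actually compares, and the short-term factor is held fixed here, whereas the abstract describes it as dynamic.

```python
from collections import defaultdict


def degree_centrality(edges):
    """Degree centrality of each node in an undirected keyword graph."""
    neighbors = defaultdict(set)
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)
    n = len(neighbors)
    if n <= 1:
        return {node: 0.0 for node in neighbors}
    # Normalize by the maximum possible degree, n - 1.
    return {node: len(adj) / (n - 1) for node, adj in neighbors.items()}


def core_keywords(edges, k):
    """Return the k most central keywords (ties broken alphabetically)."""
    scores = degree_centrality(edges)
    ranked = sorted(scores, key=lambda word: (-scores[word], word))
    return ranked[:k]


def decay(weight, factor, steps=1):
    """Exponential forgetting (assumed form): weight * factor ** steps."""
    return weight * factor ** steps


# Hypothetical keyword co-occurrence graph for one user profile.
edges = [
    ("python", "code"),
    ("python", "data"),
    ("python", "model"),
    ("data", "model"),
    ("code", "test"),
]
core = core_keywords(edges, 2)

# Assumed factors: a small one forgets quickly (short-term interest),
# one close to 1 retains information longer (long-term interest).
short_term = decay(1.0, factor=0.5, steps=3)
long_term = decay(1.0, factor=0.9, steps=3)
```

Under these assumptions, an interest weight decayed with the smaller factor falls off much faster than one decayed with the factor near 1, which mirrors the abstract's contrast between quick adaptation and longer retention.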
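Keywords (Chinese) ★ Multi-store memory system model (多重記憶系統模型)
★ User model (使用者模型)
★ Forgetting factor (遺忘因子)
★ Document filtering (文件過濾)
★ Graph centrality (圖形居中)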
Keywords (English) ★ Atkinson–Shiffrin memory model
★ User Profile
★ Forgetting Factor
★ Document Filter
★ Graph Centrality
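Table of Contents Abstract (Chinese)
Abstract (English)
Table of Contents
List of Figures
List of Tables
1. Introduction
1-1 Research Background
1-2 Research Motivation
1-3 Research Objectives
2. Related Work
2-1 Preprocessing Framework
2-2 Term-Graph Representations of Documents
2-2-1 Term Frequency
2-2-2 NGD Distance
2-2-3 Term Topics
2-3 Centrality Algorithms
2-4 Concept Drift
2-4-1 Sliding Window
2-4-2 Forgetting Factor
2-4-3 Forgetting Factor Combined with User Interests
3. System Architecture
3-1 System Workflow
3-2 Preprocessing Workflow of This Study
3-3 User Model
3-3-1 Topic Term Graph
3-3-2 Core Identification in the Term Graph
3-4 Topic Mapping
3-5 Document Filtering
3-6 Long-Term and Short-Term Forgetting Factors
3-7 Life Cycle of Topic Interests
3-8 Short-Term Interest Removal
4. Experiments
4-1 Experimental Setup
4-2 Dataset
4-3 Threshold Experiments
4-3-1 Topic Threshold Ratio
4-3-2 Interest Removal Ratio Experiment
4-3-3 Core Size and Algorithm Experiment
4-4 Concept Drift Experiment
4-5 User Model Learning Performance Experiment
5. Conclusions and Future Research Directions
5-1 Conclusions
5-2 Future Work
References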
Advisor Shi-Jen Lin (林熙禎)  Approval Date 2015-7-27
