Semantic Based Public Opinion Analysis System

Name

曾昱智 (YU-ZHI ZENG)

Department

Department of Computer Science and Information Engineering, In-service Master's Program

Thesis title

基於語意之輿情分析系統
(Semantic Based Public Opinion Analysis System)

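Related theses

★ Single and Multi-Label Environmental Sound Recognition with Gaussian Process
★ Embedded System Implementation of Beamforming and Audio Preprocessing
★ Application and Design of Speech Synthesis and Speaker Conversion
★ Design and Application of a High-Quality Dictation System
★ Calcaneal Fracture Recognition and Detection in CT Images Using Deep Learning and Speeded-Up Robust Features
★ Personalized Collaborative Filtering Clothing Recommendation System Based on a Style Vector Space
★ RetinaNet Applied to Face Detection
★ Trend Prediction for Financial Products
★ A Study on Predicting Age and Aging Genes with Integrated Deep Learning Methods
★ A Study of End-to-End Speech Synthesis for Mandarin Chinese
★ Application and Improvement of ORB-SLAM2 on the ARM Architecture
★ Deep-Learning-Based Trend Prediction for Exchange-Traded Funds
★ Exploring the Correlation Between Financial News and Financial Trends
★ Emotional Speech Analysis Based on Convolutional Neural Networks
★ Predicting Alzheimer's Disease Progression and Stroke Surgery Survival with Deep Learning Methods
★ A Method and System for Automatically Generating Recipes with an LLM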

Files

Full text in the system: never open to the public

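Abstract (translated from the Chinese)

In research on sentence-level sentiment analysis, accuracy is usually improved by adding factor rules, such as emotion keywords and manually defined emotion rules. Because these hand-crafted factors require large amounts of data and lengthy training, they tend to make the system architecture inflexible and its performance poor. With these requirements in mind, this thesis builds a system architecture that analyzes the semantic content of sentences while remaining fast and reasonably effective.
The system architecture has three main parts. The first is data training, which draws on research into emotion and emotion psychology. Emotion rules are generated mainly from the HowNet corpus and the technical report on Chinese part-of-speech analysis by the Chinese Knowledge Information Processing (CKIP) group at Academia Sinica. These rules produce sparse representation features, from which a sparse representation dictionary is built. Once the sparse coefficients are solved, the dictionary and coefficients of each of the two categories are used to reconstruct the original vector, the error against the original vector is computed, and the category with the smallest error is the assigned category. The second part, topic input and comment data acquisition, describes how the comments on popular discussion threads in current forums are obtained. The third part is data classification, which uses the training results to analyze the accuracy of topic classification. In the experiments, the thesis examines current popular topics one by one as the test cases for the sentiment classification module.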

Abstract (English)

In semantic sentiment analysis research, accuracy is normally increased by adding factor rules, such as emotional keywords and manually defined emotion rules. Because these manual factors demand large amounts of data and long training times, they usually make the system inflexible and reduce its efficiency. With these demands in mind, this thesis proposes a semantic sentiment analysis system that is fast and reasonably effective.
The system is organized into three parts. The first is data training, which draws on research in emotion and emotion psychology. Using linguistic resources such as HowNet and the CKIP technical report, we derive emotion rules, generate sparse representation features, and build a sparse representation dictionary. After solving for the sparse coefficients, the dictionary and coefficients of each of the two categories are used to reconstruct the original vector; the error against the original vector is then computed, and the category with the minimum error is the assigned category. The second is topic input and comment acquisition, which describes how to obtain the comments on hot topics in internet forums. The third is data classification, which analyzes the accuracy of topic classification based on the training results. In the experiments, current hot topics serve as the test cases for the sentiment classification module.
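The minimum-reconstruction-error rule described in the abstract can be sketched in a few lines. This is only an illustration under stated assumptions, not the thesis's implementation: the sparse solver here is a hand-written orthogonal matching pursuit (the record does not say which solver was used), the two category names `positive` and `negative` are hypothetical, and the 4-dimensional feature vectors in `TOY_DICTIONARIES` are made-up toy data rather than features derived from HowNet or CKIP.

```python
import numpy as np


def omp(D, y, n_nonzero):
    """Greedy orthogonal matching pursuit (an assumed stand-in sparse solver).

    Approximates y using at most n_nonzero columns (atoms) of D.
    """
    y = np.asarray(y, dtype=float)
    residual = y.copy()
    support = []
    coef = np.zeros(0)
    for _ in range(n_nonzero):
        # Pick the atom most correlated with the current residual.
        correlations = np.abs(D.T @ residual)
        if support:
            correlations[support] = 0.0  # never pick the same atom twice
        support.append(int(np.argmax(correlations)))
        # Refit all selected atoms jointly by least squares.
        coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ coef
        if np.linalg.norm(residual) < 1e-10:
            break
    x = np.zeros(D.shape[1])
    x[support] = coef
    return x


def classify(y, dictionaries, n_nonzero=2):
    """Assign y to the category whose atoms reconstruct it with least error."""
    y = np.asarray(y, dtype=float)
    labels = list(dictionaries)
    # Stack the per-category dictionaries and normalize each atom.
    # (Assumes no all-zero column.)
    D = np.hstack([np.asarray(dictionaries[l], dtype=float) for l in labels])
    D = D / np.linalg.norm(D, axis=0, keepdims=True)
    x = omp(D, y, n_nonzero)
    errors = {}
    start = 0
    for label in labels:
        width = np.asarray(dictionaries[label]).shape[1]
        part = slice(start, start + width)
        # Reconstruct y from this category's atoms and coefficients only.
        errors[label] = float(np.linalg.norm(y - D[:, part] @ x[part]))
        start += width
    return min(errors, key=errors.get), errors


# Hypothetical toy dictionaries: each column is one training feature vector.
TOY_DICTIONARIES = {
    "positive": np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0], [0.0, 0.0]]),
    "negative": np.array([[0.0, 0.0], [0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]),
}

if __name__ == "__main__":
    label, errors = classify([0.9, 0.4, 0.1, 0.0], TOY_DICTIONARIES)
    print(label, errors)
```

With the toy data, a vector whose mass sits mostly in the first two dimensions is reconstructed well by the `positive` atoms and poorly by the `negative` ones, so it is assigned to `positive`. Solving once over the stacked dictionary and then splitting the coefficients by category is one common way to realize this rule; solving separately per category is an equally plausible reading of the abstract.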

Keywords (Chinese)

★ Semantics
★ Public opinion

Keywords (English)

★ Semantic
★ Opinion Analysis

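Table of contents

Chinese Abstract
English Abstract
Acknowledgments
Contents
List of Figures
List of Tables
Chapter 1 Introduction
1.1 Preface
1.2 Research Motivation and Objectives
1.3 Chapter Organization
Chapter 2 Literature Review of Public Opinion Analysis
2.1 Public Opinion Analysis
2.1.1 Support Vector Machine (SVM)
2.1.2 k-Nearest Neighbors Algorithm (kNN)
2.2 Research Direction
Chapter 3 Research Method
3.1 System Architecture
3.2 Comment Data Acquisition
3.2.1 Data Source Acquisition
3.2.2 JSON Database
3.3 Training Data
3.3.1 Emotion Words
3.3.2 HowNet
3.3.2.1 Implementation of the HowNet Database
3.3.3 Sentence Processor
3.3.3.1 Chinese Word Segmentation System
3.3.3.2 Sentence Analysis
3.3.4 Data Mining
3.3.4.1 Association Rules
3.3.4.2 Apriori Algorithm
3.3.5 Feature Analysis
3.3.6 Sparse Representation Classification Model
3.4 Data Classification
Chapter 4 Experimental Results
4.1 Experimental Setup and Environment
4.2 Experimental Corpus
4.2.1 Uber
4.2.2 Baseball Game Analysis
4.2.3 Zhongxiao Bridge Demolition Analysis
4.3 Experimental Results
Chapter 5 Conclusion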

Advisor

王家慶(Jia-Ching Wang)

Approval date

2016-8-25
