Name 魏晨珊 (Chen-Shan Wei)   Graduate department Department of Business Administration
Thesis title 適用於多領域虛假評論之判斷模型
(Devising a cross-domain model to detect deceptive review comments)
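Abstract (Chinese) In online shopping, reviews have come to exert a major influence on consumers and on sellers' sales strategies. Among these, positive [...]
[...] for example Li, Ott, Cardie, and Hovy (2014), Ren and Ji (2017), and W. Liu, Jing, and Li [...]
This thesis uses the truthful and deceptive review data collected by Ott et al. (2011) and Li, Ott, Cardie, and Hovy (2014) in three domains (hotel, restaurant, doctor). Drawing on psychological theory, it combines the Stimuli Organism Response (S-O-R) framework with LIWC (Linguistic Inquiry and Word Count) to build a classification model usable across domains, and adds frequent features extracted from word2vec word vectors to overcome the cross-[...] of earlier papers [...]
The experimental results show that with method one, which feeds the feature weights of S-O-R and review words to the classification algorithms, the best-performing method, a DNN, reaches an accuracy of 63.6%. With method two, which feeds the word-vector frequent features to the classification algorithms, the best-performing method, random forest, reaches an accuracy of 73.75%.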
Abstract (English) Online reviews have a huge impact not only on consumers' shopping behavior but also on online stores' marketing strategies. Positive reviews positively influence consumers' buying decisions, so some sellers who want to boost their sales volume hire spammers to write undeserved positive reviews that promote their products. Some current research on detecting fake reviews relies on text features, and the resulting models reach high accuracy. However, when the same model is tested on another dataset, its accuracy decreases sharply.
Related research has gradually begun to explore the identification of fake reviews across domains, for example Li, Ott, Cardie, and Hovy (2014); Ren and Ji (2017); and W. Liu, Jing, and Li (2019). Whether the model is built with text features or with more comprehensive methods such as neural networks, accuracy still decreases. Moreover, these methods do not explain why the model can be applied to cross-domain prediction.
In this research, we use the fake and truthful reviews collected by Ott et al. (2011) and Li, Ott, Cardie, and Hovy (2014) in three domains (hotel, restaurant, doctor). We build a cross-domain detection model based on the Stimuli Organism Response (S-O-R) framework combined with LIWC (Linguistic Inquiry and Word Count), and add frequent features extracted from word2vec word vectors to overcome the drop in accuracy.
According to the experimental results, with method one, which uses the feature weights of S-O-R and review words, the best-performing classifier, a DNN, reaches an accuracy of 63.6%. With method two, which uses frequent features of word vectors, the best-performing classifier, random forest, reaches an accuracy of 73.75%.
Keywords (Chinese) ★ Deceptive review detection
★ Stimuli-Organism-Response (S-O-R) framework
★ word2vec
Table of contents Contents
Chinese Abstract
Abstract
Contents
List of Figures
List of Tables
1. Introduction
1-1 Research Background and Motivation
1-2 Research Method and Objectives
1-3 Benefits and Contributions
1-4 Research Structure
2. Literature Review
2-1 Research on Online Reviews
2-2 Research on Detecting Deceptive Reviews
2-3 Stimulus-Organism-Response (S-O-R) Framework
3. Research Method and Design
3-1 Research Framework and Steps
3-2 SOR Category Selection Method
3-3 Method One: Feature Weights of SOR and Reviews
3-4 Method Two: Word-Vector Frequent Features
4. Experiments
4-1 Experimental Data
4-1-1 Data Preprocessing
4-2 SOR Category Data
4-3 Feature Weights of Review and SOR Words
4-4 Experiment One: Feature Weights of SOR and Reviews
4-5 Experiment Two: Word-Vector Frequent Features
5. Conclusions and Suggestions
5-1 Research Results
5-2 Suggestions for Future Work
References
Appendix 1
Appendix 2
Appendix 3
Advisor 許秉瑜   Approval date 2020-1-13
