使用圖神經網路偵測 PTT 的低活躍異常帳號

以作者查詢圖書館館藏

、以作者查詢臺灣博碩士

、以作者查詢全國書目

、勘誤回報

、線上人數：48

、訪客IP：13.59.69.58

姓名

王瑞緣(JUI-YUAN WANG) 查詢紙本館藏

畢業系所

資訊工程學系

論文名稱

使用圖神經網路偵測 PTT 的低活躍異常帳號
(Using Graph Neural Networks to Detect Inactive Spammers on PTT)

相關論文

★ 透過網頁瀏覽紀錄預測使用者之個人資訊與性格特質	★ 透過矩陣分解之多目標預測方法預測使用者於特殊節日前之瀏覽行為變化
★ 動態多模型融合分析研究	★ 擴展點擊流：分析點擊流中缺少的使用者行為
★ 關聯式學習：利用自動編碼器與目標傳遞法分解端到端倒傳遞演算法	★ 融合多模型排序之點擊預測模型
★ 分析網路日誌中有意圖、無意圖及缺失之使用者行為	★ 基於自注意力機制產生的無方向性序列編碼器使用同義詞與反義詞資訊調整詞向量
★ 探索深度學習或簡易學習模型在點擊率預測任務中的使用時機	★ 空氣品質感測器之故障偵測--基於深度時空圖模型的異常偵測框架
★ 以同反義詞典調整的詞向量對下游自然語言任務影響之實證研究	★ 結合時空資料的半監督模型並應用於PM2.5空污感測器的異常偵測
★ 藉由權重之梯度大小調整DropConnect的捨棄機率來訓練神經網路	★ 針對個別使用者從其少量趨勢線樣本生成個人化趨勢線
★ 基於雙變量及多變量貝他分布的兩個新型機率分群模型	★ 一種可同時更新神經網路各層網路參數的新技術— 採用關聯式學習及管路化機制

檔案

[Endnote RIS 格式]

[Bibtex 格式]

[相關文章]

[文章引用]

[完整記錄]

[館藏目錄]

[檢視]

[下載]

本電子論文使用權限為同意立即開放。
已達開放權限電子全文僅授權使用者為學術研究之目的，進行個人非營利性質之檢索、閱讀、列印。
請遵守中華民國著作權法之相關規定，切勿任意重製、散佈、改作、轉貼、播送，以免觸法。

摘要(中)

隨著社群媒體的興起，雇用「公關公司」在網路上散播不實消息，成
為左右時事輿論的新興手法，公關公司的大量帳號，常被各大論壇視為
異常帳號。國內外皆有學者以深度學習偵測異常帳號，但我們發現，現
階段偵測異常帳號的論文中，並沒有針對帳號的活躍程度作探討。
本篇論文中，我們依據帳號在限定時間內的活動次數定義出「活
躍值」的概念，我們觀察到，用簡單的卷積類神經網路 (Convolutional
Neural Network) 模型，即可在偵測高活躍異常帳號的任務中達到 0.9169 的 ROC 曲線下的面積 (AUROC)，但是偵測低活躍的異常帳號卻只有 0.7830，顯示出偵測低活躍異常帳號是非常棘手的任務。我們利用使用者與使用者之間的關係建立社群網路，以提供額外的特徵作為訓練的資料，
並引入圖神經網路，成功改善偵測低活躍異常帳號的任務。

摘要(英)

With the rise of social media, hiring public relations companies to spread fake news on the Internet has become an emerging method to manipulate public opinions. These large number of accounts owned by public relations companies are regarded as spammers by most online forums. Researchers have used deep learning techniques to detect abnormal accounts.
However, we found that these studies likely conducted experiments mainly on the active users.
In this thesis, we define the concept of ”Active Value” based on the number of activities of an account within a unit period. For active users, even a simple Convolutional Neural Network model can distinguish a spammer from a regular user: the area under the ROC curve (AUROC) achieves
0.9169. However, for the inactive users, the score drops to 0.7830. The result indicates that detecting inactivity spammers is much more challenging. We use user-to-user relationships to build a social network. We apply graph neural networks to the social network and extract additional social features as training clues. Experimental results show that these strategies better distinguish the spammers from regular users, especially when these users have limited activities.

關鍵字(中)

★ 網軍
★ 異常帳號
★ 低活躍異常帳號
★ 低活躍帳號
★ 圖神經網路
★ 批踢踢
★ 批踢踢實業坊

關鍵字(英)

★ spammer
★ social networks
★ inactive user
★ inactive spammer
★ graph convolutional networks
★ PTT

論文目次

目錄
頁次
摘要 iv
Abstract v
致謝 vii
目錄 viii
圖目錄 xi
表目錄 xiii
一、緒論 1
二、相關研究 3
2.1 不同網路平台上的異常帳號 .......................................... 3
2.2 偵測異常帳號的研究方法 ............................................. 4
2.3 偵測台灣 PTT 異常帳號 .............................................. 5
三、研究模型及方法 7
3.1 活躍值 (Active Value).................................................. 7
3.2 資料集 ..................................................................... 7
3.2.1 PTT 的介紹及統計數字 ...................................... 7
3.2.2 PTT 官方認定的異常帳號 ................................... 8
3.2.3 帳號的篩選機制 ................................................ 8
viii
目錄目錄
3.3 訓練特徵 .................................................................. 10
3.3.1 帳號參與的文章的總留言數 ................................. 10
3.3.2 帳號參與的文章的推噓總分 ................................. 12
3.3.3 帳號的活動時間 ................................................ 13
3.4 GNN 模型介紹........................................................... 14
3.4.1 Graph Convolutional Networks ............................. 14
3.4.2 Topology Adaptive Graph Convolutional Networks.... 16
3.4.3 Graph Attention Network .................................... 17
四、實驗結果 21
4.1 實驗設置 .................................................................. 21
4.1.1 參數設置 ......................................................... 21
4.1.2 比較模型 ......................................................... 22
4.1.3 評估指標 ......................................................... 23
4.2 實驗結果與討論 ......................................................... 24
4.2.1 「偵測高活躍異常帳號」與「偵測低活躍異常帳號」
是否難度相同? ........................................................... 25
4.2.2 GNN-Method 是否有成功改善「偵測低活躍異常帳
號」的任務 ............................................................... 33
4.2.3 在 Baseline 加入 Social Network 的特徵，是否也能
夠改善「偵測低活躍異常帳號」的任務? .......................... 34
4.2.4 GNN-Method 模型加入 Social Network 的特徵，是
否在「偵測低活躍異常帳號」的任務中表現更為出色? ........ 38
4.2.5 模型認為前 K 個最有可能為異常的帳號，用 F1-
Score, Recall 與 Precision 評估效能。 ............................. 43
4.2.6 為什麼 F1-Score 並不會隨著活躍值上升?................ 55
五、總結與未來展望 56
ix
目錄目錄
參考文獻 57
附錄 59

參考文獻

參考文獻
[1] Nhut-Lam Nguyen, Ming-Hung Wang*, Yu-Chen Dai, and Chyi-Ren, “Understanding Malicious Accounts in Online Political Discussions: A Multilayer Network Approach,” MDPI Sensor, 2021.
[2] 蔡榮峰, 陳詠君, “激化情緒對立的口水戰-PTT 八卦板疫情輿論分析,” 國家安全
研究所, 資訊工業策進會, 2021.
[3] Hongyu Gao, Jun Hu, Christo Wilson, Zhichun Li, Yan Chen, “Detecting and Characterizing Social Spam Campaigns,” IMC ’10: Internet Measurement Conference,
2010.
[4] Nhut-Lam Nguyen, Ming-Hung Wang, Chyi-Ren Dow, “Learning to Recognize
Sockpuppets in Online Political Discussions,” IEEE Systems Journal, 2021.
[5] X. Hu, J. Tang, and H. Liu, “Online social spammer detection,” in In AAAI, 2014.
[6] Yuqing Lu, Lei Zhang, Yudong Xiao, Yangguang Li, “Simultaneously detecting fake
reviews and review spammers using factor graph model,” WebSci ’13: Proceedings
of the 5th Annual ACM Web Science Conference, 2013.
[7] Yongji Wu, Defu Lian, Yiheng Xu, Le Wu, Enhong Chen, “Graph Convolutional
Networks with Markov Random Field Reasoning for Social Spammer Detection,”
AAAI Technical Track: Applications, 2020.
[8] Leyan Deng, Chenwang Wu, Defu Lian, Yongji Wu, Enhong Chen, “Markov-Driven
Graph Convolutional Networks for Social Spammer Detection,” IEEE Transactions
on Knowledge and Data Engineering, 2022.
[9] Yu Liu, Bin Wu, Bai Wang, Guanchen Li, “SDHM: A Hybrid Model for Spammer
Detection in Weibo,” 2014 IEEE/ACM International Conference on Advances in
Social Networks Analysis and Mining (ASONAM 2014), 2014.
[10] Hao Fu, Xing Xie, Yong Rui, “Leveraging Careful Microblog Users for Spammer Detection,” WWW ’15 Companion: Proceedings of the 24th International Conference
on World Wide Web, 2015.
[11] L. A. Junting Ye, “Discovering opinion spammer groupsby network footprints,”
ECML Lecture Notes in Computer Science, 2015.
參考文獻
[12] 蔡秉承, “網軍判斷系統—以批踢踢電子布告欄系統為例,” 國立中山大學應用數學
系研究所, 2020.
[13] F. Benevenuto, G. Magno, T. Rodrigues, and V. Almeida, “Detecting spammers on
twitter,” in In Collaboration, Electronic messaging, Anti-Abuse and Spam Conference (CEAS, 2010.
[14] Enhua Tan, Lei Guo, Songqing Chen, Xiaodong Zhang, Yihong Zhao, “Spammer
Behavior Analysis and Detection in User Generated Content on Social Networks,”
2012 IEEE 32nd International Conference on Distributed Computing Systems, 2012.
[15] Xia Hu, Jiliang Tang, Yanchao Zhang, Huan Liu, “Social spammer detection in
microblogging,” IJCAI ’13: Proceedings of the Twenty-Third international joint
conference on Artificial Intelligence, 2013.
[16] Guan Wang, Sihong Xie, Bing Liu, Philip S. Yu, “Review Graph Based Online
Store Review Spammer Detection,” IEEE International Conference on Data Mining, 2011.
[17] Ming-Hung Wang, Nhut-Lam Nguyen, Shih-chan Dai, Po-Wen Chi, and Chyi-Ren,
“Understanding Potential Cyber-Armies in Elections: A Study of Taiwan,” MDPI
Sustainability, 2020.
[18] Thomas N. Kipf, Max Welling, “Semi-Supervised Classification with Graph Convolutional Networks,” ICLR 2017 conference submission, 2017.
[19] Jian Du, Shanghang Zhang, Guanhang Wu, José M. F. Moura & Soummya Kar,
“Topology Adaptive Graph Convolutional Networks,” ICLR 2018 Conference Blind
Submission, 2018.
[20] Petar Veličković, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro
Liò, Yoshua Bengio, “Graph Attention Networks,” ICLR 2018 Conference Blind
Submission, 2018.

指導教授

陳弘軒

審核日期

2022-7-19

推文