Event Source Page Discovery via Reinforcement Learning

Name

Yu-Ching Liao (廖于晴)

Department

Department of Computer Science and Information Engineering

Thesis Title

Event Source Page Discovery via Reinforcement Learning
(應用強化式學習探勘活動來源網站)

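Related Theses

★ A Study on Recognizing Schedule Invitation Emails and Extracting Irregular Time Expressions
★ Design of the NCUFree Campus Wireless Network Platform and Development of Application Services
★ Design and Implementation of a Semi-Structured Data Extraction System for the Internet
★ Mining and Applications of Non-Simple Browsing Paths
★ Improvements to Incremental Association Rule Mining
★ Applying the Chi-Square Test of Independence to Associative Classification
★ Design and Study of a Chinese Information Extraction System
★ Visualization of Non-Numerical Data and Clustering That Combines Subjective and Objective Criteria
★ A Study of Associated Word Groups in Document Summarization
★ Cleaning Web Pages: Page Segmentation and Data Region Extraction
★ Design and Study of a Question Answering System Using Sentence Classification and Ranking
★ Efficient Mining of Compact Frequent Serial Episode Patterns in Temporal Databases
★ Axis Arrangement of Star Coordinates for Cluster Visualization
★ Automatically Generating Web Crawling Programs from Browsing History
★ A Study of Template and Data Analysis for Dynamic Web Pages
★ A Study on Automating Data Integration Across Homogeneous Web Pages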

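The access permission for this electronic thesis is immediate open access.
Once the full text is open, it is licensed to users only for personal, non-profit searching, reading, and printing for academic research purposes.
Please comply with the Copyright Act of the Republic of China. Do not reproduce, distribute, adapt, repost, or broadcast the work without authorization, to avoid violating the law.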

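Abstract (Chinese)

As transportation has become more convenient, travel has become routine, and the way people travel has gradually changed: rather than simple sightseeing, travelers want a deeper experience of local customs and culture, and taking part in distinctive local events is one way to get it. However, searching websites for local events is a burden for people unfamiliar with the area, because both government and private organizations usually post event information on their own websites, scattered across the WWW. We therefore want an intelligent crawler system that can automatically and efficiently discover and collect "event source pages". This thesis describes how we train an intelligent crawler model to discover a website's event source pages starting from the site's start page. Because the number of event source pages differs from site to site, the number of steps the crawler takes on each site varies, and we also describe how to set a threshold that tells the model whether to stop exploring a site. The model is trained with reinforcement learning combined with multitask learning. Because we have only limited labeled data, we adopt a two-stage training framework: the first stage pre-trains the model on a small amount of labeled data, and the second stage fine-tunes it using unlabeled data and our "event source page classifier". With the proposed method, our crawler model achieves 74% precision on real-world data.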
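The stop-or-continue behavior described in the abstract can be illustrated with a small sketch. This is not the thesis's model: `crawl_site`, `get_links`, `score_link`, and `max_steps` are hypothetical names introduced here, the learned policy is replaced by a plain scoring function, and because the abstract does not say how the threshold is adapted, it is simply passed in as a fixed number.

```python
from typing import Callable, Iterable, List


def crawl_site(
    start_url: str,
    get_links: Callable[[str], Iterable[str]],   # hypothetical: outgoing links of a page
    score_link: Callable[[str], float],          # hypothetical stand-in for the learned policy
    threshold: float,                            # assumption: fixed here, adaptive in the thesis
    max_steps: int = 20,                         # assumption: safety cap on clicks per site
) -> List[str]:
    """Follow the best-scoring link until no candidate reaches the threshold."""
    visited = [start_url]
    frontier = list(get_links(start_url))
    for _ in range(max_steps):
        candidates = [u for u in frontier if u not in visited]
        if not candidates:
            break  # nothing left to explore on this site
        best = max(candidates, key=score_link)
        if score_link(best) < threshold:
            break  # the "stop exploring" decision
        visited.append(best)  # the "continue exploring" decision: click the link
        frontier.extend(get_links(best))
    return visited


# Toy site used only for illustration.
links = {"home": ["events", "about"], "events": ["events/2", "contact"]}
scores = {"events": 0.9, "events/2": 0.8, "about": 0.2, "contact": 0.1}

path = crawl_site(
    "home",
    lambda u: links.get(u, []),
    lambda u: scores.get(u, 0.0),
    threshold=0.5,
)
print(path)  # ['home', 'events', 'events/2']
```

With a higher threshold the same toy site is abandoned earlier, which is why the number of steps per site is not fixed.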

Abstract (English)

As transportation has become more convenient, traveling is no longer just about sightseeing or taking professional photos but increasingly about joining local events to experience local culture. Most event organizers, such as governments, enterprises, and other organizations, post event information somewhere on their own websites. The problem of efficiently finding, for any given website, the page where event announcements are listed is called event source discovery. In this paper, we present a deep reinforcement learning model for training an event source discovery agent. We train the crawler in two stages, pre-training and fine-tuning. In the pre-training phase, the model is trained on limited labeled data, and each episode has a fixed number of time steps. In the fine-tuning phase, the agent is trained on unlabeled data with a reward system based on an event source page classifier. The agent learns whether to continue or stop exploring through an adaptive threshold, so the number of steps in each episode varies during fine-tuning. The proposed agent achieves 74% Return-On-Investment (i.e., precision) at a unit cost of 1.3 (the number of clicks per event source page) on the real-world data set.
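The two reported metrics can be made concrete with a short sketch. The abstract defines unit cost as the number of clicks per event source page. The formula used below for Return-On-Investment (true event source pages found divided by clicks) is an assumption about how precision is computed here, not something the abstract states. `CrawlLog` and `roi_and_unit_cost` are hypothetical names, and the numbers are made up.

```python
from dataclasses import dataclass
from typing import Iterable, Tuple


@dataclass
class CrawlLog:
    clicks: int          # pages the agent clicked on one website
    sources_found: int   # clicked pages that really are event source pages


def roi_and_unit_cost(logs: Iterable[CrawlLog]) -> Tuple[float, float]:
    """Aggregate ROI and unit cost over all crawled websites."""
    logs = list(logs)
    total_clicks = sum(log.clicks for log in logs)
    total_found = sum(log.sources_found for log in logs)
    # Assumption: ROI (precision) = event source pages found / clicks spent.
    roi = total_found / total_clicks if total_clicks else 0.0
    # From the abstract: unit cost = clicks per event source page found.
    unit_cost = total_clicks / total_found if total_found else float("inf")
    return roi, unit_cost


# Made-up logs for two websites, for illustration only.
logs = [CrawlLog(clicks=4, sources_found=3), CrawlLog(clicks=6, sources_found=5)]
print(roi_and_unit_cost(logs))  # (0.8, 1.25)
```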

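Keywords (Chinese)

★ 強化式學習 (Reinforcement Learning)
★ 網路探勘 (Web Mining)
★ 活動來源頁面探勘 (Event Source Page Discovery)
★ 活動來源頁面分類 (Event Source Page Classification)

Keywords (English)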

★ Reinforcement Learning
★ Web Mining
★ Event Source Page Discovery
★ Event Source Page Classification

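Table of Contents

Chinese Abstract
English Abstract
Table of Contents
List of Figures
List of Tables
1. Introduction
1.1 Problem Description
1.2 Motivation
1.3 Research Goals
2. Related Work
3. Event Source Page Discovery
3.1 Task Overview
3.2 Action Set Feature Analysis
3.3 Methods and Model
3.3.1 Training Method and Algorithm
3.3.2 Model Description
4. Event Source Page Classifier
4.1 Task Overview
4.2 Feature Analysis
4.3 Model Description
4.4 Experiments and Performance Analysis
4.4.1 Data Analysis
4.4.2 Evaluation Method
4.4.3 Baseline Models
4.4.4 Experimental Analysis
5. Conclusion and Future Work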

Advisor

Chia-Hui Chang (張嘉惠)

Approval Date

2021-08-04
