參考文獻 |
[1] G. O. Arocena and A. O. Mendelzon, “WebOQL: Restructuring Documents, Databases, and Webs” Int’l Conf. Data Eng (ICDE), pp.24-33, 1998.
[2] Saeid Asadi, Guowei Yang, Xiaofang Zhou, Yuan Shi, Boxuan Zhai, Wendy Wen-Rong Jiang, “Pattern-Based Extraction of Addresses from Web Page Content” APWeb, pp.407-418, 2008.
[3] D. Buttler, L. Liu, and C. Pu, “A Fully Automated Object Extraction System for the World Wide Web” Int’l Conf. Distributed Computing Systems (ICDCS), pp.361-370, 2001.
[4] Deng Cai, Shipeng Yu, Ji-Rong Wen, and Wei-Ying Ma. “Extracting Content Structure for Web Pages Based on Visual Representation” Asia Pacific Web Conf. (APWeb), pp.406-417, 2003.
[5] Lin Can, Zhang Qian, Xiaofeng Meng, Wenyin Lin, “Postal Address Detection from Web Documents” WIRI, pp.40-45, 2005.
[6] C.-H. Chang, C.-N. Hsu, and S.-C. Lui, “Automatic Informatio Extraction from Semi-Structured Web Pages by Pattern Discovery” Decision Support Systems, pp.129-147, 2003.
[7] Chia-Hui Chang and Chia-Yi Huang. “On Chinese Postal Address and Associated Information Extraction” Japanese Society for Artificial Intelligence (JSAI), 2012.
[8] Chia-Hui Chang and Shu-Ying Li. “MapMarker: Extraction of Postal Addresses and Associated Information for General Web Pages” IEEE/WIC/ACM Web Intelligence, pp.105-111, 2010.
[9] V. Crescenzi and G. Mecca, “Grammars Have Exceptions” Information Systems, pp.539-565, 1998.
[10] Thomas G. Dietterich, “Machine Learning for Sequential Data” SSPR/SPR, pp.15-30, 2002.
[11] Dayne Freitag: Information Extraction from HTML, “Application of a General Machine Learning Approach” AAAI/IAAI, pp.517-523, 1998.
[12] J. Hammer, J. McHugh, and H. Garcia-Molina, “Semistructure Data: The TSIMMIS Experience” East-European Workshop Advances in Databases and Information Systems (ADBIS), pp.1-8, 1997.
[13] C. -N. Hsu and M. -T. Dung, “Generating Finite-State Transducer for Semi-Structured Data Extraction from the Web” Information Systems, pp.521-538, 1998.
[14] N. Kushmerick, “Wrapper Induction: Efficiency and Expressiveness” Artificial Intelligence, pp.15-68, 2000.
[15] B. Liu, R. L. Grossman, and Y. Zhai, “Mining Data Records in Web Pages” Proc. Int’l Conf. Knowledge Discovery and Data Mining (KDD), pp.601-606, 2003.
[16] L. Liu, C. Pu, and W. Han, “XWRAP: An XML-Enabled Wrapper Construction System for Web Information Sources” Int’l Conf. Data Eng. (ICDE), pp.611-621, 2000.
[17] Wei Liu, Xiaofeng Meng, Weiyi Meng. “ViDE: A Vision-Based Approach for Deep Web Data Extraction” Transactions on Knowledge and Data Engineering, IEEE, pp.447-460, 2010.
[18] I. Muslea, S. Minton, and C. A. Knoblock, “Hierarchical Wrapper Induction for Semi-Structured Information Sources” Autonomous Agents and Multi-Agent Systems, vol.4, nos.1/2, pp.93-114, 2001.
[19] P. Nagabhushan, S. A. Angadi, Basavaraj S. Anami, “A Fuzzy Symbolic Inference System for Postal Address Component Extraction and Labelling” FSKD, pp.937-946, 2006.
[20] A. Sahuguet and F. Azavant, “Building Intelligent Web Applications Using Lightweight Wrappers” Data and Knowledge Eng, pp.283-316, 2001.
[21] Zheyuan Yu. “High Accuracy Postal Address Extraction From Web Pages” Master Thesis, Dalhousie University. 2007.
[22] Y. Zhai and B. Liu, “Web Data Extraction Based on Partial Tree Alignment” Proc. Int’l World Wide Web Conf. (WWW), pp.76-85, 2005.
[23] CRF++ Yet Another CRF toolkit : http://crfpp.sourceforge.net/
[24] HTML Tidy : http://tidy.sourceforge.net/
[25] Yahoo API 斷章取義 : http://tw.developer.yahoo.com/cas/
[26] Yahoo!奇摩搜尋引擎 : http://tw.yahoo.com/
|