參考文獻 |
[1] C.-H. Chang, M. Kayed, M. R. Girgis, and K. F. Shaalan, "A Survey of Web
Information Extraction Systems," IEEE Transactions on Knowledge and Data
Engineering, vol. 18, no. 10, pp. 1411-1428, 2006.
[2] C.-H. Chang, S. Yang, C.-M. Liou, and M. Kayed, "Gadget creation for
personal information integration on web portals," IEEE International
Conference on Information Reuse and Integration, 2008.
[3] M. Dontcheva, S. M. Drucker, G. Wade, D. Salesin, and M. F. Cohen,
"Summarizing personal web browsing sessions," Proceedings of the 19th
annual ACM symposium on User interface software and technology, 2006, pp.
115-124.
[4] O. Etzioni, M. Cafarella, D. Downey, A.-M. Popescu, T. Shaked, S. Soderland,
D. S. Weld, and A. Yates, "Methods for domain-independent information
extraction from the web: an experimental comparison," Proceedings of the
19th national conference on Artifical intelligence, 2004, pp. 391-398.
[5] C.-N. Hsu and C.-C. Chang, "Finite-state transducers for semi-structured text
mining," Proceedings of IJCAI-99 Workshop on Text Mining: Foundations,
Techniques and Applications, 1999, pp. 38-49.
[6] M. Kayed and C.-H. Chang, "Page-Level Web Data Extraction from Template
Pages," IEEE Transactions on Knowledge and Data Engineering, vol. 22, no.
2, pp. 249-263, 2010.
[7] N. Kushmerick, "Wrapper Induction for Information Extraction." Ph.D.
University of Washington, Seattle, WA, 1997.
[8] N. Kushmerick, "Regression testing for wrapper maintenance," Proceedings of
the sixteenth national conference on Artificial intelligence and the eleventh
Innovative applications of artificial intelligence conference innovative
applications of artificial intelligence, 1999, pp. 74-79.
[9] N. Kushmerick, "Wrapper Verification," World Wide Web Journal, vol. 3, no. 2,
pp. 79-94, 2000.
[10] N. Kushmerick, "Wrapper Induction: Efficiency and Expressiveness,"
Artificial Intelligence, vol. 118, no. 1-2, pp. 15-68, 2000.
[11] K. Lerman, S. N. Minton, and C. A. Knoblock, "Wrapper Maintenance: A
Machine Learning Approach," Journal of Artificial Intelligence Research, vol.
18, no. 1, pp. 149-181, 2003.
[12] J.-H. Li, "Differentiating Templates and Data Values from Semi-Structured
Web Pages." Master’s Computer Science and Information Engineering at
National Center University, 2005.
[13] L. Liu, C. Pu, and W. Han, "XWRAP: An XML-Enabled Wrapper
Construction System for Web Information Sources," Proceedings of the 16th
International Conference on Data Engineering, 2000, pp. 611-621.
[14] X. Meng, D. Hu, and C. Li, "Schema-Guided Wrapper Maintenance for
Web-Data Extraction," Proceedings of the 5th ACM international workshop on
Web information and data management, 2003, pp. 1-8.
[15] X. Meng, H. Lu, M. Gu, and H. Wang, "SG-WRAP: A Schema-Guided
Wrapper Generator," Proceedings of the 18th International Conference on
Data Engineering, 2002, p. 331.
[16] I. Muslea, S. Minton, and C. A. Knoblock, "Hierarchical Wrapper Induction
for Semistructured Information Sources," Autonomous Agents and Multi-Agent
Systems, vol. 4, no. 1-2, pp. 93-114, 2001.
[17] E.-H. Pek, X. Li, and Y. Liu, "Web Wrapper Validation," In Proceedings of
APWeb, 2003.
[18] Y.-L. Lin, "Page-level Wrapper Verification based on Structure, Semantic
and Schema." Master’s Computer Science and Information Engineering
at National Center University, 2008.
[19] C. A. Knoblock Projects,
http://www.isi.edu/integration/people/knoblock/projects/prj_wrapper_maintain.html
[20] Document Object Model (DOM), http://www.w3.org/DOM/
[21] XML Schema Definition Language (XSD), http://www.w3.org/TR/xmlschema11-1/ |