參考文獻 |
[1] J. P. Bigham, A. C. Cavender, R. S. Kaminsky, C. M. Prince, and T. S. Robison, "Transcendence: enabling a personal view of the deep web," Proceedings of the 13th international conference on Intelligent user interfaces, 2008, pp. 169-178.
[2] C.-H. Chang, M. Kayed, M. R. Girgis, and K. F. Shaalan, "A Survey of Web Information Extraction Systems," IEEE Transactions on Knowledge and Data Engineering, vol. 18, no. 10, pp. 1411-1428, 2006.
[3] M. Dontcheva, S. M. Drucker, D. Salesin, and M. F. Cohen, "Relations, cards, and search templates: user-guided web data integration and layout," Proceedings of the 20th annual ACM symposium on User interface software and technology, 2007, pp. 61-70.
[4] M. Dontcheva, S. M. Drucker, G. Wade, D. Salesin, and M. F. Cohen, "Summarizing personal web browsing sessions," Proceedings of the 19th annual ACM symposium on User interface software and technology, 2006, pp. 115-124.
[5] O. Etzioni, M. Cafarella, D. Downey, A.-M. Popescu, T. Shaked, S. Soderland, D. S. Weld, and A. Yates, "Methods for domain-independent information extraction from the web: an experimental comparison," Proceedings of the 19th national conference on Artifical intelligence, 2004, pp. 391-398.
[6] C.-N. Hsu and C.-C. Chang, "Finite-state transducers for semi-structured text mining," Proceedings of IJCAI-99 Workshop on Text Mining: Foundations, Techniques and Applications, 1999, pp. 38-49.
[7] M. Kayed and C.-H. Chang, "Page-Level Web Data Extraction from Template Pages," IEEE Transactions on Knowledge and Data Engineering, vol. 22, no. 2, pp. 249-263, 2010.
[8] N. Kushmerick, "Wrapper Induction for Information Extraction." Ph.D. University of Washington, Seattle, WA, 1997.
[9] N. Kushmerick, "Regression testing for wrapper maintenance," Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence, 1999, pp. 74-79.
[10] N. Kushmerick, "Wrapper Verification," World Wide Web Journal, vol. 3, no. 2, pp. 79-94, 2000.
[11] N. Kushmerick, "Wrapper Induction: Efficiency and Expressiveness," Artificial Intelligence, vol. 118, no. 1-2, pp. 15-68, 2000.
[12] K. Lerman, S. N. Minton, and C. A. Knoblock, "Wrapper Maintenance: A Machine Learning Approach," Journal of Artificial Intelligence Research, vol. 18, no. 1, pp. 149-181, 2003.
[13] J.-H. Li, "Differentiating Templates and Data Values from Semi-Structured Web Pages." Master's Computer Science and Information Engineering at National Center University, 2005.
[14] L. Liu, C. Pu, and W. Han, "XWRAP: An XML-Enabled Wrapper Construction System for Web Information Sources," Proceedings of the 16th International Conference on Data Engineering, 2010, pp. 611-621.
[15] P. C. Mahalanobis, "On the generalised distance in statistics,", 2 ed In Proceedings National Institute of Science, 1936, pp. 49-55.
[16] X. Meng, D. Hu, and C. Li, "Schema-Guided Wrapper Maintenance for Web-Data Extraction," Proceedings of the 5th ACM international workshop on Web information and data management, 2003, pp. 1-8.
[17] X. Meng, H. Lu, M. Gu, and H. Wang, "SG-WRAP: A Schema-Guided Wrapper Generator," Proceedings of the 18th International Conference on Data Engineering, 2002, p. 331.
[18] I. Muslea, S. Minton, and C. A. Knoblock, "Hierarchical Wrapper Induction for Semistructured Information Sources," Autonomous Agents and Multi-Agent Systems, vol. 4, no. 1-2, pp. 93-114, 2001.
[19] A. Pan, J. Raposo, M. Alvarez, J. Hidalgo, and A. Vina, "Semi-Automatic Wrapper Generation for Commercial Web Sources," 2002, pp. 265-283.
[20] E.-H. Pek, X. Li, and Y. Liu, "Web Wrapper Validation," In Proceedings of APWeb, 2003.
[21] J. Raposo, A. Pan, M. Alvarez, and J. Hidalgo, "Automatically maintaining wrappers for semi-structured web sources," Data & Knowledge Engineering, vol. 61, no. 2, pp. 331-358, 2007.
[22] D. E. Simmen, M. Altinel, V. Markl, S. Padmanabhan, and A. Singh, "Damia: data mashups for intranet applications," Proceedings of the 2008 ACM SIGMOD international conference on Management of data, 2008, pp. 1171-1182.
[23] C.-T. Ting, "User-centric Web Data Integration: Design and Implementation of Gadget on Demand System." Master's Computer Science and Information Engineering at National Center University, 2008.
[24] Base Class Library, http://msdn.microsoft.com/en-us/netframework/aa569603.aspx
[25] Dapper: The Data Mapper, “http://www.dapper.net/”
[26] HTML Tidy Library Project, http://tidy.sourceforge.net/
[27] Html Agility Pack, http://htmlagilitypack.codeplex.com/Wikipage
[28] XML, http://www.w3.org/XML/
[29] XML Path, http://www.w3.org/TR/xpath/
[30] XML Schema, http://www.w3.org/XML/Schema/
[31] XML Schema Elements, http://msdn.microsoft.com/en-us/library/ms256142(v=VS.90).aspx
|