參考文獻 |
1. A. Arasu and H. Garcia-Molina. Extracting Structured Data from Web Pages. In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, pp. 337-348, 2003
2. R. Agrawal and R. Srikant. On integrating catalogs. In Proceedings of the 10th International Conference on World Wide Web, pp. 603-612, 2001
3. M. K. Bergman. The Deep Web: Surfacing Hidden Value. http://www.brightplanet.com/technology/deepweb.asp, July 2001
4. S. Castano and V. D. Antonellis. A schema analysis and reconciliation tool environment for heterogeneous databases. In Proceedings of the 1999 International Symposium on Database Engineering & Applications, pp. 53-62, 1999
5. C. E. H. Chua, R. H. L. Chiang, and E.-P. Lim. Instance-based attribute identification in database integration. The International Journal on Very Large Data Bases, Volume 12, Issue 3, pp. 228-243, 2003
6. S. Chakrabarti, B. E. Dom, D. Gibson, J. Kleinberg, R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins. Mining the Link Structure of the World Wide Web. IEEE Computer, Volume 32, Number 8, pp. 60-67, 1999
7. K. C.-C. Chang, B. He, C. Li, M. Patel, and Z. Zhang. Structured databases on the web: Observations and implications. ACM SIGMOD Record, Volume 33, Issue 3, pp. 61-70, 2004
8. K. C.-C. Chang, B. He, C. Li, and Z. Zhang. Structured databases on the web: Observations and implications. Technical Report UIUCDCS-R-2003-2321, Department of Computer Science, UIUC, 2003
9. C.-H. Chang and S.-C. Kuo. OLERA: OnLine Extraction Rule Analysis for Semi-structured Documents. IEEE Intelligent Systems, Volume 19, Number 6, pp. 56-64, 2004
10. C.-H. Chang and S.-C. Lui. IEPAD: information extraction based on pattern discovery. In Proceedings of the 10th International Conference on World Wide Web, pp. 681-688, 2001
11. V. Crescenzi, G. Mecca, and P. Merialdo. ROADRUNNER: Towards Automatic Data Extraction from Large Web Sites. In Proceedings of 27th International Conference on Very Large Data Bases, pp. 109-118, 2001
12. A. Doan, P. Domingos, and A. Y. Halevy. Reconciling Schemas of Disparate Data Sources: A Machine-Learning Approach. In Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, pp. 509-520, 2001
13. A. Doan, J. Madhavan, R. Dhamankar, P. Domingos, A. Y. Halevy. Learning to match ontologies on the Semantic Web. The International Journal on Very Large Data Bases, Volume 12, Issue 4, pp. 303-319, 2003
14. B. He, K. C.-C. Chang, and J. Han. Discovering Complex Matchings across Web Query Interfaces: A Correlation Mining Approach. In Proceedings of the 2004 ACM SIGKDD International Conference on Knowledge Discovery and Data mining, pp. 148-157, 2004
15. C.-N. Hsu and M.-T. Dung. Generating finite-state transducers for semi-structured data. Information Systems, Volume 23, Issue 9, pp. 521-538, 1998
16. F. Hakimpour and A. Geppert. Resolving Semantic Heterogeneity in Schema Integration: an Ontology Based Approach. In Proceedings of the International Conference on Formal Ontology in Information Systems - Volume 2001, pp. 297-308, 2001
17. M. A. Hernández, R. J. Miller, and L. M. Haas. Clio: a semi-automatic tool for schema mapping. In Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, pp. 607, 2001
18. R. Ichise, H. Takeda and S. Honiden. Integrating Multiple Internet Directories by Instance-based Learning. In Proceedings of the 18th International Joint Conference on Artificial Intelligence, pp. 22-28, 2003
19. N. Kushmerick, D. S. Weld, and R. Doorenbos. Wrapper Induction for information extraction. In Proceedings of the 15th International Joint Conference on Artificial Intelligence, pp. 729-737, 1997
20. J. Madhavan, P. A. Bernstein and E. Rahm. Generic Schema Matching with Cupid. In Proceedings of the 27th International Conference on Very Large Data Bases, pp. 49-58, 2001
21. I. Muslea, S. Minton, and C. Knoblock. STALKER: learning extraction rules for semi-structured, Web-based information sources. In Proceedings of AAAI-98 Workshop on AI and Information Integration, pp. 74-81, 1998
22. S. Melnik, H. Garcia-Molona, and E. Rahm. Similarity Flooding: A Versatile Graph Matching Algorithm and its Application to Schema Matching. In Proceedings of the International Conference on Data Engineering, pp. 117-128, 2002
23. B. Magnini, L. Sera_ni, and M. Speranza. Linguistic based matching of local ontologies. In Proceedings of AAAI-02 workshop on Meaning Negotiation, 2002
24. L. Page and S. Brin. The Anatomy of a Search Engine. The 7th International WWW Conference, 1998
25. E. Rahm and P. A. Bernstein. A survey of approaches to automatically schema matching. The International Journal on Very Large Data Bases, Volume 10, Issue 4, pp. 334-350, 2001
26. S. Sarawagi, S. Chakrabarti, and S. Godbole. Cross-training: learning probabilistic mappings between topics. In Proceedings of the ninth ACM SIGKDD International Conference on Knowledge Discovery and Data mining, pp. 177-186, 2003
27. J. Wang and F. H. Lochovsky. Data Extraction and Label Assignment for Web Databases. In Proceedings of the 12th International Conference on World Wide Web, pp. 187-196, 2003
28. W. Wu, C. Yu, A. Doan, and W. Meng. An Interactive Clustering-based Approach to Integrating Source Query Interfaces on the Deep Web. In Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data, pp. 95-106, 2004
29. Z. Zhang, B. He, and K. C.-C. Chang. Understanding web query interfaces: Best effort parsing with hidden syntax. In Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data, 107-118, 2004
30. Z. Zhang, B. He, and K. C.-C. Chang. On-the-fly constraint mapping across web query interfaces. In Proceedings of the VLDB Workshop on Information Integration on the Web, 2004
31. D. Zhang and W. S. Lee. Web taxonomy integration through co-bootstrapping. In Proceedings of the 27th annual International Conference on Research and Development in Information Retrieval, pp. 410-417, 2004 |