摘要(英) |
This study combines Term Frequency-Inverse Document Frequency technique with compatibility and applies it to the “Regulations of National Central University and Extensions of Off-campus Regulations” and establishes them on the cloud platform for tax classification.
Term Frequency-Inverse Document Frequency technique can only present one type of measurement and quantitative method and is not capable of presenting diverse selection. Therefore, through the combination of compatibility, Cosine Similarity, Hierarchical Clustering and other techniques, a regulation can produce different results in different compatibility. A wide range of selection can be produced through classification, helping users to find the proper regulations which is related.
keyword:text mining、TF-IDF、Cosine Similarity、Hierarchical Clustering
This study combines Term Frequency-Inverse Document Frequency technique with compatibility and applies it to the “Regulations of National Central University and Extensions of Off-campus Regulations” and establishes them on the cloud platform for tax classification.
Term Frequency-Inverse Document Frequency technique can only present one type of measurement and quantitative method and is not capable of presenting diverse selection. Therefore, through the combination of compatibility, Cosine Similarity, Hierarchical Clustering and other techniques, a regulation can produce different results in different compatibility. A wide range of selection can be produced through classification, helping users to find the proper regulations which is related.
keyword:text mining、TF-IDF、Cosine Similarity、Hierarchical Clustering
This study combines Term Frequency-Inverse Document Frequency technique with compatibility and applies it to the “Regulations of National Central University and Extensions of Off-campus Regulations” and establishes them on the cloud platform for tax classification.
Term Frequency-Inverse Document Frequency technique can only present one type of measurement and quantitative method and is not capable of presenting diverse selection. Therefore, through the combination of compatibility, Cosine Similarity, Hierarchical Clustering and other techniques, a regulation can produce different results in different compatibility. A wide range of selection can be produced through classification, helping users to find the proper regulations which is related. |
參考文獻 |
[1] P.‐N. Tan, M. Steinbach, V. Kumar, Introduction to Data Mining, Addison‐Wesley, Pearson International Edition, 2018.
[2] A. Ochiai. Zoogeographical studies on the solenoid fish found in japan and its neighboring regions. Bull, Japan Soc. Sci. Fisheries 22, 526–530, 1957.
[3] J. J. Barkman, Phytosociology and ecology of cryptogamic epiphytes, 1958.
[4] Chowdhury, G. G. Introduction to modern information retrieval, Facet publishing, 2010.
[5] G. Salton, E. A. Fox, H. Wu, Extended Boolean information retrieval. Cornell University, 1022–1036, 1982.
[6] G. Salton, C. Buckley, Term-weighting approaches in automatic text retrieval, Information processing & management, 24(5), 513-523, 1988.
[7] V. Zappala, A. Cellino, P. Farinella, Z. Knezevic, Asteroid families. I-Identification by hierarchical clustering and reliability assessment, The Astronomical Journal, 100, 2030-2046, December 1990.
[8] W. J. Frawley, G. Piatetsky-Shapiro, C. J. Matheus, Knowledge discovery in databases: An overview, AI magazine, 13(3), 57-57, 1992.
[9] M. Bramer, Principles of data mining (Vol. 180), London: Springer, 2007.
[10] I. H. Witten, E. Frank, M. A. Hall, C. J. Pal, Data Mining: Practical machine learning tools and techniques, Morgan Kaufmann, 2016.
[11] K. A. Taipale, Data mining and domestic security: Connecting the dots to make sense of data, Columbia Science and Technology Law Review, 5(2), 2003.
[12] C. Pitts, The End of Illegal Domestic Spying? Don′t Count on It. Washington Spectator, 2007.
[13] F. Schwed, J. Zweig, Where are the Customers′ Yachts? Or A Good Hard Look at Wall Street (p. 212). New York: Simon and Schuster, 1940.
[14] T. Menzies, Y. Hu, Data mining for very busy people. Computer, 36(11), 22-29, 2003.
[15] R. R. Bouckaert, E. Frank, M. A. Hall, G. Holmes, B. Pfahringer, P. Reutemann, I. H. Witten, WEKA−Experiences with a Java Open-Source Project. Journal of Machine Learning Research, 11(Sep), 2533-2541, 2010.
[16] J. Forcier, P. Bissex, W. J. Chun, Python web development with Django. Addison-Wesley Professional, 2008. |