摘要(英) |
Cross-level association rules mining with multiple minimum supports is an important generalization of the association rule mining problem. Instead of setting a single minimum support for all items, Liu et al. proposed a method, named MSApriori, to allow users for specifying multiple minimum supports to reflect the natures of the items. Because not all items are sold in a whole year, we should consider the transactions in the sold periods of items as we calculate the supports of items. Previous techniques for mining cross-level association rules with multiple minimum supports are most top-down, progressive depending method extended from Apriori algorithm, e.g. MMS_Cumulate. Previous approaches result in worse mining efficiency and incompleteness of mined rules.
In this research, we propose a bottom-up, simultaneously merging method based on CL_FP-tree, called CL_FP-tree (MIS), to improve the mining efficiency and completeness of mining cross-level association rule with multiple minimum supports. We extend the procedure, which proposed by Alex H.W. Lin (2003) for supports counting, not only to count the supports for all items, but also to judge the sold periods of all items as the basis of support counting. CL_FP-tree (MIS) aims to reduce the number of database rescans for finding the cross-level information.
We implement the CL_FP_tree (MIS) with real data and find the results that the efficiency of CL_FP_tree (MIS) is better than CL_Apriori(MIS). And the number of cross-level association rules found by CL_FP-tree (MIS) algorithm is more than CL_Apripri(MIS). Furthermore, we solve the problem that the number of 1-frequent items decreasing as long as the number of transactions increasing. |
參考文獻 |
Agrawal, R., Imielinski, T., & Swami, A. (1993). Mining Association Rules Between Sets of Items in Large Databases. Proc. 1993 ACM SIGMOD Int’l Conf. on Management of Data, Washington, D.C. (pp. 207 – 216).
Agrawal, R., H. Mannila, R. Shikant, H. Toivonen, A. I. Verkamo. (1996). Fast Discovery of Association Rules. Advances in Knowledge Discovery and Data Mining. (pp.307 – 328)
Agrawal, R., & Srikant, R. (1996). Mining Sequential Patterns: Generalizations and Performance Improvements. Research Report. IBM Research Division.
Agrawal, R. & Srikant, R. (1995). Mining sequential patterns. International Conference on Database Engineering, IEEE, 1995. (pp. 3–14).
Ale, J. & Rossi, G. (2000). An approach to discovering temporal association rules. Proc. of the 2000 ACM Symposium on Applied Computing, (pp. 294–300).
A. U. Tansel & N. F. Ayan. (1998). Discovery of Association Rules in Temporal Databases. American Association for Artificial Intelligence, NY. USA.
Han, J., & Fu, Y. (1995). Discovery of Multiple-Level Association Rules from Large Databases. Proc. of the 21st VLDB Conf., Zurich, Switzerland.
Han, J., & Kamber, M. (2001). Data Mining:Concept and Techniques. Morgan Kaufmann Publishers. San Francisco, CA.
Han, J., Pei, J., & Yin, Y. (2000). Mining Frequent Patterns without Candidate Generation. Proc. of the ACM Int. Conf. on Management of Data, Dallas, TX.
Hu, Y. H. & Chen, Y. L. (2003). Discovering and Maintaining Association Rules with Multiple Minimum Supports. Department of Information Management, National Central University Chung-Li 320, Taiwan , R.O.C
Liu, B., Hsu, W. & Ma,Y. (1999). Mining Association Rules with Multiple Minimum supports. Proc. 1999 ACM SIGMOD Int’l Conf. on Management of Data, San Diego, CA, USA.
Lu, H., Feng, L., & Han, J. (2000). Beyond Intratransaction Association Analysis:Mining Multidimensional Intertransaction Association Rules. ACM Press, NY, USA. 18(4).
Mannila, H., Toivonen, H., & Verkamo, A. I. (1995). Discovering frequent episodes in sequences (extended abstract). Proc. 1st Conference on Knowledge Discovery and Data Mining, AAAI Press, Menlo Park, (pp. 210-215).
Park, J. S., Chen, M., & Yu, P. S. (1999). An Effective Hash-Based Algorithm for Mining Association Rules. Database Systems for Advanced Applications. (pp. 179-186).
Tseng, M. C. & Lin, W. Y. (2001). Mining Generalized Association Rules with
Multiple Minimum Supports. I-Shou University, Kaohsiung 84008, Taiwan.
Wang. K., He, Y. & Han, J. (2003). Pushing Support Constrains Into Association Rules Mining. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 15, NO. 3, MAY/JUNE 2003. (pp.642-658)
Y. Li, P. Ning, X. S. Wang & S. Jajodia. (2001). Discovering Calendar-based Temporal Association Rules. Center for Secure Information Systems, George Mason University, Fairfax, VA 22030, USA. |