การเปรียบเทียบประสิทธิภาพอัลกอริทึมสำหรับค้นหาไอเท็มเซตที่ปรากฏร่วมกันบ่อย

ทวีศักดิ์ คงตุก

PDF

Published: Mar 13, 2018

ทวีศักดิ์ คงตุก

มหาวิทยาลัยเทคโนโลยีราชมงคลสุวรรณภูมิ

Abstract

This research aimed to 1) study and compare algorithm used for searching Frequent Itemsets, which is one of the process of association rule mining; 2) explore procedure of algorithm method and techniques used for searching of frequent Itemsets; and 3) summarize which algorithm is the best fit with each type of data and which one has the best performance of executing time or with the least memory.

According to the findings of the study, they can be concluded that (1) each algorithm used for searching Frequent Itemsets has different advantages and disadvantages. Thus, each algorithm will be used to analyze different type of dataset. (2) The fastest algorithm for large dataset with high density were FP-Growth, Apriori and PrePost+. (3) The algorithm that consumes the shortest time for large dataset with low density was LCMFreq. (4) The algorithm that consumes the shortest time for small dataset with low density was LCMFreq. And (5) the algorithm that consumes the shortest time for small dataset with high density were PrePost+ and LCMFreq.

How to Cite

คงตุก ท. (2018). A Comparison of Frequent Itemsets Mining Algorithms. Journal of Technology Management Rajabhat Maha Sarakham University, 4(1), 34–42. retrieved from https://ph02.tci-thaijo.org/index.php/itm-journal/article/view/115234

Issue

Vol. 4 No. 1 (2017): January-June

Section

บทความวิจัย

References

[1] R. Agrawal, and R. Srikant. (1994) . Fast algorithms for mining association rules. In J.B. Bocca, Proceedings of the 20th International Confrerence on Very Large Data
Bases (VLDB’94), Santiago dc Chile, Morgan Kaufmann.
[2] M.J.Zaki, and K.Gouda. (2000). Generating non-redundant association rules. In 6th ACM SIGKDD Int’l Conf. Knoledge Discovery and Data Mining, August 2000.
[3] Deng Z. H., Lv S. L. (2014). Fast mining frequent itemsets using Nodesets. Expert systems with Applications, No.41(2014), 4505-4512.
[4] J.Han, J. Pei, and Y. Yin (2000). Mining Frequent Patterns Without Candidate Generation. Proc. ACM-SIGMOD Int’l Conf, Management of Data. pp. 1-12, May 2000.
[5] J. Pei, et.al. (2007). H-Mine: Fast and space-preserving frequent pattern mining in large databases. IIE transaction (2007) 39, 593-605.
[6] T. Uno, et.al. (2004). LCM ver. 2: Efficient mining algorithms for Frequent/Closed/Maximal Itemsets. National Institute of informatics 2-1-2 Hototsubashi, Chiyoda-ku,
Tokyo, Japan.
[7] Deng Z. H., and Lv S. L. (2015). PrePost+: An efficient N-lists based algorithm for mining frequent itemsets via Children-Parent Equivalence pruning. Expert systems with
Applications, No.42(2015), 5424-5432.
[8] C. Borgelt. (2005). Keeping Things Simple: Finding Frequent Item Sets by Recursive Elimination. University of Magdeburg, Germany.
[9] M.J.Zaki, and K.Gouda. (2001). Fast Vertical Mining Using Diffsets. Rensselaer Polytechnic Institute, Troy, NY, USA. Kyushu University, Fukuoka Japan.
[10] M.J. Zaki, and W.M. JR. (2014). Data Mining and Analysis Fundamental Concepts and Algorithms. Cambridge University Press. New York, NY 10013-2473, USA.

Article Sidebar

Main Article Content

Abstract

Article Details

References