An efficient algorithm for mining closed itemsets |
| |
Authors: | Liu Jun-qiang Pan Yun-he |
| |
Affiliation: | Institute of Artificial Intelligence, Zhejiang University; Hangzhou University of Commerce, Hangzhou 310035, China. liujunq@mail.hz.zj.cn |
| |
Abstract: | This paper presents a new efficient algorithm for mining frequent closed itemsets. It enumerates the closed set of frequent itemsets by using a novel compound frequent itemset tree that facilitates fast growth and efficient pruning of search space. It also employs a hybrid approach that adapts search strategies, representations of projected transaction subsets, and projecting methods to the characteristics of the dataset. Efficient local pruning, global subsumption checking, and fast hashing methods are detailed in this paper. The principle that balances the overhead of search space growth and pruning is also discussed. Extensive experimental evaluations on real world and artificial datasets showed that our algorithm outperforms CHARM by a factor of five and is one to three orders of magnitude more efficient than CLOSET and MAFIA. |
| |
Keywords: | |
本文献已被 维普 PubMed 等数据库收录! |
|