A New Hybrid Algorithm for Association Rule Mining |
| |
Authors: | ZHANG Min-cong YAN Cun-liang ZHU Kai-yu |
| |
Affiliation: | National Die and Mold CAD Engineering Research Center, Shanghai Jiaotong University, Shanghai 200030, China |
| |
Abstract: | HA (hashing array), a new algorithm, for mining frequent itemsets of large database is proposed. It employs a structure hash array, ItemArray ( ) to store the information of database and then uses it instead of database in later iteration. By this improvement, only twice scanning of the whole database is necessary, thereby the computational cost can be reduced significantly. To overcome the performance bottleneck of frequent 2-itemsets mining, a modified algorithm of HA, DHA (direct-addressing hashing and array) is proposed, which combines HA with direct-addressing hashing technique. The new hybrid algorithm, DHA, not only overcomes the performance bottleneck but also inherits the advantages of HA. Extensive simulations are conducted in this paper to evaluate the performance of the proposed new algorithm, and the results prove the new algorithm is more efficient and reasonable. |
| |
Keywords: | association rule data mining hashing database analysis |
本文献已被 维普 万方数据 等数据库收录! |
|