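Distributed frequent-pattern mining algorithm based on adaptive hash chains
Cite this article:YE Fei-yue. Distributed frequent-pattern mining algorithm based on adaptive hash chains[J]. Systems Engineering and Electronics (系统工程与电子技术), 2005, 27(3): 560-564
Author:YE Fei-yue (叶飞跃)
Affiliation:College of Information Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, Jiangsu 210016; Department of Computer Science and Technology, Jiangsu Teachers University of Technology, Changzhou, Jiangsu 213001
Funding:Natural Science Research Program of Jiangsu Higher Education Institutions (04KJB46003)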
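Abstract:For distributed systems, a frequent-pattern mining algorithm based on an adaptive hash chain structure is proposed. The algorithm first generates the local frequent 1-itemsets at each site and then the global frequent 1-itemsets. From the global frequent 1-itemsets it derives a projected database at each site. Each site then scans the transactions in its projected database and builds a hash chain structure whose size depends on the memory available at that site. Mining the hash chain structures of all sites yields the global frequent itemsets. The basic steps and the mining algorithm are given. The study shows that the algorithm is both efficient and highly adaptable.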

Keywords:data mining  frequent pattern  distributed  adaptive  hash chain
Article ID:1001-506X(2005)03-560-05
Revised:2003-12-16

Distributed algorithm for mining frequent pattern based on adaptive hash chain structure
YE Fei-yue. Distributed algorithm for mining frequent pattern based on adaptive hash chain structure[J]. Systems Engineering and Electronics, 2005, 27(3): 560-564
Authors:YE Fei-yue
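Affiliation:College of Information Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, Jiangsu 210016; Department of Computer Science and Technology, Jiangsu Teachers University of Technology, Changzhou, Jiangsu 213001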
Abstract:An algorithm for mining frequent patterns in a distributed system is proposed, based on adaptive hash chain structures. The algorithm first generates the local frequent 1-itemsets at every site and then the global frequent 1-itemsets, from which a projection database is formed at each site. Each site then scans the transactions in its projection database and builds a hash chain structure sized to fit its available memory. Mining these structures yields the global frequent itemsets. The basic process and the mining algorithm are presented. The study shows that the algorithm is more efficient and more adaptable than existing approaches.
Keywords:data mining  frequent pattern  distributed  adaptive  hash chain
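
The first steps named in the abstract (per-site 1-itemset counts, global frequent 1-itemsets, per-site projection) can be illustrated with a minimal Python sketch. It is not the paper's implementation: the function names, the absolute `min_support` threshold, the toy data, and the simplification that sites exchange full local counts are all assumptions made for illustration. The adaptive hash chain structure itself is not specified in the abstract and is deliberately left out.

```python
from collections import Counter


def local_item_counts(transactions):
    """Count, for one site, how many transactions contain each item."""
    counts = Counter()
    for t in transactions:
        counts.update(set(t))  # set() so an item counts once per transaction
    return counts


def global_frequent_items(sites, min_support):
    """Sum the per-site counts and keep items meeting min_support (assumed absolute)."""
    total = Counter()
    for transactions in sites:
        total.update(local_item_counts(transactions))
    return {item: n for item, n in total.items() if n >= min_support}


def project(transactions, frequent):
    """Drop globally infrequent items; order the rest by descending global count."""
    projected = []
    for t in transactions:
        kept = sorted(
            (i for i in set(t) if i in frequent),
            key=lambda i: (-frequent[i], i),
        )
        if kept:  # transactions with no frequent item vanish from the projection
            projected.append(kept)
    return projected


# Hypothetical toy data: two sites, each holding a list of transactions.
sites = [
    [["a", "b", "c"], ["a", "c"], ["d"]],
    [["a", "b"], ["b", "c", "e"]],
]
frequent = global_frequent_items(sites, min_support=3)
projections = [project(s, frequent) for s in sites]
```

In this toy run the items `a`, `b` and `c` each occur in three transactions across the two sites, so they form the global frequent 1-itemsets, while `d` and `e` are dropped from every projected transaction. Each site would then build its in-memory structure from its own entry in `projections`.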