首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于图的四叉链表存储结构的最大频繁项集挖掘算法
引用本文:王春华,宁慧,邹韵,郭江鸿.基于图的四叉链表存储结构的最大频繁项集挖掘算法[J].应用科技,2013,40(1):76-79.
作者姓名:王春华  宁慧  邹韵  郭江鸿
作者单位:1. 哈尔滨工程大学计算机科学与技术学院,黑龙江哈尔滨,150001
2. 哈尔滨工程大学经济管理学院,黑龙江哈尔滨,150001
基金项目:国家自然科学基金资助项目(60975071);黑龙江省教育厅科学技术研究资助项目(12513055)
摘    要:虽然已有的最大频繁项集挖掘算法在结构和技术上已经做了很多改进,但还是存在挖掘速度慢、效率低的缺点,在此提出了图的四叉链表存储结构和基于该存储结构的最大频繁项集挖掘算法,该结构具有一次生成多次使用,不必耗用额外的存储空间等特点,基于该存储结构的最大频繁项集挖掘算法充分利用了该存储结构的特点以及频繁扩展集的性质,有效地减少了冗余候选集的生成,降低了串的冗余存储,将串集合间的比较转化为整型数组的比较,从而使得它比已有的最大频繁项集挖掘算法在挖掘效率上有了明显的提高,最后通过实验证明了该算法较其他已有算法效率有了较大的提高.

关 键 词:四叉链表  频繁项集  存储结构  挖掘算法

The maximum frequent item set mining algorithm based on the four-fork linked storage structure
WANG Chunhua , NING Hui , ZHOU Yun , GUO Jianghong.The maximum frequent item set mining algorithm based on the four-fork linked storage structure[J].Applied Science and Technology,2013,40(1):76-79.
Authors:WANG Chunhua  NING Hui  ZHOU Yun  GUO Jianghong
Institution:1.College of Computer Science and Technology,Harbin Engineering University,Harbin 150001,China 2.College of Economics and Management,Harbin Engineering University,Harbin 150001,China
Abstract:Although a variety of improvements have been done on the existing maximum frequent item mining algorithms in terms of structures and technologies, they still suffer from low efficiency. Given these shortcomings of the existing algorithms, we propose the quad-pointer linked list structure for graph and the maximum frequent item mining algorithm based on this structure. This structure possesses once-created-multiple-used property, without the need for extra storage space. This structure property and the characteristics of the frequent extension set are utilized fully by our algorithm, which effectively reduce the redundancy for the candidate generation and storage. Besides, we convert the comparison between strings into the comparison between integer arrays, which improves the efficiency greatly for the maximum frequent item mining algorithm. Through the experiments, the efficiency of our algorithm is proved to outperform the other existing algorithms.
Keywords:four-fork link  frequent item set  storage structure  mining algorithm
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号