首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于FP-tree最大频繁模式超集挖掘算法
引用本文:王君,任永功.基于FP-tree最大频繁模式超集挖掘算法[J].郑州大学学报(自然科学版),2011(1):33-36,41.
作者姓名:王君  任永功
作者单位:辽宁师范大学计算机与信息技术学院,辽宁大连116000
基金项目:辽宁省科技计划项目 编号2008216014; 辽宁省教育厅高等学校科研基金资助项目 编号L2010229; 大连市优秀青年科技人才基金资助项目 编号2008J23JH026
摘    要:数据挖掘应用中的最大频繁项集挖掘算法大多存在候选项目集冗余问题,造成时间和空间的浪费.针对此问题,通过构造条件FP-tree,对不符合要求的项目进行剪除并对MFIT算法进行改进,提出一种基于FP-tree的最大频繁模式超集挖掘算法.此算法无需产生大量的候选集,同时减少数据集扫描次数,降低数据库遍历时间,提高算法效率.实验证明,此算法在降低候选项目集冗余度的同时有效减少了算法运行时间.

关 键 词:数据挖掘  最大频繁项目集  条件频繁模式树  超集检测

Maximal Frequent Pattern Superset Mining Algorithm Based on FP-tree
WANG Jun,REN Yong-gong.Maximal Frequent Pattern Superset Mining Algorithm Based on FP-tree[J].Journal of Zhengzhou University (Natural Science),2011(1):33-36,41.
Authors:WANG Jun  REN Yong-gong
Institution:(School of Computer and Information Technology,Liaoning Normal University,Dalian 116000,China)
Abstract:The main problem existing in maximal frequent itemsets mining algorithms of data mining applications was candidate set redundancy,waste of time and space.The constructed conditioning FP-tree would cut off the items which did not meet the requirements and improve MFIT algorithm.The conditioning FP-tree was proposed as the largest frequent pattern superset mining algorithm based on FP-tree.This algorithm did not produce numerous candidate sets,at the mean time,reduced the frequency of scan data set and the database traversal,improving efficiency of the algorithm.Experiments results showed that the algorithm reduced redundant candidate itemsets and effectively decreased the algorithm running time.
Keywords:data mining  maximal frequent itemsets  conditional FP-tree  superset checking
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号