首页 | 本学科首页   官方微博 | 高级检索  
     

频繁项集的简洁表示方法研究
引用本文:王自强,冯博琴. 频繁项集的简洁表示方法研究[J]. 系统工程理论与实践, 2004, 24(7): 74-81. DOI: 10.12011/1000-6788(2004)7-74
作者姓名:王自强  冯博琴
作者单位:西安交通大学计算机科学系
摘    要:提出了通过逻辑蕴涵的演绎规则来构建频繁项集的简洁表示,而不是挖掘所有的频繁项,来降低频繁项集中的冗余性,并利用演绎规则来构建所有频繁项集的简洁表示,从而减少存取数据库和对项集进行计数的次数,最终提供一个更加紧凑的频繁项集输出.最后,通过试验说明了该方法的有效性.

关 键 词:数据挖掘  频繁项集  演绎规则  简洁表示   
文章编号:1000-6788(2004)07-0074-08
修稿时间:2003-09-26

Research on Methods of Concise Representation for Frequent Itemsets
WANG Zi-qiang,FENG Bo-qin. Research on Methods of Concise Representation for Frequent Itemsets[J]. Systems Engineering —Theory & Practice, 2004, 24(7): 74-81. DOI: 10.12011/1000-6788(2004)7-74
Authors:WANG Zi-qiang  FENG Bo-qin
Affiliation:Department of Computer Science,Xi'an Jiaotong University
Abstract:In this paper, a new method for constructing concise representation of frequent itemsets based on deduction rule is proposed. Lower and upper bounds on the frequency of a candidate are calculated with deduction rules, based on the frequencies of all its subsets, the rules derive best possible bounds on the frequency of an itemset. In this way, the number of candidate itemsets to count can be avoided with these bounds. Therefore, it is possible to generate a summary of the frequent itemsets that only contains the non-derivable sets, then the collection of none-derivable sets forms a concise representation of the frequent itemset. The experimental results show the effectiveness and efficiency of this method on a real-life dataset.
Keywords:data mining  frequent itemset  deductive rule  concise representation
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《系统工程理论与实践》浏览原始摘要信息
点击此处可从《系统工程理论与实践》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号