
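New mining algorithm for frequent closed sequential pattern based on subsume index
Cite this article: LI Jin-hong, YANG Bing-ru, SONG Wei, HOU Wei. New mining algorithm for frequent closed sequential pattern based on subsume index[J]. System Engineering and Electronics, 2009, 31(10): 2485-2488
Authors: LI Jin-hong, YANG Bing-ru, SONG Wei, HOU Wei
Affiliations: 1. School of Information Engineering, University of Science and Technology Beijing, Beijing 100083; 2. College of Information Engineering, North China University of Technology, Beijing 100144
Funding: National Natural Science Foundation of China; Funding Project for Academic Human Resources Development in Institutions of Higher Learning under the Jurisdiction of Beijing Municipality
Abstract: The frequent closed sequential patterns uniquely determine the complete set of frequent sequential patterns while being far fewer in number. Traditional closed sequential pattern mining algorithms extend a sequence with every frequent item, which tends to generate a large number of non-closed sequences. To address this problem, a new algorithm for mining frequent closed sequential patterns based on a subsume index is proposed; its main idea is to extend sequences with closed itemsets only, which greatly reduces the generation of non-closed sequences. First, it is proved that a closed sequential pattern can only be composed of closed itemsets. Second, it is shown how the subsume index can be used to find closed itemsets quickly. Finally, a new depth-first algorithm for mining frequent closed sequential patterns is given. Experimental results show that the algorithm is highly efficient.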

Keywords: data mining; frequent closed itemset; frequent closed sequential pattern; subsume index
Received: 2008-05-26
Revised: 2008-10-20
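
The abstract's opening claim, that the closed patterns are a much smaller set than the full collection of frequent sequential patterns, can be illustrated on a toy database. The following is a minimal brute-force sketch, not the paper's algorithm: the database `db`, the threshold `min_sup`, and all function names are hypothetical, and a pattern is taken to be closed when no frequent super-pattern has the same support (the usual definition, assumed here).

```python
def seq(*elements):
    """Build a sequence of itemsets, e.g. seq("ab", "c") is <{a, b}, {c}>."""
    return tuple(frozenset(e) for e in elements)

def is_subsequence(pattern, sequence):
    """True if the itemsets of `pattern` embed, in order, into distinct itemsets of `sequence`."""
    remaining = iter(sequence)
    return all(any(p <= s for s in remaining) for p in pattern)

def support(pattern, db):
    """Number of database sequences that contain `pattern`."""
    return sum(is_subsequence(pattern, s) for s in db)

def frequent_patterns(db, min_sup):
    """Depth-first enumeration of all frequent sequential patterns (naive support counting)."""
    items = sorted({i for s in db for elem in s for i in elem})
    found = {}

    def grow(pattern):
        for item in items:
            # Sequence extension: append a new single-item itemset.
            candidates = [pattern + (frozenset([item]),)]
            # Itemset extension: add a larger item to the last itemset.
            if pattern and item > max(pattern[-1]):
                candidates.append(pattern[:-1] + (pattern[-1] | {item},))
            for cand in candidates:
                sup = support(cand, db)
                if sup >= min_sup and cand not in found:
                    found[cand] = sup
                    grow(cand)

    grow(())
    return found

def closed_patterns(frequent):
    """Keep the patterns that have no proper super-pattern with the same support."""
    return {
        p: sup
        for p, sup in frequent.items()
        if not any(sup == t and p != q and is_subsequence(p, q)
                   for q, t in frequent.items())
    }

# Hypothetical toy database of three sequences.
db = [seq("ab", "c"), seq("ab", "c"), seq("a", "c")]
frequent = frequent_patterns(db, min_sup=2)
closed = closed_patterns(frequent)
print(len(frequent), len(closed))  # 7 2
```

On this toy input the seven frequent patterns collapse to two closed ones, `<{a}, {c}>` with support 3 and `<{a, b}, {c}>` with support 2. Every other frequent pattern is contained in one of these, and its support is the largest support among the closed patterns that contain it.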

New mining algorithm for frequent closed sequential pattern based on subsume index
LI Jin-hong, YANG Bing-ru, SONG Wei, HOU Wei. New mining algorithm for frequent closed sequential pattern based on subsume index[J]. System Engineering and Electronics, 2009, 31(10): 2485-2488
Authors: LI Jin-hong, YANG Bing-ru, SONG Wei, HOU Wei
Affiliation: 1. School of Information Engineering, Univ. of Science and Technology Beijing, Beijing 100083, China; 2. Coll. of Information Engineering, North China Univ. of Technology, Beijing 100144, China
Abstract: The set of frequent closed sequential patterns determines exactly the complete set of all frequent sequential patterns and is usually much smaller than the latter. Traditional closed sequential pattern mining algorithms extend a frequent sequence with every frequent single item, which generates a large number of non-closed sequences. To solve this problem, a new mining algorithm for frequent closed sequential patterns based on a subsume index is proposed. The main idea of the proposed algorithm is to extend a frequent sequence with closed itemsets only, so that the generation of non-closed sequences is greatly reduced. First, it is proved that a closed sequential pattern is composed only of closed itemsets. Then, it is explained how closed itemsets can be discovered efficiently by using a subsume index. Finally, a depth-first algorithm for mining frequent closed sequential patterns is presented. The experimental results show that the proposed algorithm is efficient.
Keywords: data mining; frequent closed itemset; frequent closed sequential pattern; subsume index
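
The abstract does not define the subsume index. One plausible reading, assumed here and not confirmed by this page, is that the subsume index of an item i lists every other item that occurs in all transactions containing i. The sketch below builds such an index for a hypothetical transaction list; `tidsets`, `subsume_index`, and `transactions` are illustrative names, not the authors' code.

```python
def tidsets(transactions):
    """Map each item to the set of transaction ids in which it occurs."""
    index = {}
    for tid, items in enumerate(transactions):
        for item in items:
            index.setdefault(item, set()).add(tid)
    return index

def subsume_index(transactions):
    """Assumed definition: subsume(i) = {j != i : tidset(i) is a subset of tidset(j)}."""
    g = tidsets(transactions)
    return {i: {j for j in g if j != i and g[i] <= g[j]} for i in g}

def closure_of_item(item, index):
    """Closure of the single-item set {item}: the item plus everything it is subsumed by."""
    return frozenset({item} | index[item])

# Hypothetical toy transactions.
transactions = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"a"}]
index = subsume_index(transactions)
print(index["a"], index["b"], index["c"])  # set() {'a'} {'a'}
print(sorted(closure_of_item("b", index)))  # ['a', 'b']
```

Under this assumption, an item with a non-empty subsume entry is not a closed itemset on its own: here `{b}` always co-occurs with `a`, so its closure is `{a, b}`. This is the kind of lookup that lets closed itemsets be identified without rescanning the transactions for each candidate.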
This article is indexed in Wanfang Data and other databases.