快速频繁序列模式挖掘算法 Fast Frequent Sequential Pattern Mining Algorithm期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

快速频繁序列模式挖掘算法

引用本文：	管恩政,常晓宇,王喆,周春光.快速频繁序列模式挖掘算法[J].吉林大学学报(理学版),2005,43(6):768-772.

作者姓名：	管恩政常晓宇王喆周春光

作者单位：	(吉林大学计算机科学与技术学院, 长春 130012)

基金项目：	国家自然科学重点项目基金(批准号:60433020)，教育部“符号计算和知识工程重点实验室”资助项目基金

摘要：	为解决从数据库中挖掘长模式可能遇到较高的计算复杂度问题, 提出一种新的算法FFSPAN. 传统上，要判断一个序列是否频繁, 需要在原数据库中判断整个序列是否频繁；而算法FFSPAN是通过在序列数据库中寻找一个频繁项或一个频繁项集来代替寻找一个完整的频繁序列，而且FFSPAN算法每次扫描的数据库都是迅速减小的, 这使得算法在挖掘的序列模式越长时越有效. 在标准测试数据集上的实验结果表明， FFSPAN算法非常有效.
关键词：	序列模式长模式深度优先数据挖掘
文章编号：	1671-5489(2005)06-0768-05
收稿时间：	2005-03-07
修稿时间：	2005年3月7日
Fast Frequent Sequential Pattern Mining Algorithm

GUAN En-zheng,CHANG Xiao-yu,WANG Zhe,ZHOU Chun-guang.Fast Frequent Sequential Pattern Mining Algorithm[J].Journal of Jilin University: Sci Ed,2005,43(6):768-772.

Authors:	GUAN En-zheng CHANG Xiao-yu WANG Zhe ZHOU Chun-guang

Institution:	(College of Computer Science and Technology, Jilin University, Changchun 130012, China)

Abstract:	A novel algorithm FFSPAN(fast frequent sequential pattern mining algorithm) is proposed to solve the problem that the computational complexity may become very high when mining long patterns in a(sequence) database.Traditionally,to judge whether a sub-sequence is frequent in a database,one need to compare the whole sub-sequence with every sequence in the original database,however the algorithm FFSPAN succeeds in solving the problem that in a sequence database,instead of searching a whole frequent sequence,we only need to search a frequent item or a frequent itemset.Moreover,the databases scanned via FFSPAN keep shrinking,which makes the algorithm more efficient when the sequential patterns are longer.Experiments on standard test data show that FFSPAN is very effective.

Keywords:	sequential pattern long pattern depth-first data mining
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《吉林大学学报(理学版)》浏览原始摘要信息
	点击此处可从《吉林大学学报(理学版)》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏