
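A Sequential Pattern Mining Method Using Concept Similarity-Degree Reduction
Cite this article: HU Xue-gang, ZHANG Jing, ZHANG Yu-hong, TAN Zhe. A Sequential Pattern Mining Method Using Concept Similarity-Degree Reduction[J]. Journal of Yantai University (Natural Science and Engineering Edition), 2009, 22(3): 202-205
Authors: HU Xue-gang  ZHANG Jing  ZHANG Yu-hong  TAN Zhe
Affiliation: School of Computer and Information, Hefei University of Technology, Hefei, Anhui 230009
Funding: Supported by the Natural Science Foundation of Anhui Province
Abstract: Traditional sequential pattern mining algorithms can find all frequent sequences, but on massive data the result set may be too large to interpret. Sequential pattern mining based on concept lattices effectively reduces the number of intermediate sequences generated and offers some advantage in running time, and the structure of a concept lattice also lends itself to reduction. This paper proposes a definition of approximate concepts: a lattice is first built over the transaction database, and concepts that satisfy the approximation condition are then reduced, which lowers the number of frequent 1-sequences and, in turn, the total number of frequent sequences. Experiments show that, when a certain amount of error is tolerated, the method improves both the understandability of the mining results and the mining efficiency.

Keywords: data mining  frequent sequences  concept lattice  concept reduction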

Sequential Patterns Mining Using Concept Reduction for Similitude Degree
HU Xue-gang, ZHANG Jing, ZHANG Yu-hong, TAN Zhe. Sequential Patterns Mining Using Concept Reduction for Similitude Degree[J]. Journal of Yantai University (Natural Science and Engineering Edition), 2009, 22(3): 202-205
Authors: HU Xue-gang  ZHANG Jing  ZHANG Yu-hong  TAN Zhe
Affiliation: School of Computer and Information, Hefei University of Technology, Hefei 230009, China
Abstract: Most sequential pattern mining algorithms can find all frequent sequences; however, when the data set is huge, the mining results may be too numerous to understand. Sequential pattern mining based on the concept lattice effectively reduces the number of intermediate sequences generated and therefore has some advantage in running time. The structure of the concept lattice is also well suited to reduction. In this paper, a definition of approximate concepts is proposed. The method first constructs a concept lattice from the transaction database and then reduces the concepts that satisfy the approximation condition. As a result, the number of frequent 1-sequences decreases, and with it the total number of frequent sequences. The experimental results show that, when a certain amount of error is allowed, the approach improves both the understandability of the mining results and the mining efficiency.
Keywords: data mining  frequent sequences  concept lattice  concept reduction
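
The two steps named in the abstract (build a concept lattice over the transaction database, then reduce approximate concepts) can be illustrated with a small sketch. This record does not give the paper's definition of approximate concepts, so the sketch substitutes an assumed stand-in: two concepts count as approximate when the Jaccard similarity of their extents reaches a threshold. The function names, the toy transactions, and the threshold value are all hypothetical and are not taken from the paper.

```python
def build_concepts(transactions):
    """Enumerate all formal concepts (extent, intent) of a small context.

    transactions maps a transaction id to its set of items.
    Intents are obtained by closing the item sets under intersection.
    """
    all_items = frozenset().union(*transactions.values())
    intents = {all_items}  # intent of the (possibly empty) bottom extent
    for items in transactions.values():
        row = frozenset(items)
        intents |= {row & intent for intent in intents}
    concepts = []
    for intent in intents:
        extent = frozenset(
            tid for tid, items in transactions.items() if intent <= set(items)
        )
        concepts.append((extent, intent))
    return concepts


def jaccard(a, b):
    """Jaccard similarity of two sets (assumed stand-in similarity measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def reduce_concepts(concepts, threshold):
    """Greedy reduction: visit concepts from largest extent to smallest and
    drop any concept whose extent is approximate to one already kept.
    This is an illustrative rule, not the paper's reduction procedure."""
    kept = []
    ordered = sorted(concepts, key=lambda c: (-len(c[0]), sorted(c[1])))
    for extent, intent in ordered:
        if all(jaccard(extent, kept_extent) < threshold for kept_extent, _ in kept):
            kept.append((extent, intent))
    return kept


# Hypothetical toy transaction database.
transactions = {
    "t1": {"a", "b"},
    "t2": {"a", "b", "c"},
    "t3": {"a", "c"},
    "t4": {"b"},
}

concepts = build_concepts(transactions)
reduced = reduce_concepts(concepts, threshold=0.75)
print(len(concepts), "concepts before reduction,", len(reduced), "after")
for extent, intent in reduced:
    print(sorted(intent), "supported by", sorted(extent))
```

Under this assumed rule, a higher threshold keeps more concepts and a lower one merges more aggressively, which mirrors the trade-off between error and result size that the abstract describes. Turning the surviving intents into frequent 1-sequences and growing longer sequences from them is not shown here.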
This article is indexed in databases including VIP (维普) and Wanfang Data (万方数据).