首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到18条相似文献,搜索用时 218 毫秒
1.
基于PSO的加权关联规则挖掘算法   总被引:1,自引:0,他引:1  
简要描述了加权关联规则问题及离散粒子群优化算法,提出了一种基于粒子群优化(PSO)算法的加权关联规则挖掘算法(PSO-WMAR).实验证明,本算法运行时间更省,产生的规则数更少且更有效.该算法具有以下特点:1)把关联规则挖掘的两个阶段结合在一起,无须先挖掘出全部频繁项目集然后再提取规则;2)只需要扫描一次数据库;3)把兴趣度引入适合度函数之中,挖掘出的规则数量更少、更有效;4)求加权频繁项目集无须查找所有候选加权频繁项目集,或者求频繁项目集的高序子集或非频繁项目集的低序超集.  相似文献   

2.
给出了一个基于约束的关联规则挖掘算法,首先依赖加权支持度产生频繁项目集,然后利用兴趣度产生关联规则,并对过滤掉的频繁项目集进一步分析发现包含负项集的关联规则。  相似文献   

3.
针对仅有的挖掘算法不能较好地解决负关联规则的候选集数量爆炸问题,为满足用户的实际需求,提出带约束负关联规则概念,建立带约束负关联规则挖掘算法CNARM.同时,在挖掘过程中,利用最大频繁模式的性质来生成候选集,通过限制负关联规则中的前后件项目个数和利用负关联规则的性质来缩小候选集的规模.理论分析和实验结果表明本文提出的算法是有效可行的,具有较好的挖掘效率.  相似文献   

4.
提出了一种基于二进制编码的优化关联规则挖掘算法,该算法是按项目支持数的升序从高到低地编制二进制位,然后将事务转换成数字事务,通过构建候选数字事务区间来搜索频繁数字事务,最后产生关联规则.该算法的原理简单,减少了冗余候选项和扫描次数;实验结果表明该算法比同类挖掘算法更快速而有效.  相似文献   

5.
关联规则算法是数据挖掘中的核心技术,本文给出了数据库中挖掘关系规则的一种新算法,该算法通过二次扫描,第一次将可能出现的频繁项目集加入到ISC中,第二次扫描采用逐步求精算法将频繁项目集加到项目集中,减少了数据库的扫描次数.  相似文献   

6.
张青 《河南科学》2015,(1):65-68
Apriori算法是关联规则挖掘的经典算法,该算法在处理规模巨大的候选项目集时存在耗时长和效率低的问题,提出了采用分割法对数据进行分片的优化算法.实验证明该算法不仅能减少数据挖掘对系统资源的占用,而且解决了数据库中数据分割下局部频繁项目序列集产生和全局频繁项目序列集的转换问题.  相似文献   

7.
 在移动计算中挖掘满足用户需求的长频繁邻近类别集时,为了避免产生冗余候选项和减少重复计算量,提出一种基于幂集数递减的约束频繁邻近类别集挖掘算法,其能够提取包含约束条件的长频繁邻近类别集;该算法用幂集数递减序列来产生候选频繁邻近类别集,有效地删除了不满足用户需求的冗余候选项和减少了重复扫描空间实例的计算量.实验表明在挖掘满足用户需求的长频繁邻近类别集时,该算法比现有算法更快速.  相似文献   

8.
关联规则算法是数据挖掘中的核心技术 ,本文给出了数据库中挖掘关系规则的一种新算法 ,该算法通过二次扫描 ,第一次将可能出现的频繁项目集加入到ISC中 ,第二次扫描采用逐步求精算法将频繁项目集加到项目集中 ,减少了数据库的扫描次数  相似文献   

9.
研究分布式环境下约束性关联规则更新问题,包括数据库中事务增加和删除2种情况.引入向导集的概念,提出基于全局局部模式的约束性关联规则增量式更新算法DUCAR,其中包括局部约束性频繁项目集更新算法ULFC和全局约束性频繁项目集更新算法UGFC.该算法充分利用原先的挖掘结果提高更新效率,首先从最高维的频繁n项目集进行更新,在更新过程中考虑约束条件,结合剪枝算法,生成较少数量的满足约束条件的候选项目集.将该算法用Java加以实现,采用多组数据对此算法的性能进行测试,并与其他算法作对比实验,实验结果表明,该算法是高效可行的.  相似文献   

10.
传统的基于支持度—置信度框架的关联规则挖掘方法可能会产生大量不相关的、甚至是误导的关联规则,同时也不能区分正负关联规则。在充分考虑用户感兴趣模式的基础上,采用一阶谓词逻辑作为用户感兴趣的背景知识表示技术,提出了一种基于背景知识的包含正负项目集的频繁模式树,给出了针对正负项目集的约束频繁模式树的构造算法NCFP-Construct,从而提高了关联规则挖掘的效率和针对性,实验结果显示该方法是有效的。  相似文献   

11.
Web日志挖掘   总被引:19,自引:1,他引:19  
提出了一种新颖的MBP算法,它利用关联规则挖掘发现的频繁项目集以加快速度,能找出所有满足阀值约束的频繁浏览路径,该算法是有很效的,同时,针对Web浏览和日志文件固有的模糊性和不确定性,还讲座了Web面面的模糊聚类问题,最后,对发现的知识讨论了其在推荐系统及自适应Web站点中的应用并给出了相应算法。  相似文献   

12.
In data mining from transaction DB, the relationships between the attributes have been focused, but the relationships between the tuples have not been taken into account. In spatial database, there are relationships between the attributes and the tuples, and most of the associations occur between the tuples, such as adjacent, intersection, overlap and other topological relationships. So the tasks of spatial data association rules mining include mining the relationships between attributes of spatial objects, which are called as vertical direction DM, and the relationships between the tuples, which are called as horizontal direction DM. This paper analyzes the storage models of spatial data, uses for reference the technologies of data mining in transaction DB, defines the spatial data association rule, including vertical direction association rule, horizontal direction association rule and twodirection association rule, discusses the measurement of spatial association rule interestingness, and puts forward the work flows of spatial association rule data mining. During twodirection spatial association rules mining, an algorithm is proposed to get nonspatial itemsets. By virtue of spatial analysis, the spatial relations were transferred into nonspatial associations and the nonspatial itemsets were gotten. Based on the nonspatial itemsets, the Apriori algorithm or other algorithms could be used to get the frequent itemsets and then the spatial association rules come into being. Using spatial DB, the spatial association rules were gotten to validate the algorithm, and the test results show that this algorithm is efficient and can mine the interesting spatial rules.  相似文献   

13.
In data mining from transaction DB, the relationships between the attributes have been focused, but the relationships between the tuples have not been taken into account. In spatial database, there are relationships between the attributes and the tuples, and most of the associations occur between the tuples, such as adjacent, intersection, overlap and other topological relationships. So the tasks of spatial data association rules mining include mining the relationships between attributes of spatial objects, which are called as vertical direction DM, and the relationships between the tuples, which are called as horizontal direction DM. This paper analyzes the storage models of spatial data, uses for reference the technologies of data mining in transaction DB, defines the spatial data association rule, including vertical direction association rule, horizontal direction association rule and two-direction association rule, discusses the measurement of spatial association rule interestingness, and puts forward the work flows of spatial association rule data mining. During two-direction spatial association rules mining, an algorithm is proposed to get non-spatial itemsets. By virtue of spatial analysis, the spatial relations were transferred into non-spatial associations and the non-spatial itemsets were gotten. Based on the non-spatial itemsets, the Apriori algorithm or other algorithms could be used to get the frequent itemsets and then the spatial association rules come into being. Using spatial DB, the spatial association rules were gotten to validate the algorithm, and the test results show that this algorithm is efficient and can mine the interesting spatial rules.  相似文献   

14.
传统方法实现过程复杂、历史复杂时态数据的片面性,导致其无法全面地描述时态数据;且相似性计算无法准确匹配具有动态性与复杂性的时态数据,造成提取精度低。为此,提出一种新的分布式多空间数据库复杂时态数据提取技术。设计动态RBF神经网络,对分布式多空间数据库中未知动态进行识别和建模;通过建模结果完成对复杂时态数据的描述。依据加权关联规则与时态关联规则对支持度和置信度的定义,获取T-FS-tree加权时态关联规则中支持度和置信度。将复杂时态数据描述序列、最小支持度、最小置信度作为输入,将加权时态关联规则作为输出,建立T-FS-tree加权时态关联规则挖掘算法。按照向量计算获取加权时态频繁1项集以及频繁2项集,依据获取的加权时态频繁项集建立初始频繁项集树;依据初始频繁项集树获取全部时态频繁项集;通过获取的频繁项集产生加权时态关联规则。从所有关联规则中选择优先度高的规则,构建的复杂时态数据提取器,实现复杂时态数据提取。实验结果表明,所提方法复杂性低,提取结果更加全面、可靠,有很高的准确性。  相似文献   

15.
在数据库中挖掘关联规则是数据挖掘领域的一个重要的研究课题,在应用中具有非常重要的意义.在分析Apriori算法和IUA算法经典关联规则挖掘算法的基础上,提出了一种基于最近挖掘结果的更新算法称为IIUA.IIUA算法吸收了Apriori算法和IUA算法的优点,在改变最小支持度和基于最近挖掘结果的条件下,从生成尽可能少的候选项集考虑,得到完整的新频繁项集,从而提高算法的效率.  相似文献   

16.
针对传统数据挖掘技术的劣势,提出一种以利润为基础的约束关联规则挖掘算法.在使用关联规则进行数据挖掘之前,算法按照商品利润的权重信息对购物篮中的原始商品交易信息实施预处理,可以使后续的数据关联规则挖掘更加的精确可靠,提升数据挖掘的效果.结果表明:基于利润的约束关联规则挖掘算法对数据库的原始数据实施了利润约束修正,增加了利润加权阈值,可有效提升数据挖掘算法的知识挖掘性能.  相似文献   

17.
Constraint pushing techniques have been developed for mining frequent patterns and association rules. How ever, multiple constraints cannot be handled with existing techniques in frequent pattern mining. In this paper, a new algorithm MCFMC (mining complete set of frequent itemsets with multiple constraints) is introduced. The algorithm takes advantage of the fact that a convertible constraint can be pushed into mining algorithm to reduce mining research spaces. By using a sample database, the algorithm develops techniques which select an optimal method based on a sample database to convert multiple constraints into multiple convert ible constraints, disjoined by conjunction and/or, and then partition these constraints into two parts. One part is pushed deep inside the mining process to reduce the research spaces for frequent itemsets, the other part that cannot be pushed in algorithm is used to filter the complete set of frequent itemsets and get the final result. Results from our detailed experi ment show the feasibility and effectiveness of the algorithm.  相似文献   

18.
在关联规则挖掘中,大量的数据是多维的,且带有时态特性,所以往往需要在时态约束的前提下挖掘多维关联规则.本文从一个实际问题出发,在单维Apriori算法和已有的工作基础上,提出了一种新的多维时态关联规则挖掘算法,并与类似算法进行了比较.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号