Similar literature
 20 similar documents found (search time: 46 ms)
1.
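An improved heuristic genetic algorithm for rough set attribute reduction   Total citations: 4, self-citations: 0, citations by others: 4
Attribute reduction is one of the key problems in knowledge discovery. To obtain the minimal relative reduct of the attributes in a decision table effectively, a heuristic genetic algorithm is proposed that improves performance by optimizing the initial population. First, a new operator is constructed that uses an attribute significance measure defined from the information-theoretic viewpoint as heuristic information, describing the influence of the selected attribute subset on the definitely classified subsets of the universe. Next, on this basis and in combination with a genetic algorithm, a number of optimized chromosomes are chosen as the initial population, which strengthens local search while preserving the algorithm's global optimization property. Finally, the algorithm is analyzed theoretically, and it is proved that the attribute subset selected by the new operator preserves the classification ability of the original attributes. Experimental analysis shows that the algorithm can effectively reduce the attributes of a decision table.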

2.
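Using rough set methods, this work analyzes different attribute classification methods in decision systems and the changes they cause in attribute significance and in the minimal relative reduct subsets. It seeks the internal factors through which attribute classification methods and attribute reduction results influence each other, gives an efficient attribute classification method and a strategy for reasonably determining the reduct subset, derives a software implementation algorithm corresponding to the strategy, and uses that algorithm to select relative reduct subsets. Experimental results show the effectiveness of the strategy and the algorithm.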

3.
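A knowledge reduction method combining mutual information and equivocation   Total citations: 3, self-citations: 0, citations by others: 3
A knowledge reduction method combining mutual information and equivocation is proposed. Following a modified mutual information criterion, a heuristic algorithm with an orthogonalization-like property is developed to find a reduct of the attribute set of a decision system. The method uses a bidirectional stepwise algorithm that can both add and delete attributes, overcoming a shortcoming of current forward-selection or backward-deletion knowledge reduction methods, in which attributes depend on one another or on the decision classes. It yields a more simplified decision attribute set while keeping classification accuracy unchanged. Finally, a simulation analysis of a simple example verifies the effectiveness of the proposed method.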

4.
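Because of the inherent uncertainty of data and limited observation conditions, much of the data in real-world problems takes the form of interval values. The study of interval-valued information tables under dominance relations is important for multi-attribute decision-making problems. Current attribute reduction methods for such systems are mainly the discernibility matrix method and incremental reduction based on mutual information, but the former is computationally inefficient and the latter does not use decision information. The paper investigates the properties of conditional entropy as an uncertainty measure in such systems, introduces the concept of attribute significance by comparing the change in the conditional entropy of the information system when different attributes are removed, and on this basis proposes a heuristic attribute reduction algorithm. Finally, comparative experiments verify that the algorithm has low redundancy and compare its reduction rate with that of heuristic reduction methods for ordered information systems based on rough entropy and on positive-region invariance.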
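The idea of scoring an attribute by how much the conditional entropy changes when it is removed can be illustrated with a minimal sketch. It rests on simplifying assumptions that are not in the abstract: a small categorical decision table with ordinary equivalence classes instead of interval values under a dominance relation, and the hypothetical helper names `conditional_entropy` and `significance`.

```python
from collections import Counter
from math import log2

def conditional_entropy(rows, cond_idx, dec_idx):
    """H(D | C) for a categorical decision table given as a list of tuples."""
    n = len(rows)
    # Count (condition-value tuple, decision) pairs and condition-value tuples.
    joint = Counter((tuple(r[i] for i in cond_idx), r[dec_idx]) for r in rows)
    cond = Counter(tuple(r[i] for i in cond_idx) for r in rows)
    return -sum(c / n * log2(c / cond[key]) for (key, _), c in joint.items())

def significance(rows, cond_idx, dec_idx, attr):
    """Increase in H(D | C) when `attr` is dropped from the condition set."""
    rest = [i for i in cond_idx if i != attr]
    return (conditional_entropy(rows, rest, dec_idx)
            - conditional_entropy(rows, cond_idx, dec_idx))

# Hypothetical toy table: columns 0-2 are condition attributes, column 3 is the decision.
table = [
    (0, 0, 1, "no"),
    (0, 1, 1, "yes"),
    (1, 0, 0, "no"),
    (1, 1, 0, "yes"),
]

scores = {a: significance(table, [0, 1, 2], 3, a) for a in [0, 1, 2]}
```

In this toy table only attribute 1 determines the decision, so it is the only attribute whose removal raises the conditional entropy; a heuristic reduction would keep it and could discard the other two.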

5.
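Research on attribute-oriented rough set data mining methods   Total citations: 5, self-citations: 2, citations by others: 3
The main idea of rough set theory is to mine and simplify knowledge through attribute reduction and decision rule reduction, using equivalence classes, while keeping classification ability unchanged. The reduction problem, however, is an NP problem and can only be solved with heuristic algorithms. To address this, heuristic algorithms for attribute reduction and decision rule reduction are proposed, forming an integrated mining algorithm based on rough set theory. Finally, an example shows that the integrated algorithm can discover good classification rules with high efficiency.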

6.
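To address the high time complexity of attribute reduction algorithms for incomplete decision systems, a forward sequential feature selection algorithm for incomplete decision systems is proposed, based on the principle that the classification ability of a decision system is preserved when the positive region is unchanged. Starting with an empty reduct set, the algorithm sorts the attributes in descending order of how strongly adding each one to the reduct set affects the positive region, uses sequential forward search to add the current best feature to the reduct set, and thereby determines the best feature subset. The algorithm is extended to neighborhood-based...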
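The forward search described above can be sketched as follows. This is only an illustration under stated assumptions: it uses a complete categorical decision table with ordinary equivalence classes, whereas the abstract targets incomplete systems, and the function names `positive_region_size` and `forward_select` are hypothetical.

```python
from collections import defaultdict

def positive_region_size(rows, attrs, dec_idx):
    """Count objects whose equivalence class under `attrs` has a single decision value."""
    classes = defaultdict(list)
    for r in rows:
        classes[tuple(r[i] for i in attrs)].append(r[dec_idx])
    return sum(len(ds) for ds in classes.values() if len(set(ds)) == 1)

def forward_select(rows, cond_idx, dec_idx):
    """Start from an empty reduct and greedily add the attribute that
    enlarges the positive region the most, until it matches the full set."""
    target = positive_region_size(rows, cond_idx, dec_idx)
    selected = []
    while positive_region_size(rows, selected, dec_idx) < target:
        remaining = [a for a in cond_idx if a not in selected]
        best = max(
            remaining,
            key=lambda a: positive_region_size(rows, selected + [a], dec_idx),
        )
        selected.append(best)
    return selected

# Hypothetical toy table: columns 0-2 are condition attributes, column 3 is the decision.
table = [
    (0, 0, 1, "no"),
    (0, 1, 1, "yes"),
    (1, 0, 0, "no"),
    (1, 1, 0, "yes"),
]

reduct = forward_select(table, [0, 1, 2], 3)
```

The loop always terminates, because once every condition attribute has been selected the positive region equals the target by construction.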

7.
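To remove redundant attributes from a system while preserving its classification ability, attribute reduction for continuous-valued distributed data is studied. A definition of neighborhood rough sets in continuous-valued distributed decision information systems is given, and the decomposability of positive-region computation in such systems is discussed. On the premise that the positive region of the distributed decision information system remains unchanged, the reducibility of attributes is examined and an attribute reduction algorithm for distributed continuous-valued decision information systems is proposed. To verify the algorithm's effectiveness, three groups of experiments were carried out on 7 data sets. The experiments apply the proposed algorithm to reduce the attributes of distributed data and then test classification using weighted ensembles. The results show that the algorithm effectively removes redundant attributes from continuous-valued distributed data, and that the ensemble classification ability of the reduced data differs little from, or is even higher than, that before reduction.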

8.
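Building on a heuristic algorithm based on discernibility power that handles noisy data effectively, the concept of the relative knowledge-quantity significance of an attribute is introduced. Using this significance as heuristic information, an attribute reduction algorithm is proposed, and an example demonstrates its effectiveness.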

9.
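A real-valued attribute reduction algorithm is studied using rough set theory. An attribute reduction algorithm for real-valued decision systems is given and analyzed on data sets from UCI. Experimental results show that the reduction method can select fewer attributes while maintaining or improving classification ability.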

10.
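A reduction algorithm for incomplete information systems based on decision support degree   Total citations: 1, self-citations: 0, citations by others: 1
A relative attribute reduction algorithm based on the support degree of decision attributes is proposed. Attribute significance in an incomplete decision table is defined by introducing the decision attribute support degree, which then serves as heuristic information for attribute selection; the algorithm's time complexity is polynomial. Finding the minimal relative reduct of a decision table is a typical NP-hard problem, and the algorithm reduces the complexity of the problem. An example illustrates that the algorithm can obtain the minimal relative reduct of an incomplete decision table.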

11.
The discovery of the prolific Ordovician Red River reservoirs in 1995 in southeastern Saskatchewan was the catalyst for extensive exploration activity, which resulted in the discovery of more than 15 new Red River pools. The best yields of Red River production to date have been from dolomite reservoirs. Understanding the processes of dolomitization is therefore crucial for predicting the connectivity, spatial distribution and heterogeneity of dolomite reservoirs. The Red River reservoirs in the Midale area consist of 3~4 thin dolomitized zones, with a total thickness of about 20 m, which occur at the top of the Yeoman Formation. Two types of replacement dolomite were recognized in the Red River reservoir: dolomitized burrow infills and dolomitized host matrix. The spatial distribution of dolomite suggests that burrowing organisms played an important role in facilitating fluid flow in the backfilled sediments. This resulted in penecontemporaneous dolomitization of burrow infills by normal seawater. The dolomite in the host matrix is interpreted as having formed at shallow burial from evaporitic seawater during precipitation of the Lake Almar anhydrite that immediately overlies the Yeoman Formation. However, the low δ18O values of dolomitized burrow infills (-5.9‰~-7.8‰, PDB) and matrix dolomites (-6.6‰~-8.1‰, avg. -7.4‰, PDB), compared with the estimated values for late Ordovician marine dolomite, could be attributed to modification and alteration of the dolomite at higher temperatures during deeper burial. This could also account for its 87Sr/86Sr ratios (0.7084~0.7088), which are higher than those suggested for late Ordovician seawater (0.7078~0.7080). The trace amounts of saddle dolomite cement in the Red River carbonates are probably related to "cannibalization" of earlier replacement dolomite during chemical compaction.

12.
A computer generator for randomly layered structures. YU Jia-shun 1,2, HE Zhen-hua 2 (1. The Institute of Geological and Nuclear Sciences, New Zealand; 2. State Key Laboratory of Oil and Gas Reservoir Geology and Exploitation, Chengdu University of Technology, China). Abstract: An algorithm is introd…

13.
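This paper describes the complete course of a paleomagnetic study of Cretaceous to Quaternary strata and rocks of Hainan Island and the adjacent continental margin. By analyzing variations in the declination and inclination of the remanent magnetization vectors in the rocks, the following model of the tectonic evolution of Hainan Island since the Cretaceous is proposed: in the early stage the island migrated southward with clockwise rotation, and in the late stage it moved northward with counterclockwise rotation. Combining geological and geophysical data from this and neighboring regions, the following interpretation of the tectonic terrane movement is offered: an early extensional process in the Beibu Gulf markedly stretched and thinned the crust there, forming the Beibu Gulf Basin and causing the early tectonic movement of Hainan Island, while the island's later tectonic movement was mainly influenced by seafloor spreading in the South China Sea. Clarifying the pattern of movement of the Hainan terrane is of theoretical and practical significance for understanding the formation and evolution of the Beibu Gulf oil and gas basin.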

14.
Various applications relevant to exciton dynamics, such as organic solar cells, large-area organic light-emitting diodes and thermoelectricity, operate under a temperature gradient. Potentially abnormal behavior of the exciton dynamics driven by the temperature difference may affect the efficiency and performance of the corresponding devices. In the above situations, the exciton dynamics under a temperature difference is mixed with

15.
The elongation method, originally proposed by Imamura, has been further developed over many years in our group as a method toward O(N) with high efficiency and high accuracy for systems of any dimension. This treatment, designed for one-dimensional (1D) polymers, is now available for three-dimensional (3D) systems, but geometry optimization is currently possible only for 1D systems. As an approach toward post-Hartree-Fock, it was also extended to

16.
17.
The explosive growth of the Internet and of database applications has driven databases to be more scalable and available, and to support on-line scaling without interrupting service. To support more client queries without downtime or degraded response time, more nodes have to be added while the database is running. This paper presents an overview of a scalable and available database that satisfies these requirements and proposes a novel on-line scaling method. Our method improves on the existing on-line scaling method in response time and throughput. It reduces unnecessary network use: we decrease the number of data copies by reusing the backup data. In addition, our on-line scaling operation can be processed in parallel by selecting adequate nodes as the new nodes. Our performance study shows that the method significantly reduces data copy time.

18.
The R-tree is a good structure for spatial searching. In this index structure, however, both the order of nodes within a level and the order in which they are visited during a query are arbitrary. Because the probability that an object lies in each of the MBRs sharing a parent node differs, visiting the most probable subnode first reduces the time cost in most cases. In some cases the probability that a point belongs to a rectangle is directly proportional to the size of the rectangle, but this conclusion rests on the assumption that objects are uniformly distributed over the area, which does not always hold. We identify a more direct parameter for measuring this probability and make a small change to the R-tree structure to increase the probability of finding a satisfying answer in the leading subtrees. We call this structure the probability-based arranged R-tree (PBAR-tree).
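The core idea, visiting the most probable subtree first, can be sketched as below. This is not the PBAR-tree from the abstract: the `Node` class, the `prob` field and `find_first` are hypothetical names, the probabilities are simply given rather than derived from the parameter the authors propose, and children are sorted at query time for brevity where a real index would presumably store them in that order.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

Rect = Tuple[float, float, float, float]  # (xmin, ymin, xmax, ymax)

@dataclass
class Node:
    mbr: Rect
    prob: float = 0.0              # assumed estimate of how likely a query hits this subtree
    children: List["Node"] = field(default_factory=list)
    payload: Optional[str] = None  # set on leaf entries only

def contains(rect: Rect, x: float, y: float) -> bool:
    xmin, ymin, xmax, ymax = rect
    return xmin <= x <= xmax and ymin <= y <= ymax

def find_first(node: Node, x: float, y: float) -> Optional[str]:
    """Return the first leaf payload whose MBR contains (x, y),
    trying likelier subtrees before less likely ones."""
    if not contains(node.mbr, x, y):
        return None
    if node.payload is not None:
        return node.payload
    for child in sorted(node.children, key=lambda c: c.prob, reverse=True):
        hit = find_first(child, x, y)
        if hit is not None:
            return hit
    return None

# Two overlapping leaves; "b" is assumed to be the more probable one.
leaf_a = Node(mbr=(0, 0, 2, 2), prob=0.2, payload="a")
leaf_b = Node(mbr=(1, 1, 3, 3), prob=0.8, payload="b")
root = Node(mbr=(0, 0, 3, 3), children=[leaf_a, leaf_b])

result = find_first(root, 1.5, 1.5)
```

Both leaves contain the query point, so the visiting order decides which one is returned first; with the assumed probabilities the search reaches `"b"` before `"a"`.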

19.
Geographic information services are enabled by advances in general Web service technology and by the focused efforts of the OGC in defining XML-based Web GIS services. Based on these models, this paper addresses service chaining, the process of combining or pipelining results from several interoperable GIS Web services to create a customized solution. It presents a mediated chaining architecture in which a specific service takes responsibility for performing the process that describes a service chain. We designed the Spatial Information Process Language (SIPL) for dynamically modeling and describing service chains, and implemented a prototype of the Spatial Information Process Execution Engine (SIPEE) for executing processes written in SIPL. Measures to improve the functionality and performance of such a system are also discussed.

20.
With advances in wireless and positioning technologies and the spread of wireless devices, interest in location-based services (LBS) is growing. To provide a location-based service, tracking data must be stored in a moving object database management system (MODBMS) under proper policies and managed efficiently. Methods that acquire location information at regular time intervals and then store and manage it have therefore been studied. In this paper, we suggest tracking data management techniques that use a topology corresponding to the moving path of the moving object. In our techniques, we update the MODBMS when the moving object arrives at a street intersection or a curved road, which is represented as a node in the topology, and we predict past and future locations using the attributes of the topology and a linear function. Because only location data corresponding to nodes in the topology are stored, the number of updates and the amount of data are reduced. Moreover, because prediction uses the topology as well as the existing location information, prediction accuracy is higher than when applying a linear function or spline function alone.
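Estimating a past location from positions stored only at topology nodes can be sketched with plain linear interpolation. The sketch makes assumptions beyond the abstract: each stored sample is a hypothetical `(time, x, y)` tuple recorded at a node, the object moves in a straight line at constant speed between consecutive nodes, and only times inside the recorded interval are handled (no future prediction).

```python
from bisect import bisect_right

def estimate_position(track, t):
    """Estimate (x, y) at time t from samples recorded at topology nodes.

    track: list of (time, x, y) tuples with strictly increasing times,
           containing at least two samples.
    """
    times = [p[0] for p in track]
    if not times[0] <= t <= times[-1]:
        raise ValueError("t lies outside the recorded interval")
    # Index of the sample that ends the segment containing t.
    i = min(bisect_right(times, t), len(track) - 1)
    (t0, x0, y0), (t1, x1, y1) = track[i - 1], track[i]
    ratio = (t - t0) / (t1 - t0)
    return (x0 + ratio * (x1 - x0), y0 + ratio * (y1 - y0))

# Hypothetical track: the object passes three nodes at times 0, 10 and 20.
track = [(0, 0.0, 0.0), (10, 100.0, 0.0), (20, 100.0, 50.0)]

midway_first = estimate_position(track, 5)
midway_second = estimate_position(track, 15)
```

Only three samples are stored for the whole trip, which reflects the reduction in updates that the abstract describes; positions in between are reconstructed on demand.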

