
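Application of Genetic Algorithms to the Discretization of Continuous Attributes in Decision Tables
Cite this article: ZHAO Wei-dong, DAI Wei-hui, CAI Bin. Application of Genetic Algorithms to the Discretization of Continuous Attributes in Decision Tables[J]. Systems Engineering-Theory & Practice, 2003, 23(1): 62-67.
Authors: ZHAO Wei-dong  DAI Wei-hui  CAI Bin
Affiliation: School of Management, Fudan University
Abstract: Discretizing continuous attributes is an important means of compressing data and simplifying analysis, and it is also a difficult research problem in pattern recognition, machine learning, rough set analysis, and related fields. Many discretization methods have been proposed; their main shortcoming is that the choice of the candidate cut point set, which strongly affects the quality of the discretization, is highly subjective. Optimal discretization is NP-hard, and the heuristics adopted by most discretization algorithms rarely yield satisfactory results. Based on rough set theory, this paper examines these problems, reformulates the selection of cut points as a 0-1 integer program, and proposes a real-coded genetic algorithm for computing the optimal set of cut points.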

Keywords: discretization  decision table  rough set  genetic algorithm
Article ID: 1000-6788(2003)01-0062-06
Revised: June 11, 2001

The Discretization of Continuous Attributes Using Genetic Algorithms
ZHAO Wei-dong, DAI Wei-hui, CAI Bin. The Discretization of Continuous Attributes Using Genetic Algorithms[J]. Systems Engineering-Theory & Practice, 2003, 23(1): 62-67.
Authors: ZHAO Wei-dong  DAI Wei-hui  CAI Bin
Institution: School of Management, Fudan University
Abstract: The discretization of continuous attributes is an important method for compressing data and simplifying analysis, and it is a focus of research in pattern recognition, machine learning, and rough sets. Several discretization algorithms are in use, such as MD and entropy-based discretization, but they have disadvantages; for example, the initial set of cut points is hard to determine. Optimal discretization has been proved to be NP-hard, and the heuristics used by most algorithms usually yield only local minima, although the results are sometimes satisfactory. Based on rough set theory, this paper first discusses these problems. It then transforms the discretization of continuous attributes into a 0-1 integer program, which can be solved by existing software such as LINDO. Finally, a genetic algorithm using decimal encoding is proposed to compute the optimal discretization.
Keywords: discretization  decision table  rough set  0-1 integer programming  genetic algorithm
This article is indexed in CNKI, Wanfang Data, and other databases.
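
The abstract does not spell out the paper's 0-1 formulation or its encoding, so the following is only an illustrative sketch under stated assumptions. It assumes the common rough-set reading of the problem: each candidate cut point gets a 0-1 variable, and the goal is to select as few cuts as possible while every pair of objects with different decisions is still separated by at least one selected cut. For simplicity it uses a binary chromosome with a repair step, not the real-number (decimal) encoding the paper proposes. All function names, parameters, and the toy decision table are hypothetical.

```python
import random
from itertools import combinations


def candidate_cuts(rows):
    """Candidate cuts: midpoints between consecutive distinct values per attribute."""
    cuts = []
    for a in range(len(rows[0])):
        vals = sorted({r[a] for r in rows})
        cuts += [(a, (lo + hi) / 2) for lo, hi in zip(vals, vals[1:])]
    return cuts


def pairs_to_cover(rows, labels, cuts):
    """For each pair of objects with different decisions, the cut indices separating it."""
    cover = []
    for i, j in combinations(range(len(rows)), 2):
        if labels[i] != labels[j]:
            sep = {k for k, (a, c) in enumerate(cuts)
                   if (rows[i][a] < c) != (rows[j][a] < c)}
            if sep:  # identical rows with different decisions cannot be separated
                cover.append(sep)
    return cover


def repair(x, cover, rng):
    """Make a 0-1 vector feasible by switching on a cut for every uncovered pair."""
    x = list(x)
    for sep in cover:
        if not any(x[k] for k in sep):
            x[rng.choice(sorted(sep))] = 1
    return x


def ga_select_cuts(rows, labels, pop_size=30, generations=60, p_mut=0.05, seed=0):
    """Binary-coded GA (an assumption, not the paper's encoding) minimizing the cut count."""
    rng = random.Random(seed)
    cuts = candidate_cuts(rows)
    cover = pairs_to_cover(rows, labels, cuts)
    n = len(cuts)
    pop = [repair([rng.randint(0, 1) for _ in range(n)], cover, rng)
           for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=sum)                      # fewer selected cuts is better
        parents = pop[: pop_size // 2]         # truncation selection keeps the best half
        children = []
        while len(parents) + len(children) < pop_size:
            p, q = rng.sample(parents, 2)
            point = rng.randrange(1, n) if n > 1 else 0
            child = p[:point] + q[point:]      # one-point crossover
            child = [1 - g if rng.random() < p_mut else g for g in child]
            children.append(repair(child, cover, rng))
        pop = parents + children
    best = min(pop, key=sum)
    return [cuts[k] for k in range(n) if best[k]]


# Toy decision table: two continuous condition attributes and a binary decision.
rows = [(0.8, 2.0), (1.0, 0.5), (1.3, 3.0), (1.4, 1.0),
        (1.4, 2.0), (1.6, 3.0), (1.3, 1.0)]
labels = [1, 0, 0, 1, 0, 1, 1]
selected = ga_select_cuts(rows, labels)
print(selected)  # list of (attribute index, cut value) pairs
```

Because every chromosome is repaired before it enters the population, the returned cut set always keeps the discretized table consistent; the search only trades off how many cuts are kept. An exact integer-programming solver could be applied to the same covering constraints instead of the GA.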