首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于闭合模式的高维基因表达谱多类分类
引用本文:李宏,李翔,吴敏,陈松乔,易丽君.基于闭合模式的高维基因表达谱多类分类[J].中南大学学报(自然科学版),2008,39(5).
作者姓名:李宏  李翔  吴敏  陈松乔  易丽君
作者单位:中南大学,信息科学与工程学院,湖南,长沙,410083
基金项目:国家自然科学基金,中南大学校科研和教改项目
摘    要:针对多类高维基因表达谱的特点,提出一种基于闭合模式的多类分类算法CBCP,即根据垂直格式的数据集采用路径枚举的方法挖掘闭合模式,极大地减少了冗余模式的产生.然后,对所有闭合模式进行排序,通过覆盖训练集建立分类器.针对分类器无法识别的样本提出权重算法进行判断,克服了使用Default类预测不精确的问题.研究结果表明,CBCP与经典分类算法如CBA和C4.5相比具有更高的预测准确率,并且在基因数大幅增加而样本数不变的情况下仍具有较强的稳定性,证明CBCP的可扩展性强,适用于高维数据集的多类分类预测.

关 键 词:关联规则  闭合模式  多类别  权重算法

Multi-class classification of high-dimension gene expression profile based on closed patterns
LI Hong,LI Xiang,WU Min,CHEN Song-qiao,YI Li-jun.Multi-class classification of high-dimension gene expression profile based on closed patterns[J].Journal of Central South University:Science and Technology,2008,39(5).
Authors:LI Hong  LI Xiang  WU Min  CHEN Song-qiao  YI Li-jun
Institution:LI Hong,LI Xiang,WU Min,CHEN Song-qiao,YI Li-jun(School of Information Science , Engineering,Central South University,Changsha 410083,China)
Abstract:According to the characteristics of multi-class high-dimension gene expression profile,a new multi-class classification algorithm(CBCP) based on closed pattern was designed.Firstly an approach called path enumeration was proposed to mine closed patterns based on the vertical formatted data-table,which can reduce most redundant patterns.Then closed patterns were sorted and used to cover train dataset for building the classifier.The unrecognized samples were classified by weight algorithm,which can overcome t...
Keywords:association rules  closed pattern  multi-class  weight algorithm  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号