一种基于粗糙均方残基的模糊双聚类方法 A Fuzzy Biclustering Approach Based on Rough Average Square Residue期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

一种基于粗糙均方残基的模糊双聚类方法

作者单位：	;1.河南师范大学计算机与信息工程学院;2.计算智能与数据挖掘河南省高校工程技术研究中心

摘要：	双聚类作为一种无监督的学习方法,其作用是对基因表达数据进行分析.为了获取较大容量的双聚类簇,弥补传统的双聚类方法在基因表达数据一致波动性方面的不足,引入粗糙集的上、下近似集概念,将粗糙集理论运用到模糊双聚类算法中,将粗糙上、下近似集与加权均方残差相结合,得到新的粗糙均方残基,进而提出一种基于粗糙均方残基的模糊双聚类算法.针对基因表达数据集,首先进行缺失值填补;其次,用非负矩阵分解算法对基因数据集进行降维;最后,计算数据矩阵的粗糙均方残基,结合综合评判度量函数与贴近度原则对矩阵的行列进行删除和添加,得到容量更大的双聚类结果.实验结果表明,该模糊双聚类算法是有效的.
关键词：	粗糙集粗糙均方残基双聚类
A Fuzzy Biclustering Approach Based on Rough Average Square Residue

Affiliation:	,Collage Computer and Information Engineering,Henan Normal University,Engineering Technology Research Center for Computing Intelligence & Data Mining of Henan Province

Abstract:	Biclustering as an unsupervised learning method can analyze gene expression data.However,some traditional biclustering methods have the shortcoming of consistent volatility for gene expression data.To solve this problem,and obtain large capacity clusters of biclustering,the upper and lower approximation of rough set was introduced in this paper,and the rough set theory was applied into fuzzy biclustering algorithm.By combining upper and lower approximation with weighted mean square residual,a novel rough mean square residue was defined.Then an improved fuzzy biclustering algorithm based on rough mean square residue was proposed.For gene expression dataset,the missing values were filled up firstly.A factorization algorithm of non-negative matrix was used to reduce dimension of gene dataset.And the rough mean square residue of data matrix was calculated.Finally,through integrating a comprehensive evaluation measure function and nearness degree,the rows and columns of matrixes were deleted or added in order to obtain a larger of biclustering results.Experimental results show that the proposed fuzzy biclustering algorithm is efficient.

Keywords:	rough set rough average square residue biclustering
本文献已被 CNKI 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏