首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种基于广义相似性的共调控基因聚类算法
引用本文:赵宇海,乔百友,林天亮,王国仁.一种基于广义相似性的共调控基因聚类算法[J].东北大学学报(自然科学版),2009,30(11):1558-1561.
作者姓名:赵宇海  乔百友  林天亮  王国仁
作者单位:1. 东北大学,医学影像计算教育部重点实验室,辽宁,沈阳,110004;东北大学,信息科学与工程学院,辽宁,沈阳,110004
2. 东北大学,计算中心,辽宁,沈阳,110004
基金项目:国家自然科学基金,教育部博士学科点新教师基金,教育部重大培育项目,国家重点基础研究发展规划(973计划) 
摘    要:针对共调控基因的特殊性质和现有共调控基因聚类算法存在的不足,提出了基于广义相似性的聚类模型g-Cluster.正负共调控基因因具有相同的编码而被聚集到同一个共调控基因簇中.进一步提出了一种基于树结构的聚类算法FBTD,采用先宽度优先后深度优先的搜索策略,挖掘所有符合条件的最大g-Cluster,同时应用了高效的削减规则和优化策略.将该算法用于真实数据集.理论分析和实验结果都表明,该算法是实用和有效的.

关 键 词:共调控基因  聚类  模式相似性  基因本体  

A Clustering Algorithm Based on Generalized Similarity for Co-regulated Genes
ZHAO Yu-hai,QIAO Bai-you,LIN Tian-liang,WANG Guo-ren.A Clustering Algorithm Based on Generalized Similarity for Co-regulated Genes[J].Journal of Northeastern University(Natural Science),2009,30(11):1558-1561.
Authors:ZHAO Yu-hai  QIAO Bai-you  LIN Tian-liang  WANG Guo-ren
Institution:(1) Key Laboratory of Medical Image Computing, Ministry of Education, Northeastern University, Shenyang 110004, China; (2) School of Information Science and Engineering, Northeastern University, Shenyang 110004, China; (3) Computer Center, Northeastern University, Shenyang 110004, China
Abstract:A novel clustering model, i. e. , the g-Cluster, is developed on the basis of generalized similarity for the special properties and disadvantages of existing clustering algorithms of co-regulated genes. The positive and negative co-regulated genes in this model are integrated into the same cluster if and only if they are provided with the same code. Further, a tree-based clustering algorithm FBTD(first breadth then depth) is proposed, where the priorities in search strategy is that the breadth is taken first then the depth, to find out all the maximal g-Clusters with high-efficiency pruning rules and optimizing strategy performed simultaneously. Applying the FBTD algorithm to real datasets involving genes, both the theoretic and testing results showed that the algorithm is practically efficient.
Keywords:co-regulated genes  clustering  pattern similarity  gene ontology
本文献已被 万方数据 等数据库收录!
点击此处可从《东北大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《东北大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号