首页 | 本学科首页   官方微博 | 高级检索  
     

Finding finer functions for partially characterized proteins by protein-protein interaction networks
引用本文:LI YanHui GUO Zheng MA WenCai YANG Da WANG Dong ZHANG Min ZHU ding ZHONG GuoCai LI YongJin YAO Chen WANG Jing. Finding finer functions for partially characterized proteins by protein-protein interaction networks[J]. 科学通报(英文版), 2007, 52(24): 3363-3370. DOI: 10.1007/s11434-008-0016-z
作者姓名:LI YanHui GUO Zheng MA WenCai YANG Da WANG Dong ZHANG Min ZHU ding ZHONG GuoCai LI YongJin YAO Chen WANG Jing
作者单位:[1]School of Life Science and Bioinformatics Centre, University of Electronic Science and Technology of China, Chengdu 610054, China [2]Department of Bioinformatics, Bio-pharmaceutical Key Laboratory of Heilongjiang Province-Incubator of State Key Laboratory,Harbin Medical University, Harbin 150086, China
基金项目:Supported in part by the National Natural Science Foundation of China (Grant Nos. 30370388 and 30670539)
摘    要:Based on high-throughput data, numerous algorithms have been designed to find functions of novel proteins. However, the effectiveness of such algorithms is currently limited by some fundamental factors, including (1) the low a-priori probability of novel proteins participating in a detailed function; (2) the huge false data present in high-throughput datasets; (3) the incomplete data coverage of functional classes; (4) the abundant but heterogeneous negative samples for training the algorithms; and (5) the lack of detailed functional knowledge for training algorithms. Here, for partially characterized proteins, we suggest an approach to finding their finer functions based on protein interaction sub-networks or gene expression patterns, defined in function-specific subspaces. The proposed approach can lessen the above-mentioned problems by properly defining the prediction range and functionally filtering the noisy data, and thus can efficiently find proteins’ novel functions. For thousands of yeast and human proteins partially characterized, it is able to reliably find their finer functions (e.g., the translational functions) with more than 90% precision. The predicted finer functions are highly valuable both for guiding the follow-up wet-lab validation and for providing the necessary data for training algorithms to learn other proteins.

关 键 词:蛋白质 相互作用 基因存在论 基因函数 算法
收稿时间:2007-05-21
修稿时间:2007-09-25

Finding finer functions for partially characterized proteins by protein-protein interaction networks
YanHui Li,Zheng Guo,WenCai Ma,Da Yang,Dong Wang,Min Zhang,Jing Zhu,GuoCai Zhong,YongJin Li,Chen Yao,Jing Wang. Finding finer functions for partially characterized proteins by protein-protein interaction networks[J]. Chinese science bulletin, 2007, 52(24): 3363-3370. DOI: 10.1007/s11434-008-0016-z
Authors:YanHui Li  Zheng Guo  WenCai Ma  Da Yang  Dong Wang  Min Zhang  Jing Zhu  GuoCai Zhong  YongJin Li  Chen Yao  Jing Wang
Affiliation:(1) School of Life Science and Bioinformatics Centre, University of Electronic Science and Technology of China, Chengdu, 610054, China;(2) Department of Bioinformatics, Bio-pharmaceutical Key Laboratory of Heilongjiang Province-Incubator of State Key Laboratory, Harbin Medical University, Harbin, 150086, China
Abstract:Based on high-throughput data, numerous algorithms have been designed to find functions of novel proteins. However, the effectiveness of such algorithms is currently limited by some fundamental factors, including (1) the low a-priori probability of novel proteins participating in a detailed function; (2) the huge false data present in high-throughput datasets; (3) the incomplete data coverage of functional classes; (4) the abundant but heterogeneous negative samples for training the algorithms; and (5) the lack of detailed functional knowledge for training algorithms. Here, for partially characterized proteins, we suggest an approach to finding their finer functions based on protein interaction sub-networks or gene expression patterns, defined in function-specific subspaces. The proposed approach can lessen the above-mentioned problems by properly defining the prediction range and functionally filtering the noisy data, and thus can efficiently find proteins' novel functions. For thousands of yeast and human proteins partially characterized, it is able to reliably find their finer functions (e.g., the translational functions) with more than 90% precision. The predicted finer functions are highly valuable both for guiding the follow-up wet-lab validation and for providing the necessary data for training algorithms to learn other proteins.
Keywords:protein-protein interaction   Gene Ontology   gene function   algorithm   prediction
本文献已被 维普 万方数据 SpringerLink 等数据库收录!
点击此处可从《科学通报(英文版)》浏览原始摘要信息
点击此处可从《科学通报(英文版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号