
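A protein secondary structure prediction system based on the Compound Pyramid Model
Cite this article: Yang Bingru, Xie Yonghong, Hou Wei, Zhou Zhun. A protein secondary structure prediction system based on the Compound Pyramid Model [J]. Chinese Science Bulletin (科学通报), 2009, 54(21): 3311-3319.
Authors: Yang Bingru, Xie Yonghong, Hou Wei, Zhou Zhun
Affiliation: School of Information Engineering, University of Science and Technology Beijing, Beijing 100083, China
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 69835001, 60675030 and 60875029) and the Key Science and Technology Project of the Ministry of Education (Grant No. [2000]175)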
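Abstract: Using the prediction-system approach, a stepwise-refined, multi-layer hierarchical prediction system model for protein secondary structure prediction is proposed, namely the Compound Pyramid Model. The model consists of four independent yet cooperating layers and, through intelligent interfaces, integrates models and methods derived from the KDTICM theory, such as SAC, AAC and KDD*. Physicochemical attributes and structural sequences run through the whole model: Causal Cellular Automata are used to select effective physicochemical attributes, a high-purity structure database is constructed as the training data source, and domain knowledge and background knowledge are used for optimization. The model achieves Q3 accuracies of 83.06% and 80.49% on the RS126 and CB513 data sets, respectively, and 93.12% in a prediction experiment on proteins biased toward the α/β type; there remains room for optimization to further improve accuracy.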

Keywords: protein secondary structure prediction; Compound Pyramid Model; prediction-system approach; data mining
Received: 2009-05-13

A novel protein secondary structure prediction system based on the Compound Pyramid Model
Institution: School of Information Engineering, University of Science and Technology Beijing, Beijing 100083, China
Abstract: To address protein secondary structure prediction, a pressing problem in bioinformatics, a stepwise-refined, multi-layered prediction system model, the Compound Pyramid Model, is proposed. The model consists of four independent but cooperating layers and, through intelligent interfaces, integrates several methods, such as SVM and the KDD* process model. It draws throughout on both the physicochemical attributes of amino acids and structural information: the effective attributes are selected by Causal Cellular Automata, and a high-purity structure database is constructed for training. The model achieved Q3 accuracies of 83.06% and 80.49% on the RS126 and CB513 data sets, respectively. In a prediction experiment on alpha/beta proteins, it achieved a Q3 accuracy of 93.12%.
Keywords: Compound Pyramid Model; protein secondary structure prediction; Hybrid Prediction Model; data mining
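The Q3 accuracy quoted in the abstract is conventionally the fraction of residues whose predicted three-state label (helix H, strand E, coil C) matches the observed label. The following is a minimal sketch of that calculation only, not the authors' evaluation code; the function name `q3_accuracy` and the example strings are hypothetical and are not data from the paper.

```python
def q3_accuracy(predicted: str, observed: str) -> float:
    """Return the fraction of positions where the predicted state equals the observed state.

    Both arguments are per-residue state strings over H (helix), E (strand), C (coil).
    """
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    if not observed:
        raise ValueError("sequences must be non-empty")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return matches / len(observed)


# Hypothetical 10-residue example (illustrative only, not from the paper).
observed = "HHHHCCEEEC"
predicted = "HHHCCCEEEE"
print(f"Q3 = {q3_accuracy(predicted, observed):.2%}")  # 8 of 10 residues match
```

A data-set-level Q3 such as the RS126 or CB513 figures is usually obtained by pooling residues over all chains or by averaging per-chain values; the abstract does not say which convention was used.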