
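Identification of Gene Sequences in Model Organisms
Cite this article: CHEN Cui-xia, LI Qian-zhong. Identification of gene sequences in model organisms [J]. Journal of Inner Mongolia University (Natural Science Edition), 2005, 36(4): 417-424.
Authors: CHEN Cui-xia, LI Qian-zhong
Affiliation: Department of Physics, College of Science and Technology, Inner Mongolia University, Hohhot 010021
Funding: National Natural Science Foundation of China (No. 30160025)
Abstract: The complete genome sequence of a eukaryote can be divided into three types: exons, introns, and intergenic sequences. Based on the conservation of the sequence near splice sites, the compositional features of the sequences, and the 3-periodicity of the reading frame in coding sequences, the standard diversity source of each of the three sequence types is built from the probabilities of the 64 triplets along the sequence and the probabilities of the 4 bases at positions near the 5′ and 3′ splice sites (30 positions in total), giving 184 parameters in all (64 + 30 × 4). The type of a given sequence is then determined by the minimum increment of diversity between the diversity measure of that sequence and those of the three standard sources. When the standard sources carry 184 information parameters, the prediction success rate is at least 4.61% higher than with 64 parameters. The success rates with 184 parameters are: nematode (C. elegans) 88.37%, yeast (S. cerevisiae) 90.72%, Arabidopsis (A. thaliana) 91.08%, fruit fly (D. melanogaster) 92.28%, and E. coli 92.88%. A comparison of the correctly and incorrectly predicted sequences shows that the 184 parameter values of the mispredicted sequences closely resemble those of the class to which they were assigned.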

Keywords: gene sequence; splice site; increment of diversity
Article ID: 1000-1638(2005)04-0417-08
Revised: 20 August 2004
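
The 184-parameter description (64 trimers plus 4 bases at each of 30 splice-site positions) can be sketched as a count vector. This is a minimal illustration, not the authors' implementation: the function names, the use of overlapping trimers, the choice of which 30 positions form the window, and the concatenation of both parts into one vector are all assumptions, since the abstract does not specify them.

```python
from itertools import product

BASES = "ACGT"
TRIMERS = ["".join(p) for p in product(BASES, repeat=3)]  # the 64 trimers
TRIMER_INDEX = {t: i for i, t in enumerate(TRIMERS)}
N_SITE_POSITIONS = 30  # positions near the 5' and 3' splice sites


def trimer_counts(seq, step=1):
    """Count the 64 trimers along seq.

    step=1 counts overlapping trimers (an assumption); step=3 would
    count trimers in a single reading frame.
    """
    seq = seq.upper()
    counts = [0] * len(TRIMERS)
    for i in range(0, len(seq) - 2, step):
        idx = TRIMER_INDEX.get(seq[i:i + 3])
        if idx is not None:  # skip trimers containing N or other symbols
            counts[idx] += 1
    return counts


def site_counts(site_window):
    """One-hot base counts for a 30-base splice-site window (120 values).

    Values are ordered position by position, A/C/G/T within each position.
    """
    if len(site_window) != N_SITE_POSITIONS:
        raise ValueError("site window must contain exactly 30 bases")
    counts = [0] * (N_SITE_POSITIONS * len(BASES))
    for pos, base in enumerate(site_window.upper()):
        j = BASES.find(base)
        if j >= 0:  # ambiguous bases contribute nothing
            counts[pos * len(BASES) + j] = 1
    return counts


def feature_vector(seq, site_window):
    """184 counts for one sequence: 64 trimer counts + 120 site counts."""
    return trimer_counts(seq) + site_counts(site_window)


def standard_source(examples):
    """Element-wise sum of feature vectors over (seq, site_window) pairs."""
    total = [0] * (len(TRIMERS) + N_SITE_POSITIONS * len(BASES))
    for seq, site_window in examples:
        for k, value in enumerate(feature_vector(seq, site_window)):
            total[k] += value
    return total
```

Dividing each part of a summed vector by its total would give the probabilities the abstract refers to; keeping raw counts is convenient because diversity measures are usually defined on counts.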

An Identification of the Model Species Genomes
CHEN Cui-xia, LI Qian-zhong. An Identification of the Model Species Genomes [J]. Acta Scientiarum Naturalium Universitatis Neimongol, 2005, 36(4): 417-424.
Authors:CHEN Cui-xia  LI Qian-zhong
Abstract: The complete genome sequences of eukaryotes can be grouped into three kinds: introns, exons, and intergenic DNA. Based on the conservation of nucleotides around the splice sites, the compositional features of the sequences, and the 3-periodicity of the reading frame in coding sequences, the standard source of diversity for each kind is determined by the probabilities of the 64 trimers over the whole sequence and of the 4 bases at 30 positions around the splice sites. A sequence is assigned to the kind whose standard source gives the least increment of diversity. Using the probabilities of the 64 trimers together with the 120 position-specific bases gives higher rates of correct prediction, on both the standard sets and the test sets, than using the 64 trimers alone, in terms of sensitivity (Sn) and specificity (Tn). The overall rates are as follows: C. elegans 88.37%, S. cerevisiae 90.72%, A. thaliana 91.08%, D. melanogaster 92.28%, E. coli 92.88%. An analysis of the falsely predicted sequences shows that their parameters closely resemble those of the kind of sequence to which they were assigned.
Keywords:gene sequences  splice site  the increment of diversity
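
The decision rule in the abstract (assign a sequence to the class with the least increment of diversity) can be sketched as follows. The abstract does not state the formulas, so this sketch assumes the commonly used definitions: for counts n_i with total N, the diversity measure is D = N log N − Σ n_i log n_i, and the increment of diversity between two sources X and Y is ID(X, Y) = D(X + Y) − D(X) − D(Y). The function names and the toy three-component count vectors are hypothetical.

```python
import math


def diversity(counts):
    """D = N log N - sum(n_i log n_i), with 0 log 0 taken as 0."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return total * math.log(total) - sum(n * math.log(n) for n in counts if n > 0)


def increment_of_diversity(x, y):
    """ID(X, Y) = D(X + Y) - D(X) - D(Y) for two equal-length count vectors."""
    if len(x) != len(y):
        raise ValueError("count vectors must have the same length")
    merged = [a + b for a, b in zip(x, y)]
    return diversity(merged) - diversity(x) - diversity(y)


def classify(query, sources):
    """Return the label of the standard source with the smallest ID to query.

    sources maps a class label (e.g. "exon") to its count vector.
    """
    return min(sources, key=lambda label: increment_of_diversity(query, sources[label]))


# Toy example with made-up 3-component counts (not data from the paper).
sources = {"exon": [30, 10, 1], "intron": [5, 20, 20]}
print(classify([3, 1, 0], sources))
```

Under these definitions the increment is non-negative and equals zero when the two count vectors are proportional, so a smaller value means the query's composition is closer to that standard source.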
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.