首页 | 本学科首页   官方微博 | 高级检索  
     检索      

原核基因识别中的一种负样本生成算法
引用本文:马彬广.原核基因识别中的一种负样本生成算法[J].天津理工大学学报,2004,20(1):89-92.
作者姓名:马彬广
作者单位:天津大学,物理系,天津,300072
摘    要:在基因识别的两类算法中,判别算法通常需要正负两类样本来训练参数.在原核生物的基因组中,由于可充当负样本的基因间序列太少,如何产生负样本便成为原核基因识别中的一个问题.本文提供了一种基于"自相似映射"的负样本生成算法,与通常使用的随机生成算法不同,该算法不需要生成随机数.本文给出了两种负样本生成算法的比较,并初步讨论了自相似性对于DNA序列分析的意义.

关 键 词:基因识别  负样本  自相似映射  Z曲线  Fisher判别
文章编号:1004-2261(2004)01-0089-04
修稿时间:2003年9月19日

A self-similarity-map-based algorithm for generating negative samples and its application in prokaryotic gene recognition
MA Bin-guang.A self-similarity-map-based algorithm for generating negative samples and its application in prokaryotic gene recognition[J].Journal of Tianjin University of Technology,2004,20(1):89-92.
Authors:MA Bin-guang
Abstract:In the two types of gene recognition algorithms discriminant and clustered, the discriminant algorithms usually need two groups of samples as training parameters. For the prokaryotic genomes, few intergenic sequence acts as negative samples. Therefore,an unsolved problem is how to generate negative samples for prokaryotic gene recognition. In the paper, a simple algorithm for generating negative samples is presented based on the self-similar map. In contrast to widely used randomness-based algorithms,our algorithm doesn't need random numbers. A comparison between self-similarity-based and randomness-based algorithms is given , and the significance and utility of self-similarity in the analysis of DNA sequence is also disscussed.
Keywords:gene recognition  negative samples  self-similar map  Z curves  Fisher diseriminanee
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号