首页 | 本学科首页   官方微博 | 高级检索  
     检索      

全基因组关联研究中的遗传位点分析方法
引用本文:卜庆伟,杨雨诗,葛方丽,陈雄.全基因组关联研究中的遗传位点分析方法[J].科学技术与工程,2017,17(17).
作者姓名:卜庆伟  杨雨诗  葛方丽  陈雄
作者单位:南京理工大学国防重点实验室,南京理工大学国防重点实验室,南京理工大学国防重点实验室,南京理工大学国防重点实验室
基金项目:国家自然科学基金项目(51606098),江苏省自然科学基金(BK20140772)
摘    要:提出多种模型方法,对高维位点数据进行分析,为基因定位和复杂疾病性状遗传等方面的研究提供新的技术支持。为了实现关联位点在基因中的定位,首先建立映射模型,对每个位点的碱基对重新编码;然后,提出将质量控制模型与关联分析模型相结合的方法,确定位点的关联程度;随后利用基于随机森林的重要性排序,筛选与该遗传疾病最相关的致病位点;最后,设计出高维RBF神经网络,得到每个位点对性状的相关性系数,探索出与疾病多类性状相关的位点。结合多种检验方式,验证所建模型能够较为准确地定位与疾病相关的位点及基因。各类模型具有极强的推广性,广泛适用于筛选占有各自权值的大样本数据。

关 键 词:GWAS  关联分析模型  RBF神经网络  随机森林  卡方检验  决策树
收稿时间:2016/12/25 0:00:00
修稿时间:2017/2/14 0:00:00

Genetic Loci Analysis Method in Genome-wide Association Studies
Bu Qingwei,Yang Yushi,Ge Fangli and Chen Xiong.Genetic Loci Analysis Method in Genome-wide Association Studies[J].Science Technology and Engineering,2017,17(17).
Authors:Bu Qingwei  Yang Yushi  Ge Fangli and Chen Xiong
Institution:National Defense Key Laboratory, Nanjing University of Science and Technology,National Defense Key Laboratory, Nanjing University of Science and Technology,National Defense Key Laboratory, Nanjing University of Science and Technology,National Defense Key Laboratory, Nanjing University of Science and Technology
Abstract:In this paper, a variety of model methods are proposed to analyze the high dimensional data. They provides new technical support for the study of gene mapping and complex disease traits inheritance, etc. In order to realize the localization of associated locus in genes, a mapping model was first constructed to recode the base pairs of each locus. Then, the method of combining quality control model with correlation analysis model is proposed to determine the correlation degree of locus. After that, the most relevant pathogenic locus associated with this genetic disease is screened out, according to the importance ranking of random forests. Finally, a high dimensional RBF neural network was designed to obtain the correlation coefficient of each locus and to explore the loci related to multiple traits of the disease. Combined with a variety of testing methods, the proposed model can accurately locate the loci and genes related to the disease. All kinds of models have strong generalization and are widely applicable to select large sample data with their own weights.
Keywords:GWAS  correlation analysis model  RBF neural network  random forests  Chi-square test  decision tree
本文献已被 CNKI 等数据库收录!
点击此处可从《科学技术与工程》浏览原始摘要信息
点击此处可从《科学技术与工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号