Found 20 similar articles; search took 15 ms
1.
SNP genotyping has emerged as a technology to incorporate copy number variants (CNVs) into genetic analyses of human traits. However, the extent to which SNP platforms accurately capture CNVs remains unclear. Using independent, sequence-based CNV maps, we find that commonly used SNP platforms have limited or no probe coverage for a large fraction of CNVs. Despite this, in 9 samples we inferred 368 CNVs using Illumina SNP genotyping data and experimentally validated over two-thirds of these. We also developed a method (SNP-Conditional Mixture Modeling, SCIMM) to robustly genotype deletions using as few as two SNP probes. We find that HapMap SNPs are strongly correlated with 82% of common deletions, but the newest SNP platforms effectively tag about 50%. We conclude that currently available genome-wide SNP assays can capture CNVs accurately, but improvements in array designs, particularly in duplicated sequences, are necessary to facilitate more comprehensive analyses of genomic variation.
2.
3.
A framework for variation discovery and genotyping using next-generation DNA sequencing data Cited by: 7 (self-citations: 0, citations by others: 7)
DePristo MA Banks E Poplin R Garimella KV Maguire JR Hartl C Philippakis AA del Angel G Rivas MA Hanna M McKenna A Fennell TJ Kernytsky AM Sivachenko AY Cibulskis K Gabriel SB Altshuler D Daly MJ 《Nature genetics》2011,43(5):491-498
Recent advances in sequencing technology make it possible to comprehensively catalog genetic variation in population samples, creating a foundation for understanding human disease, ancestry and evolution. The amounts of raw data produced are prodigious, and many computational steps are required to translate this output into high-quality variant calls. We present a unified analytic framework to discover and genotype variation among multiple samples simultaneously that achieves sensitive and specific results across five sequencing technologies and three distinct, canonical experimental designs. Our process includes (i) initial read mapping; (ii) local realignment around indels; (iii) base quality score recalibration; (iv) SNP discovery and genotyping to find all potential variants; and (v) machine learning to separate true segregating variation from machine artifacts common to next-generation sequencing technologies. We here discuss the application of these tools, instantiated in the Genome Analysis Toolkit, to deep whole-genome, whole-exome capture and multi-sample low-pass (~4×) 1000 Genomes Project datasets.
4.
A genome-wide survey of RAS transformation targets Cited by: 28 (self-citations: 0, citations by others: 28)
Zuber J Tchernitsa OI Hinzmann B Schmitz AC Grips M Hellriegel M Sers C Rosenthal A Schäfer R 《Nature genetics》2000,24(2):144-152
5.
Volkman SK Sabeti PC DeCaprio D Neafsey DE Schaffner SF Milner DA Daily JP Sarr O Ndiaye D Ndir O Mboup S Duraisingh MT Lukens A Derr A Stange-Thomann N Waggoner S Onofrio R Ziaugra L Mauceli E Gnerre S Jaffe DB Zainoun J Wiegand RC Birren BW Hartl DL Galagan JE Lander ES Wirth DF 《Nature genetics》2007,39(1):113-119
Genetic variation allows the malaria parasite Plasmodium falciparum to overcome chemotherapeutic agents, vaccines and vector control strategies and remain a leading cause of global morbidity and mortality. Here we describe an initial survey of genetic variation across the P. falciparum genome. We performed extensive sequencing of 16 geographically diverse parasites and identified 46,937 SNPs, demonstrating rich diversity among P. falciparum parasites (pi = 1.16 x 10(-3)) and strong correlation with gene function. We identified multiple regions with signatures of selective sweeps in drug-resistant parasites, including a previously unidentified 160-kb region with extremely low polymorphism in pyrimethamine-resistant parasites. We further characterized 54 worldwide isolates by genotyping SNPs across 20 genomic regions. These data begin to define population structure among African, Asian and American groups and illustrate the degree of linkage disequilibrium, which extends over relatively short distances in African parasites but over longer distances in Asian parasites. We provide an initial map of genetic diversity in P. falciparum and demonstrate its potential utility in identifying genes subject to recent natural selection and in understanding the population genetics of this parasite.
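The diversity statistic pi quoted in this abstract is commonly defined as the average number of pairwise differences per site among aligned sequences. The sketch below illustrates that textbook definition only; the function name `nucleotide_diversity` and the toy sequences are assumptions made for illustration, and the estimator actually used in the paper may differ (for example, in how missing data are handled).

```python
from itertools import combinations


def nucleotide_diversity(seqs):
    """Average pairwise differences per site for equal-length aligned sequences.

    Textbook definition of pi; not necessarily the paper's exact estimator.
    """
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if length == 0 or any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned, non-empty and equal length")

    pairs = list(combinations(seqs, 2))
    # Count mismatching sites over every unordered pair of sequences.
    total_diffs = sum(
        sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs
    )
    return total_diffs / (len(pairs) * length)


# Hypothetical toy alignment (not data from the study).
toy = ["ACGTACGTAC", "ACGTACGTAT", "ACGAACGTAC"]
pi = nucleotide_diversity(toy)
```

On real data the same quantity would be computed over genome-wide alignments, where a value near 10(-3) means roughly one difference per kilobase between two randomly chosen parasites.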
6.
7.
Detecting genetic variants that are highly divergent from a reference sequence remains a major challenge in genome sequencing. We introduce de novo assembly algorithms using colored de Bruijn graphs for detecting and genotyping simple and complex genetic variants in an individual or population. We provide an efficient software implementation, Cortex, the first de novo assembler capable of assembling multiple eukaryotic genomes simultaneously. Four applications of Cortex are presented. First, we detect and validate both simple and complex structural variations in a high-coverage human genome. Second, we identify more than 3 Mb of sequence absent from the human reference genome, in pooled low-coverage population sequence data from the 1000 Genomes Project. Third, we show how population information from ten chimpanzees enables accurate variant calls without a reference sequence. Last, we estimate classical human leukocyte antigen (HLA) genotypes at HLA-B, the most variable gene in the human genome.
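To make the idea of a colored de Bruijn graph concrete, the following is a minimal sketch, not Cortex's implementation: each k-mer is a node, each node carries the set of samples ("colors") in which it was seen, and nodes with more than one successor mark places where samples may diverge. The function names and toy reads are hypothetical, and the sketch ignores reverse complements, coverage counts and sequencing errors, all of which a real assembler has to handle.

```python
from collections import defaultdict


def build_colored_graph(samples, k):
    """Map each k-mer to the set of sample names (colors) that contain it."""
    colors = defaultdict(set)
    for name, reads in samples.items():
        for read in reads:
            for i in range(len(read) - k + 1):
                colors[read[i:i + k]].add(name)
    return colors


def successors(colors, kmer):
    """k-mers in the graph that overlap `kmer` by k-1 bases on its right."""
    return [kmer[1:] + b for b in "ACGT" if kmer[1:] + b in colors]


def branch_points(colors):
    """Nodes with several successors: candidate starts of variant bubbles."""
    result = {}
    for kmer in colors:
        nxt = successors(colors, kmer)
        if len(nxt) > 1:
            result[kmer] = {n: sorted(colors[n]) for n in nxt}
    return result


# Two hypothetical samples differing by a single base (A vs T).
samples = {"ref": ["ACGTACCA"], "alt": ["ACGTTCCA"]}
graph = build_colored_graph(samples, k=4)
branches = branch_points(graph)
```

Here the shared node `ACGT` branches into one path seen only in `ref` and one seen only in `alt`, which is the color pattern a variant caller would look for when genotyping a site without a reference.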
8.
Bradfield JP Taal HR Timpson NJ Scherag A Lecoeur C Warrington NM Hypponen E Holst C Valcarcel B Thiering E Salem RM Schumacher FR Cousminer DL Sleiman PM Zhao J Berkowitz RI Vimaleswaran KS Jarick I Pennell CE Evans DM St Pourcain B Berry DJ Mook-Kanamori DO Hofman A Rivadeneira F Uitterlinden AG van Duijn CM van der Valk RJ de Jongste JC Postma DS Boomsma DI Gauderman WJ Hassanein MT Lindgren CM Mägi R Boreham CA Neville CE Moreno LA Elliott P Pouta A Hartikainen AL Li M Raitakari O 《Nature genetics》2012,44(5):526-531
Multiple genetic variants have been associated with adult obesity and a few with severe obesity in childhood; however, less progress has been made in establishing genetic influences on common early-onset obesity. We performed a North American, Australian and European collaborative meta-analysis of 14 studies consisting of 5,530 cases (≥95th percentile of body mass index (BMI)) and 8,318 controls (<50th percentile of BMI) of European ancestry. Taking forward the eight newly discovered signals yielding association with P < 5 × 10(-6) in nine independent data sets (2,818 cases and 4,083 controls), we observed two loci that yielded genome-wide significant combined P values near OLFM4 at 13q14 (rs9568856; P = 1.82 × 10(-9); odds ratio (OR) = 1.22) and within HOXB5 at 17q21 (rs9299; P = 3.54 × 10(-9); OR = 1.14). Both loci continued to show association when two extreme childhood obesity cohorts were included (2,214 cases and 2,674 controls). These two loci also yielded directionally consistent associations in a previous meta-analysis of adult BMI(1).
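Several abstracts in this list report odds ratios and "genome-wide significant" P values. As a rough illustration of what those quantities mean, the sketch below computes an allelic odds ratio with a Wald 95% confidence interval from a 2×2 table of allele counts and checks a P value against the conventional 5 × 10(-8) threshold. The allele counts are invented for the example, and the function names are assumptions; the studies themselves use meta-analytic models rather than a single 2×2 table.

```python
import math

# Conventional genome-wide significance threshold for common-variant GWAS.
GENOME_WIDE_ALPHA = 5e-8


def allelic_odds_ratio(case_risk, case_other, ctrl_risk, ctrl_other, z=1.96):
    """Odds ratio and Wald confidence interval from allele counts."""
    odds_ratio = (case_risk * ctrl_other) / (case_other * ctrl_risk)
    # Standard error of log(OR) for a 2x2 table.
    se = math.sqrt(
        1 / case_risk + 1 / case_other + 1 / ctrl_risk + 1 / ctrl_other
    )
    lower = math.exp(math.log(odds_ratio) - z * se)
    upper = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, lower, upper


def is_genome_wide_significant(p_value, alpha=GENOME_WIDE_ALPHA):
    """True if the P value passes the genome-wide threshold."""
    return p_value < alpha


# Hypothetical allele counts: risk/other allele in cases, then controls.
odds_ratio, lower, upper = allelic_odds_ratio(300, 700, 200, 800)
# P values as reported in the abstract for rs9568856 and the follow-up cutoff.
hit = is_genome_wide_significant(1.82e-9)
follow_up_only = is_genome_wide_significant(5e-6)
```

An odds ratio of 1.22, as reported near OLFM4, corresponds to each copy of the risk allele raising the odds of being a case by about 22% under this simple allelic model.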
9.
10.
Roest Crollius H Jaillon O Bernot A Dasilva C Bouneau L Fischer C Fizames C Wincker P Brottier P Quétier F Saurin W Weissenbach J 《Nature genetics》2000,25(2):235-238
The number of genes in the human genome is unknown, with estimates ranging from 50,000 to 90,000 (refs 1, 2), and to more than 140,000 according to unpublished sources. We have developed 'Exofish', a procedure based on homology searches, to identify human genes quickly and reliably. This method relies on the sequence of another vertebrate, the pufferfish Tetraodon nigroviridis, to detect conserved sequences with a very low background. Similar to Fugu rubripes, a marine pufferfish proposed by Brenner et al. as a model for genomic studies, T. nigroviridis is a more practical alternative with a genome also eight times more compact than that of human. Many comparisons have been made between F. rubripes and human DNA that demonstrate the potential of comparative genomics using the pufferfish genome. Application of Exofish to the December version of the working draft sequence of the human genome and to Unigene showed that the human genome contains 28,000-34,000 genes, and that Unigene contains less than 40% of the protein-coding fraction of the human genome.
11.
Suhre K Wallaschofski H Raffler J Friedrich N Haring R Michael K Wasner C Krebs A Kronenberg F Chang D Meisinger C Wichmann HE Hoffmann W Völzke H Völker U Teumer A Biffar R Kocher T Felix SB Illig T Kroemer HK Gieger C Römisch-Margl W Nauck M 《Nature genetics》2011,43(6):565-569
We present a genome-wide association study of metabolic traits in human urine, designed to investigate the detoxification capacity of the human body. Using NMR spectroscopy, we tested for associations between 59 metabolites in urine from 862 male participants in the population-based SHIP study. We replicated the results using 1,039 additional samples of the same study, including a 5-year follow-up, and 992 samples from the independent KORA study. We report five loci with joint P values of association from 3.2 × 10(-19) to 2.1 × 10(-182). Variants at three of these loci have previously been linked with important clinical outcomes: SLC7A9 is a risk locus for chronic kidney disease, NAT2 for coronary artery disease and genotype-dependent response to drug toxicity, and SLC6A20 for iminoglycinuria. Moreover, we identify rs37369 in AGXT2 as the genetic basis of hyper-β-aminoisobutyric aciduria.
12.
Turnbull C Perdeaux ER Pernet D Naranjo A Renwick A Seal S Munoz-Xicola RM Hanks S Slade I Zachariou A Warren-Perry M Ruark E Gerrard M Hale J Hewitt M Kohler J Lane S Levitt G Madi M Morland B Neefjes V Nicholdson J Picton S Pizer B Ronghe M Stevens M Traunecker H Stiller CA Pritchard-Jones K Dome J Grundy P Rahman N 《Nature genetics》2012,44(6):681-684
Wilms tumor is the most common renal malignancy of childhood. To identify common variants that confer susceptibility to Wilms tumor, we conducted a genome-wide association study in 757 individuals with Wilms tumor (cases) and 1,879 controls. We evaluated ten SNPs in regions significantly associated at P < 5 × 10(-5) in two independent replication series from the UK (769 cases and 2,814 controls) and the United States (719 cases and 1,037 controls). We identified clear significant associations at 2p24 (rs3755132, P = 1.03 × 10(-14); rs807624, P = 1.32 × 10(-14)) and 11q14 (rs790356, P = 4.25 × 10(-15)). Both regions contain genes that are plausibly related to Wilms tumorigenesis. We also identified candidate association signals at 5q14, 22q12 and Xp22.
13.
Di Bernardo MC Crowther-Swanepoel D Broderick P Webb E Sellick G Wild R Sullivan K Vijayakrishnan J Wang Y Pittman AM Sunter NJ Hall AG Dyer MJ Matutes E Dearden C Mainou-Fowler T Jackson GH Summerfield G Harris RJ Pettitt AR Hillmen P Allsup DJ Bailey JR Pratt G Pepper C Fegan C Allan JM Catovsky D Houlston RS 《Nature genetics》2008,40(10):1204-1210
We conducted a genome-wide association study of 299,983 tagging SNPs for chronic lymphocytic leukemia (CLL) and performed validation in two additional series totaling 1,529 cases and 3,115 controls. We identified six previously unreported CLL risk loci at 2q13 (rs17483466; P = 2.36 x 10(-10)), 2q37.1 (rs13397985, SP140; P = 5.40 x 10(-10)), 6p25.3 (rs872071, IRF4; P = 1.91 x 10(-20)), 11q24.1 (rs735665; P = 3.78 x 10(-12)), 15q23 (rs7176508; P = 4.54 x 10(-12)) and 19q13.32 (rs11083846, PRKD2; P = 3.96 x 10(-9)). These data provide the first evidence for the existence of common, low-penetrance susceptibility to a hematological malignancy and new insights into disease causation in CLL.
14.
Onouchi Y Ozaki K Burns JC Shimizu C Terai M Hamada H Honda T Suzuki H Suenaga T Takeuchi T Yoshikawa N Suzuki Y Yasukawa K Ebata R Higashi K Saji T Kemmotsu Y Takatsuki S Ouchi K Kishi F Yoshikawa T Nagai T Hamamoto K Sato Y Honda A Kobayashi H Sato J Shibuta S Miyawaki M Oishi K Yamaga H Aoyagi N Iwahashi S Miyashita R Murata Y Sasago K Takahashi A Kamatani N Kubo M Tsunoda T Hata A Nakamura Y Tanaka T;Japan Kawasaki Disease Genome Consortium;US Kawasaki Disease Genetics Consortium 《Nature genetics》2012,44(5):517-521
We performed a genome-wide association study (GWAS) of Kawasaki disease in Japanese subjects using data from 428 individuals with Kawasaki disease (cases) and 3,379 controls genotyped at 473,803 SNPs. We validated the association results in two independent replication panels totaling 754 cases and 947 controls. We observed significant associations in the FAM167A-BLK region at 8p22-23 (rs2254546, P = 8.2 × 10(-21)), in the human leukocyte antigen (HLA) region at 6p21.3 (rs2857151, P = 4.6 × 10(-11)) and in the CD40 region at 20q13 (rs4813003, P = 4.8 × 10(-8)). We also replicated the association of a functional SNP of FCGR2A (rs1801274, P = 1.6 × 10(-6)) identified in a recently reported GWAS of Kawasaki disease. Our findings provide new insights into the pathogenesis and pathophysiology of Kawasaki disease.
15.
A new multipoint method for genome-wide association studies by imputation of genotypes Cited by: 23 (self-citations: 0, citations by others: 23)
Genome-wide association studies are set to become the method of choice for uncovering the genetic basis of human diseases. A central challenge in this area is the development of powerful multipoint methods that can detect causal variants that have not been directly genotyped. We propose a coherent analysis framework that treats the problem as one involving missing or uncertain genotypes. Central to our approach is a model-based imputation method for inferring genotypes at observed or unobserved SNPs, leading to improved power over existing methods for multipoint association mapping. Using real genome-wide association study data, we show that our approach (i) is accurate and well calibrated, (ii) provides detailed views of associated regions that facilitate follow-up studies and (iii) can be used to validate and correct data at genotyped markers. A notable future use of our method will be to boost power by combining data from genome-wide scans that use different SNP sets.
16.
Rothman N Garcia-Closas M Chatterjee N Malats N Wu X Figueroa JD Real FX Van Den Berg D Matullo G Baris D Thun M Kiemeney LA Vineis P De Vivo I Albanes D Purdue MP Rafnar T Hildebrandt MA Kiltie AE Cussenot O Golka K Kumar R Taylor JA Mayordomo JI Jacobs KB Kogevinas M Hutchinson A Wang Z Fu YP Prokunina-Olsson L Burdett L Yeager M Wheeler W Tardón A Serra C Carrato A García-Closas R Lloreta J Johnson A Schwenn M Karagas MR Schned A Andriole G Grubb R Black A Jacobs EJ Diver WR Gapstur SM 《Nature genetics》2010,42(11):978-984
We conducted a multi-stage, genome-wide association study of bladder cancer with a primary scan of 591,637 SNPs in 3,532 affected individuals (cases) and 5,120 controls of European descent from five studies followed by a replication strategy, which included 8,382 cases and 48,275 controls from 16 studies. In a combined analysis, we identified three new regions associated with bladder cancer on chromosomes 22q13.1, 19q12 and 2q37.1: rs1014971 (P = 8 × 10(-12)) maps to a non-genic region of chromosome 22q13.1, rs8102137 (P = 2 × 10(-11)) on 19q12 maps to CCNE1 and rs11892031 (P = 1 × 10(-7)) maps to the UGT1A cluster on 2q37.1. We confirmed four previously identified genome-wide associations on chromosomes 3q28, 4p16.3, 8q24.21 and 8q24.3, validated previous candidate associations for the GSTM1 deletion (P = 4 × 10(-11)) and a tag SNP for NAT2 acetylation status (P = 4 × 10(-11)), and found interactions with smoking in both regions. Our findings on common variants associated with bladder cancer risk should provide new insights into the mechanisms of carcinogenesis.
17.
Chu X Pan CM Zhao SX Liang J Gao GQ Zhang XM Yuan GY Li CG Xue LQ Shen M Liu W Xie F Yang SY Wang HF Shi JY Sun WW Du WH Zuo CL Shi JX Liu BL Guo CC Zhan M Gu ZH Zhang XN Sun F Wang ZQ Song ZY Zou CY Sun WH Guo T Cao HM Ma JH Han B Li P Jiang H Huang QH Liang L Liu LB Chen G Su Q Peng YD Zhao JJ Ning G Chen Z Chen JL Chen SJ Huang W Song HD;China Consortium for Genetics of Autoimmune Thyroid Disease 《Nature genetics》2011,43(9):897-901
Graves' disease is a common autoimmune disorder characterized by thyroid stimulating hormone receptor autoantibodies (TRAb) and hyperthyroidism. To investigate the genetic architecture of Graves' disease, we conducted a genome-wide association study in 1,536 individuals with Graves' disease (cases) and 1,516 controls. We further evaluated a group of associated SNPs in a second set of 3,994 cases and 3,510 controls. We confirmed four previously reported loci (in the major histocompatibility complex, TSHR, CTLA4 and FCRL3) and identified two new susceptibility loci (the RNASET2-FGFR1OP-CCR6 region at 6q27 (P(combined) = 6.85 × 10(-10) for rs9355610) and an intergenic region at 4p14 (P(combined) = 1.08 × 10(-13) for rs6832151)). These newly associated SNPs were correlated with the expression levels of RNASET2 at 6q27, of CHRNA9 and of a previously uncharacterized gene at 4p14, respectively. Moreover, we identified strong associations of TSHR and major histocompatibility complex class II variants with persistently TRAb-positive Graves' disease.
18.
A mixed-model approach for genome-wide association studies of correlated traits in structured populations Cited by: 1 (self-citations: 0, citations by others: 1)
Genome-wide association studies (GWAS) are a standard approach for studying the genetics of natural variation. A major concern in GWAS is the need to account for the complicated dependence structure of the data, both between loci as well as between individuals. Mixed models have emerged as a general and flexible approach for correcting for population structure in GWAS. Here, we extend this linear mixed-model approach to carry out GWAS of correlated phenotypes, deriving a fully parameterized multi-trait mixed model (MTMM) that considers both the within-trait and between-trait variance components simultaneously for multiple traits. We apply this to data from a human cohort for correlated blood lipid traits from the Northern Finland Birth Cohort 1966 and show greatly increased power to detect pleiotropic loci that affect more than one blood lipid trait. We also apply this approach to an Arabidopsis thaliana data set for flowering measurements in two different locations, identifying loci whose effect depends on the environment.
19.
Uterine fibroids are a common benign tumor of the female genital tract. We conducted a genome-wide association study in which 457,044 SNPs were analyzed in 1,607 individuals with clinically diagnosed uterine fibroids and 1,428 female controls. SNPs showing suggestive associations (P < 5 × 10(-5)) were further genotyped in 3,466 additional cases and 3,245 female controls. Three loci on chromosomes 10q24.33, 22q13.1 and 11p15.5 revealed genome-wide significant associations with uterine fibroids. The SNPs showing the most significant association in a combination analysis at each of these loci were rs7913069 (P = 8.65 × 10(-14), odds ratio (OR) = 1.47), rs12484776 (P = 2.79 × 10(-12), OR = 1.23) and rs2280543 (P = 3.82 × 10(-12), OR = 1.39), respectively. Subsequent fine mapping of these regions will be necessary to pinpoint the causal variants. Our findings should shed light on the pathogenesis of uterine fibroids.