首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Schadt EE  Woo S  Hao K 《Nature genetics》2012,44(5):603-608
RNA profiling can be used to capture the expression patterns of many genes that are associated with expression quantitative trait loci (eQTLs). Employing published putative cis eQTLs, we developed a Bayesian approach to predict SNP genotypes that is based only on RNA expression data. We show that predicted genotypes can accurately and uniquely identify individuals in large populations. When inferring genotypes from an expression data set using eQTLs of the same tissue type (but from an independent cohort), we were able to resolve 99% of the identities of individuals in the cohort at P(adjusted) ≤ 1 × 10(-5). When eQTLs derived from one tissue were used to predict genotypes using expression data from a different tissue, the identities of 90% of the study subjects could be resolved at P(adjusted) ≤ 1 × 10(-5). We discuss the implications of deriving genotypic information from RNA data deposited in the public domain.  相似文献   

2.
Accurate and complete measurement of single nucleotide (SNP) and copy number (CNV) variants, both common and rare, will be required to understand the role of genetic variation in disease. We present Birdsuite, a four-stage analytical framework instantiated in software for deriving integrated and mutually consistent copy number and SNP genotypes. The method sequentially assigns copy number across regions of common copy number polymorphisms (CNPs), calls genotypes of SNPs, identifies rare CNVs via a hidden Markov model (HMM), and generates an integrated sequence and copy number genotype at every locus (for example, including genotypes such as A-null, AAB and BBB in addition to AA, AB and BB calls). Such genotypes more accurately depict the underlying sequence of each individual, reducing the rate of apparent mendelian inconsistencies. The Birdsuite software is applied here to data from the Affymetrix SNP 6.0 array. Additionally, we describe a method, implemented in PLINK, to utilize these combined SNP and CNV genotypes for association testing with a phenotype.  相似文献   

3.
4.
A high-resolution survey of deletion polymorphism in the human genome   总被引:20,自引:0,他引:20  
Recent work has shown that copy number polymorphism is an important class of genetic variation in human genomes. Here we report a new method that uses SNP genotype data from parent-offspring trios to identify polymorphic deletions. We applied this method to data from the International HapMap Project to produce the first high-resolution population surveys of deletion polymorphism. Approximately 100 of these deletions have been experimentally validated using comparative genome hybridization on tiling-resolution oligonucleotide microarrays. Our analysis identifies a total of 586 distinct regions that harbor deletion polymorphisms in one or more of the families. Notably, we estimate that typical individuals are hemizygous for roughly 30-50 deletions larger than 5 kb, totaling around 550-750 kb of euchromatic sequence across their genomes. The detected deletions span a total of 267 known and predicted genes. Overall, however, the deleted regions are relatively gene-poor, consistent with the action of purifying selection against deletions. Deletion polymorphisms may well have an important role in the genetics of complex traits; however, they are not directly observed in most current gene mapping studies. Our new method will permit the identification of deletion polymorphisms in high-density SNP surveys of trio or other family data.  相似文献   

5.
Autism spectrum disorders (ASDs) are common, heritable neurodevelopmental conditions. The genetic architecture of ASDs is complex, requiring large samples to overcome heterogeneity. Here we broaden coverage and sample size relative to other studies of ASDs by using Affymetrix 10K SNP arrays and 1,181 [corrected] families with at least two affected individuals, performing the largest linkage scan to date while also analyzing copy number variation in these families. Linkage and copy number variation analyses implicate chromosome 11p12-p13 and neurexins, respectively, among other candidate loci. Neurexins team with previously implicated neuroligins for glutamatergic synaptogenesis, highlighting glutamate-related genes as promising candidates for contributing to ASDs.  相似文献   

6.
Genome-wide association studies are set to become the method of choice for uncovering the genetic basis of human diseases. A central challenge in this area is the development of powerful multipoint methods that can detect causal variants that have not been directly genotyped. We propose a coherent analysis framework that treats the problem as one involving missing or uncertain genotypes. Central to our approach is a model-based imputation method for inferring genotypes at observed or unobserved SNPs, leading to improved power over existing methods for multipoint association mapping. Using real genome-wide association study data, we show that our approach (i) is accurate and well calibrated, (ii) provides detailed views of associated regions that facilitate follow-up studies and (iii) can be used to validate and correct data at genotyped markers. A notable future use of our method will be to boost power by combining data from genome-wide scans that use different SNP sets.  相似文献   

7.
Characterizing genetic diversity within and between populations has broad applications in studies of human disease and evolution. We propose a new approach, spatial ancestry analysis, for the modeling of genotypes in two- or three-dimensional space. In spatial ancestry analysis (SPA), we explicitly model the spatial distribution of each SNP by assigning an allele frequency as a continuous function in geographic space. We show that the explicit modeling of the allele frequency allows individuals to be localized on the map on the basis of their genetic information alone. We apply our SPA method to a European and a worldwide population genetic variation data set and identify SNPs showing large gradients in allele frequency, and we suggest these as candidate regions under selection. These regions include SNPs in the well-characterized LCT region, as well as at loci including FOXP2, OCA2 and LRP1B.  相似文献   

8.
Gaut B 《Nature genetics》2012,44(2):115-116
A new study reports SNP genotypes of over 1,300 Arabidopsis thaliana accessions from throughout Eurasia, providing a resource for genome-wide association studies and studies of local adaptation. The extensive data are also used to identify targets of natural selection and to describe genome-wide patterns of recombination.  相似文献   

9.
Chanock SJ 《Nature genetics》2011,43(3):178-179
A new study uses genome-wide SNP genotypes to identify a subset of children undergoing therapy for acute lymphoblastic leukemia that are at increased risk for relapse. Borrowing from the classical approach of admixture mapping, the work shows how genome-wide assessment of genetic ancestry can be used as a biomarker for disease outcome.  相似文献   

10.
Nested association mapping (NAM) offers power to resolve complex, quantitative traits to their causal loci. The maize NAM population, consisting of 5,000 recombinant inbred lines (RILs) from 25 families representing the global diversity of maize, was evaluated for resistance to southern leaf blight (SLB) disease. Joint-linkage analysis identified 32 quantitative trait loci (QTLs) with predominantly small, additive effects on SLB resistance. Genome-wide association tests of maize HapMap SNPs were conducted by imputing founder SNP genotypes onto the NAM RILs. SNPs both within and outside of QTL intervals were associated with variation for SLB resistance. Many of these SNPs were within or near sequences homologous to genes previously shown to be involved in plant disease resistance. Limited linkage disequilibrium was observed around some SNPs associated with SLB resistance, indicating that the maize NAM population enables high-resolution mapping of some genome regions.  相似文献   

11.
Dissecting the genetic basis of disease risk requires measuring all forms of genetic variation, including SNPs and copy number variants (CNVs), and is enabled by accurate maps of their locations, frequencies and population-genetic properties. We designed a hybrid genotyping array (Affymetrix SNP 6.0) to simultaneously measure 906,600 SNPs and copy number at 1.8 million genomic locations. By characterizing 270 HapMap samples, we developed a map of human CNV (at 2-kb breakpoint resolution) informed by integer genotypes for 1,320 copy number polymorphisms (CNPs) that segregate at an allele frequency >1%. More than 80% of the sequence in previously reported CNV regions fell outside our estimated CNV boundaries, indicating that large (>100 kb) CNVs affect much less of the genome than initially reported. Approximately 80% of observed copy number differences between pairs of individuals were due to common CNPs with an allele frequency >5%, and more than 99% derived from inheritance rather than new mutation. Most common, diallelic CNPs were in strong linkage disequilibrium with SNPs, and most low-frequency CNVs segregated on specific SNP haplotypes.  相似文献   

12.
The detection of sequence variation, for which DNA sequencing has emerged as the most sensitive and automated approach, forms the basis of all genetic analysis. Here we describe and illustrate an algorithm that accurately detects and genotypes SNPs from fluorescence-based sequence data. Because the algorithm focuses particularly on detecting SNPs through the identification of heterozygous individuals, it is especially well suited to the detection of SNPs in diploid samples obtained after DNA amplification. It is substantially more accurate than existing approaches and, notably, provides a useful quantitative measure of its confidence in each potential SNP detected and in each genotype called. Calls assigned the highest confidence are sufficiently reliable to remove the need for manual review in several contexts. For example, for sequence data from 47-90 individuals sequenced on both the forward and reverse strands, the highest-confidence calls from our algorithm detected 93% of all SNPs and 100% of high-frequency SNPs, with no false positive SNPs identified and 99.9% genotyping accuracy. This algorithm is implemented in a software package, PolyPhred version 5.0, which is freely available for academic use.  相似文献   

13.
Genome-wide association studies (GWAS) have proven to be a powerful method to identify common genetic variants contributing to susceptibility to common diseases. Here, we show that extremely low-coverage sequencing (0.1-0.5×) captures almost as much of the common (>5%) and low-frequency (1-5%) variation across the genome as SNP arrays. As an empirical demonstration, we show that genome-wide SNP genotypes can be inferred at a mean r(2) of 0.71 using off-target data (0.24× average coverage) in a whole-exome study of 909 samples. Using both simulated and real exome-sequencing data sets, we show that association statistics obtained using extremely low-coverage sequencing data attain similar P values at known associated variants as data from genotyping arrays, without an excess of false positives. Within the context of reductions in sample preparation and sequencing costs, funds invested in extremely low-coverage sequencing can yield several times the effective sample size of GWAS based on SNP array data and a commensurate increase in statistical power.  相似文献   

14.
Genetic susceptibility to multiple sclerosis is associated with genes of the major histocompatibility complex (MHC), particularly HLA-DRB1 and HLA-DQB1 (ref. 1). Both locus and allelic heterogeneity have been reported in this genomic region. To clarify whether HLA-DRB1 itself, nearby genes in the region encoding the MHC or combinations of these loci underlie susceptibility to multiple sclerosis, we genotyped 1,185 Canadian and Finnish families with multiple sclerosis (n = 4,203 individuals) with a high-density SNP panel spanning the genes encoding the MHC and flanking genomic regions. Strong associations in Canadian and Finnish samples were observed with blocks in the HLA class II genomic region (P < 4.9 x 10(-13) and P < 2.0 x 10(-16), respectively), but the strongest association was with HLA-DRB1 (P < 4.4 x 10(-17)). Conditioning on either HLA-DRB1 or the most significant HLA class II haplotype block found no additional block or SNP association independent of the HLA class II genomic region. This study therefore indicates that MHC-associated susceptibility to multiple sclerosis is determined by HLA class II alleles, their interactions and closely neighboring variants.  相似文献   

15.
SNP genotyping has emerged as a technology to incorporate copy number variants (CNVs) into genetic analyses of human traits. However, the extent to which SNP platforms accurately capture CNVs remains unclear. Using independent, sequence-based CNV maps, we find that commonly used SNP platforms have limited or no probe coverage for a large fraction of CNVs. Despite this, in 9 samples we inferred 368 CNVs using Illumina SNP genotyping data and experimentally validated over two-thirds of these. We also developed a method (SNP-Conditional Mixture Modeling, SCIMM) to robustly genotype deletions using as few as two SNP probes. We find that HapMap SNPs are strongly correlated with 82% of common deletions, but the newest SNP platforms effectively tag about 50%. We conclude that currently available genome-wide SNP assays can capture CNVs accurately, but improvements in array designs, particularly in duplicated sequences, are necessary to facilitate more comprehensive analyses of genomic variation.  相似文献   

16.
To identify susceptibility alleles associated with rheumatoid arthritis, we genotyped 397 individuals with rheumatoid arthritis for 116,204 SNPs and carried out an association analysis in comparison to publicly available genotype data for 1,211 related individuals from the Framingham Heart Study. After evaluating and adjusting for technical and population biases, we identified a SNP at 6q23 (rs10499194, approximately 150 kb from TNFAIP3 and OLIG3) that was reproducibly associated with rheumatoid arthritis both in the genome-wide association (GWA) scan and in 5,541 additional case-control samples (P = 10(-3), GWA scan; P < 10(-6), replication; P = 10(-9), combined). In a concurrent study, the Wellcome Trust Case Control Consortium (WTCCC) has reported strong association of rheumatoid arthritis susceptibility to a different SNP located 3.8 kb from rs10499194 (rs6920220; P = 5 x 10(-6) in WTCCC). We show that these two SNP associations are statistically independent, are each reproducible in the comparison of our data and WTCCC data, and define risk and protective haplotypes for rheumatoid arthritis at 6q23.  相似文献   

17.
After imputation of data from the 1000 Genomes Project into a genome-wide dataset of Ghanaian individuals with tuberculosis and controls, we identified a resistance locus on chromosome 11p13 downstream of the WT1 gene (encoding Wilms tumor 1). The strongest signal was obtained at the rs2057178 SNP (P = 2.63 × 10(-9)). Replication in Gambian, Indonesian and Russian tuberculosis case-control study cohorts increased the significance level for the association with this SNP to P = 2.57 × 10(-11).  相似文献   

18.
Metformin is the most commonly used pharmacological therapy for type 2 diabetes. We report a genome-wide association study for glycemic response to metformin in 1,024 Scottish individuals with type 2 diabetes with replication in two cohorts including 1,783 Scottish individuals and 1,113 individuals from the UK Prospective Diabetes Study. In a combined meta-analysis, we identified a SNP, rs11212617, associated with treatment success (n = 3,920, P = 2.9 × 10(-9), odds ratio = 1.35, 95% CI 1.22-1.49) at a locus containing ATM, the ataxia telangiectasia mutated gene. In a rat hepatoma cell line, inhibition of ATM with KU-55933 attenuated the phosphorylation and activation of AMP-activated protein kinase in response to metformin. We conclude that ATM, a gene known to be involved in DNA repair and cell cycle control, plays a role in the effect of metformin upstream of AMP-activated protein kinase, and variation in this gene alters glycemic response to metformin.  相似文献   

19.
The high-incidence erythrocyte blood group antigen Jr(a) has been known in transfusion medicine for over 40 years. To identify the gene encoding Jr(a), we performed SNP analysis of genomic DNA from six Jr(a-) individuals. All individuals shared a homozygous region of 397,000 bp at chromosome 4q22.1 that contained the gene ABCG2, and DNA sequence analysis showed that ABCG2 null alleles define the Jr(a-) phenotype.  相似文献   

20.
We report a genome-wide association study for melanoma that was conducted by the GenoMEL Consortium. Our discovery phase included 2,981 individuals with melanoma and 1,982 study-specific control individuals of European ancestry, as well as an additional 6,426 control subjects from French or British populations, all of whom were genotyped for 317,000 or 610,000 single-nucleotide polymorphisms (SNPs). Our analysis replicated previously known melanoma susceptibility loci. Seven new regions with at least one SNP with P < 10(-5) and further local imputed or genotyped support were selected for replication using two other genome-wide studies (from Australia and Texas, USA). Additional replication came from case-control series from the UK and The Netherlands. Variants at three of the seven loci replicated at P < 10(-3): an SNP in ATM (rs1801516, overall P = 3.4 × 10(-9)), an SNP in MX2 (rs45430, P = 2.9 × 10(-9)) and an SNP adjacent to CASP8 (rs13016963, P = 8.6 × 10(-10)). A fourth locus near CCND1 remains of potential interest, showing suggestive but inconclusive evidence of replication (rs1485993, overall P = 4.6 × 10(-7) under a fixed-effects model and P = 1.2 × 10(-3) under a random-effects model). These newly associated variants showed no association with nevus or pigmentation phenotypes in a large British case-control series.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号