首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
More than 5 million single-nucleotide polymorphisms (SNPs) with minor-allele frequency greater than 10% are expected to exist in the human genome. Some of these SNPs may be associated with risk of developing common diseases. To assess the power of currently available SNPs to detect such associations, we resequenced 50 genes in two ethnic samples and measured patterns of linkage disequilibrium between the subset of SNPs reported in dbSNP and the complete set of common SNPs. Our results suggest that using all 2.7 million SNPs currently in the database would detect nearly 80% of all common SNPs in European populations but only 50% of those common in the African American population and that efficient selection of a minimal subset of SNPs for use in association studies requires measurement of allele frequency and linkage disequilibrium relationships for all SNPs in dbSNP.  相似文献   

2.
Bayesian inference of epistatic interactions in case-control studies   总被引:1,自引:0,他引:1  
Zhang Y  Liu JS 《Nature genetics》2007,39(9):1167-1173
Epistatic interactions among multiple genetic variants in the human genome may be important in determining individual susceptibility to common diseases. Although some existing computational methods for identifying genetic interactions have been effective for small-scale studies, we here propose a method, denoted 'bayesian epistasis association mapping' (BEAM), for genome-wide case-control studies. BEAM treats the disease-associated markers and their interactions via a bayesian partitioning model and computes, via Markov chain Monte Carlo, the posterior probability that each marker set is associated with the disease. Testing this on an age-related macular degeneration genome-wide association data set, we demonstrate that the method is significantly more powerful than existing approaches and that genome-wide case-control epistasis mapping with many thousands of markers is both computationally and statistically feasible.  相似文献   

3.
Humans show great variation in phenotypic traits such as height, eye color and susceptibility to disease. Genomic DNA sequence differences among individuals are responsible for the inherited components of these complex traits. Reports suggest that intermediate and large-scale DNA copy number and structural variations are prevalent enough to be an important source of genetic variation between individuals. Because association studies to identify genomic loci associated with particular phenotypic traits have focused primarily on genotyping SNPs, it is important to determine whether common structural polymorphisms are in linkage disequilibrium with common SNPs, and thus can be assessed indirectly in SNP-based studies. Here we examine 100 deletion polymorphisms ranging from 70 bp to 7 kb. We show that common deletions and SNPs ascertained with similar criteria have essentially the same distribution of linkage disequilibrium with surrounding SNPs, indicating that these polymorphisms may share evolutionary history and that most deletion polymorphisms are effectively assayed by proxy in SNP-based association studies.  相似文献   

4.
Emerging technologies make it possible for the first time to genotype hundreds of thousands of SNPs simultaneously, enabling whole-genome association studies. Using empirical genotype data from the International HapMap Project, we evaluate the extent to which the sets of SNPs contained on three whole-genome genotyping arrays capture common SNPs across the genome, and we find that the majority of common SNPs are well captured by these products either directly or through linkage disequilibrium. We explore analytical strategies that use HapMap data to improve power of association studies conducted with these fixed sets of markers and show that limited inclusion of specific haplotype tests in association analysis can increase the fraction of common variants captured by 25-100%. Finally, we introduce a Bayesian approach to association analysis by weighting the likelihood of each statistical test to reflect the number of putative causal alleles to which it is correlated.  相似文献   

5.
The Genetic Association Information Network (GAIN) is a public-private partnership established to investigate the genetic basis of common diseases through a series of collaborative genome-wide association studies. GAIN has used new approaches for project selection, data deposition and distribution, collaborative analysis, publication and protection from premature intellectual property claims. These demonstrate a new commitment to shared scientific knowledge that should facilitate rapid advances in understanding the genetics of complex diseases.  相似文献   

6.
The proteins encoded by the classical HLA class I and class II genes in the major histocompatibility complex (MHC) are highly polymorphic and are essential in self versus non-self immune recognition. HLA variation is a crucial determinant of transplant rejection and susceptibility to a large number of infectious and autoimmune diseases. Yet identification of causal variants is problematic owing to linkage disequilibrium that extends across multiple HLA and non-HLA genes in the MHC. We therefore set out to characterize the linkage disequilibrium patterns between the highly polymorphic HLA genes and background variation by typing the classical HLA genes and >7,500 common SNPs and deletion-insertion polymorphisms across four population samples. The analysis provides informative tag SNPs that capture much of the common variation in the MHC region and that could be used in disease association studies, and it provides new insight into the evolutionary dynamics and ancestral origins of the HLA loci and their haplotypes.  相似文献   

7.
The effects of human population structure on large genetic association studies   总被引:21,自引:0,他引:21  
Large-scale association studies hold substantial promise for unraveling the genetic basis of common human diseases. A well-known problem with such studies is the presence of undetected population structure, which can lead to both false positive results and failures to detect genuine associations. Here we examine approximately 15,000 genome-wide single-nucleotide polymorphisms typed in three population groups to assess the consequences of population structure on the coming generation of association studies. The consequences of population structure on association outcomes increase markedly with sample size. For the size of study needed to detect typical genetic effects in common diseases, even the modest levels of population structure within population groups cannot safely be ignored. We also examine one method for correcting for population structure (Genomic Control). Although it often performs well, it may not correct for structure if too few loci are used and may overcorrect in other settings, leading to substantial loss of power. The results of our analysis can guide the design of large-scale association studies.  相似文献   

8.
A general question for linkage disequilibrium-based association studies is how power to detect an association is compromised when tag SNPs are chosen from data in one population sample and then deployed in another sample. Specifically, it is important to know how well tags picked from the HapMap DNA samples capture the variation in other samples. To address this, we collected dense data uniformly across the four HapMap population samples and eleven other population samples. We picked tag SNPs using genotype data we collected in the HapMap samples and then evaluated the effective coverage of these tags in comparison to the entire set of common variants observed in the other samples. We simulated case-control association studies in the non-HapMap samples under a disease model of modest risk, and we observed little loss in power. These results demonstrate that the HapMap DNA samples can be used to select tags for genome-wide association studies in many samples around the world.  相似文献   

9.
Genetic variation in DLG5 is associated with inflammatory bowel disease   总被引:22,自引:0,他引:22  
Crohn disease and ulcerative colitis are two subphenotypes of inflammatory bowel disease (IBD), a complex disorder resulting from gene-environment interaction. We refined our previously defined linkage region for IBD on chromosome 10q23 and used positional cloning to identify genetic variants in DLG5 associated with IBD. DLG5 encodes a scaffolding protein involved in the maintenance of epithelial integrity. We identified two distinct haplotypes with a replicable distortion in transmission (P = 0.000023 and P = 0.004 for association with IBD, P = 0.00012 and P = 0.04 for association with Crohn disease). One of the risk-associated DLG5 haplotypes is distinguished from the common haplotype by a nonsynonymous single-nucleotide polymorphism 113G-->A, resulting in the amino acid substitution R30Q in the DUF622 domain of DLG5. This mutation probably impedes scaffolding of DLG5. We stratified the study sample according to the presence of risk-associated CARD15 variants to study potential gene-gene interaction. We found a significant difference in association of the 113A DLG5 variant with Crohn disease in affected individuals carrying the risk-associated CARD15 alleles versus those carrying non-risk-associated CARD15 alleles. This is suggestive of a complex pattern of gene-gene interaction between DLG5 and CARD15, reflecting the complex nature of polygenic diseases. Further functional studies will evaluate the biological significance of DLG5 variants.  相似文献   

10.
Population choice in mapping genes for complex diseases   总被引:24,自引:0,他引:24  
The difficulty of identifying susceptibility genes for common diseases has polarized geneticists' views on what disease models are appropriate and how best to proceed once high-density genome maps become available. Different disease models have different implications for using linkage or linkage-disequilibrium-based approaches for mapping complex disease genes. We argue that the choice of study population is a critical factor when designing a study, and that genetically simplified isolates are more useful than diverse continental populations under most assumptions.  相似文献   

11.
Copy number variation (CNV) is pervasive in the human genome and can play a causal role in genetic diseases. The functional impact of CNV cannot be fully captured through linkage disequilibrium with SNPs. These observations motivate the development of statistical methods for performing direct CNV association studies. We show through simulation that current tests for CNV association are prone to false-positive associations in the presence of differential errors between cases and controls, especially if quantitative CNV measurements are noisy. We present a statistical framework for performing case-control CNV association studies that applies likelihood ratio testing of quantitative CNV measurements in cases and controls. We show that our methods are robust to differential errors and noisy data and can achieve maximal theoretical power. We illustrate the power of these methods for testing for association with binary and quantitative traits, and have made this software available as the R package CNVtools.  相似文献   

12.
13.
Genetic studies of Hirschsprung disease, a common congenital malformation, have identified eight genes with mutations that can be associated with this condition. Mutations at individual loci are, however, neither necessary nor sufficient to cause clinical disease. We conducted a genome-wide association study in 43 Mennonite family trios using 2,083 microsatellites and single-nucleotide polymorphisms and a new multipoint linkage disequilibrium method that searches for association arising from common ancestry. We identified susceptibility loci at 10q11, 13q22 and 16q23; the gene at 13q22 is EDNRB, encoding a G protein-coupled receptor (GPCR) and the gene at 10q11 is RET, encoding a receptor tyrosine kinase (RTK). Statistically significant joint transmission of RET and EDNRB alleles in affected individuals and non-complementation of aganglionosis in mouse intercrosses between Ret null and the Ednrb hypomorphic piebald allele are suggestive of epistasis between EDNRB and RET. Thus, genetic interaction between mutations in RET and EDNRB is an underlying mechanism for this complex disorder.  相似文献   

14.
Using variants from the 1000 Genomes Project pilot European CEU dataset and data from additional resequencing studies, we densely genotyped 183 non-HLA risk loci previously associated with immune-mediated diseases in 12,041 individuals with celiac disease (cases) and 12,228 controls. We identified 13 new celiac disease risk loci reaching genome-wide significance, bringing the number of known loci (including the HLA locus) to 40. We found multiple independent association signals at over one-third of these loci, a finding that is attributable to a combination of common, low-frequency and rare genetic variants. Compared to previously available data such as those from HapMap3, our dense genotyping in a large sample collection provided a higher resolution of the pattern of linkage disequilibrium and suggested localization of many signals to finer scale regions. In particular, 29 of the 54 fine-mapped signals seemed to be localized to single genes and, in some instances, to gene regulatory elements. Altogether, we define the complex genetic architecture of the risk regions of and refine the risk signals for celiac disease, providing the next step toward uncovering the causal mechanisms of the disease.  相似文献   

15.
In an effort to pinpoint potential genetic risk factors for schizophrenia, research groups worldwide have published over 1,000 genetic association studies with largely inconsistent results. To facilitate the interpretation of these findings, we have created a regularly updated online database of all published genetic association studies for schizophrenia ('SzGene'). For all polymorphisms having genotype data available in at least four independent case-control samples, we systematically carried out random-effects meta-analyses using allelic contrasts. Across 118 meta-analyses, a total of 24 genetic variants in 16 different genes (APOE, COMT, DAO, DRD1, DRD2, DRD4, DTNBP1, GABRB2, GRIN2B, HP, IL1B, MTHFR, PLXNA2, SLC6A4, TP53 and TPH1) showed nominally significant effects with average summary odds ratios of approximately 1.23. Seven of these variants had not been previously meta-analyzed. According to recently proposed criteria for the assessment of cumulative evidence in genetic association studies, four of the significant results can be characterized as showing 'strong' epidemiological credibility. Our project represents the first comprehensive online resource for systematically synthesized and graded evidence of genetic association studies in schizophrenia. As such, it could serve as a model for field synopses of genetic associations in other common and genetically complex disorders.  相似文献   

16.
Age-related macular degeneration (AMD) is a common, late-onset disease with seemingly typical complexity: recurrence ratios for siblings of an affected individual are three- to sixfold higher than in the general population, and family-based analysis has resulted in only modestly significant evidence for linkage. In a case-control study drawn from a US-based population of European descent, we have identified a previously unrecognized common, noncoding variant in CFH, the gene encoding complement factor H, that substantially increases the influence of this locus on AMD, and we have strongly replicated the associations of four other previously reported common alleles in three genes (P values ranging from 10(-6) to 10(-70)). Despite excellent power to detect epistasis, we observed purely additive accumulation of risk from alleles at these genes. We found no differences in association of these loci with major phenotypic categories of advanced AMD. Genotypes at these five common SNPs define a broad spectrum of interindividual disease risk and explain about half of the classical sibling risk of AMD in our study population.  相似文献   

17.
L Kruglyak 《Nature genetics》1999,22(2):139-144
Recently, attention has focused on the use of whole-genome linkage disequilibrium (LD) studies to map common disease genes. Such studies would employ a dense map of single nucleotide polymorphisms (SNPs) to detect association between a marker and disease. Construction of SNP maps is currently underway. An essential issue yet to be settled is the required marker density of such maps. Here, I use population simulations to estimate the extent of LD surrounding common gene variants in the general human population as well as in isolated populations. Two main conclusions emerge from these investigations. First, a useful level of LD is unlikely to extend beyond an average distance of roughly 3 kb in the general population, which implies that approximately 500,000 SNPs will be required for whole-genome studies. Second, the extent of LD is similar in isolated populations unless the founding bottleneck is very narrow or the frequency of the variant is low (<5%).  相似文献   

18.
Association studies offer a potentially powerful approach to identify genetic variants that influence susceptibility to common disease, but are plagued by the impression that they are not consistently reproducible. In principle, the inconsistency may be due to false positive studies, false negative studies or true variability in association among different populations. The critical question is whether false positives overwhelmingly explain the inconsistency. We analyzed 301 published studies covering 25 different reported associations. There was a large excess of studies replicating the first positive reports, inconsistent with the hypothesis of no true positive associations (P < 10(-14)). This excess of replications could not be reasonably explained by publication bias and was concentrated among 11 of the 25 associations. For 8 of these 11 associations, pooled analysis of follow-up studies yielded statistically significant replication of the first report, with modest estimated genetic effects. Thus, a sizable fraction (but under half) of reported associations have strong evidence of replication; for these, false negative, underpowered studies probably contribute to inconsistent replication. We conclude that there are probably many common variants in the human genome with modest but real effects on common disease risk, and that studies using large samples will convincingly identify such variants.  相似文献   

19.
We performed a genome-wide association scan to search for sequence variants conferring risk of prostate cancer using 1,501 Icelandic men with prostate cancer and 11,290 controls. Follow-up studies involving three additional case-control groups replicated an association of two variants on chromosome 17 with the disease. These two variants, 33 Mb apart, fall within a region previously implicated by family-based linkage studies on prostate cancer. The risks conferred by these variants are moderate individually (allele odds ratio of about 1.20), but because they are common, their joint population attributable risk is substantial. One of the variants is in TCF2 (HNF1beta), a gene known to be mutated in individuals with maturity-onset diabetes of the young type 5. Results from eight case-control groups, including one West African and one Chinese, demonstrate that this variant confers protection against type 2 diabetes.  相似文献   

20.
After nearly 10 years of intense academic and commercial research effort, large genome-wide association studies for common complex diseases are now imminent. Although these conditions involve a complex relationship between genotype and phenotype, including interactions between unlinked loci, the prevailing strategies for analysis of such studies focus on the locus-by-locus paradigm. Here we consider analytical methods that explicitly look for statistical interactions between loci. We show first that they are computationally feasible, even for studies of hundreds of thousands of loci, and second that even with a conservative correction for multiple testing, they can be more powerful than traditional analyses under a range of models for interlocus interactions. We also show that plausible variations across populations in allele frequencies among interacting loci can markedly affect the power to detect their marginal effects, which may account in part for the well-known difficulties in replicating association results. These results suggest that searching for interactions among genetic loci can be fruitfully incorporated into analysis strategies for genome-wide association studies.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号