首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 312 毫秒
1.
Whole-genome association studies are predicted to be especially powerful in isolated populations owing to increased linkage disequilibrium (LD) and decreased allelic diversity, but this possibility has not been empirically tested. We compared genome-wide data on 113,240 SNPs typed on 30 trios from the Pacific island of Kosrae to the same markers typed in the 270 samples from the International HapMap Project. The extent of LD is longer and haplotype diversity is lower in Kosrae than in the HapMap populations. More than 98% of Kosraen haplotypes are present in HapMap populations, indicating that HapMap will be useful for genetic studies on Kosrae. The long-range LD around common alleles and limited diversity result in improved efficiency in genetic studies in this population and augments the power to detect association of 'hidden SNPs'.  相似文献   

2.
Recent genomic surveys have produced high-resolution haplotype information, but only in a small number of human populations. We report haplotype structure across 12 Mb of DNA sequence in 927 individuals representing 52 populations. The geographic distribution of haplotypes reflects human history, with a loss of haplotype diversity as distance increases from Africa. Although the extent of linkage disequilibrium (LD) varies markedly across populations, considerable sharing of haplotype structure exists, and inferred recombination hotspot locations generally match across groups. The four samples in the International HapMap Project contain the majority of common haplotypes found in most populations: averaging across populations, 83% of common 20-kb haplotypes in a population are also common in the most similar HapMap sample. Consequently, although the portability of tag SNPs based on the HapMap is reduced in low-LD Africans, the HapMap will be helpful for the design of genome-wide association mapping studies in nearly all human populations.  相似文献   

3.
The genome-wide distribution of linkage disequilibrium (LD) determines the strategy for selecting markers for association studies, but it varies between populations. We assayed LD in large samples (200 individuals) from each of 11 well-described population isolates and an outbred European-derived sample, using SNP markers spaced across chromosome 22. Most isolates show substantially higher levels of LD than the outbred sample and many fewer regions of very low LD (termed 'holes'). Young isolates known to have had relatively few founders show particularly extensive LD with very few holes; these populations offer substantial advantages for genome-wide association mapping.  相似文献   

4.
Haplotype tagging for the identification of common disease genes   总被引:61,自引:0,他引:61  
Genome-wide linkage disequilibrium (LD) mapping of common disease genes could be more powerful than linkage analysis if the appropriate density of polymorphic markers were known and if the genotyping effort and cost of producing such an LD map could be reduced. Although different metrics that measure the extent of LD have been evaluated, even the most recent studies have not placed significant emphasis on the most informative and cost-effective method of LD mapping-that based on haplotypes. We have scanned 135 kb of DNA from nine genes, genotyped 122 single-nucleotide polymorphisms (SNPs; approximately 184,000 genotypes) and determined the common haplotypes in a minimum of 384 European individuals for each gene. Here we show how knowledge of the common haplotypes and the SNPs that tag them can be used to (i) explain the often complex patterns of LD between adjacent markers, (ii) reduce genotyping significantly (in this case from 122 to 34 SNPs), (iii) scan the common variation of a gene sensitively and comprehensively and (iv) provide key fine-mapping data within regions of strong LD. Our results also indicate that, at least for the genes studied here, the current version of dbSNP would have been of limited utility for LD mapping because many common haplotypes could not be defined. A directed re-sequencing effort of the approximately 10% of the genome in or near genes in the major ethnic groups would aid the systematic evaluation of the common variant model of common disease.  相似文献   

5.
The extent of linkage disequilibrium in Arabidopsis thaliana.   总被引:20,自引:0,他引:20  
Linkage disequilibrium (LD), the nonrandom occurrence of alleles in haplotypes, has long been of interest to population geneticists. Recently, the rapidly increasing availability of genomic polymorphism data has fueled interest in LD as a tool for fine-scale mapping, in particular for human disease loci. The chromosomal extent of LD is crucial in this context, because it determines how dense a map must be for associations to be detected and, conversely, limits how finely loci may be mapped. Arabidopsis thaliana is expected to harbor unusually extensive LD because of its high degree of selfing. Several polymorphism studies have found very strong LD within individual loci, but also evidence of some recombination. Here we investigate the pattern of LD on a genomic scale and show that in global samples, LD decays within approximately 1 cM, or 250 kb. We also show that LD in local populations may be much stronger than that of global populations, presumably as a result of founder events. The combination of a relatively high level of polymorphism and extensive haplotype structure bodes well for developing a genome-wide LD map in A. thaliana.  相似文献   

6.
L Kruglyak 《Nature genetics》1999,22(2):139-144
Recently, attention has focused on the use of whole-genome linkage disequilibrium (LD) studies to map common disease genes. Such studies would employ a dense map of single nucleotide polymorphisms (SNPs) to detect association between a marker and disease. Construction of SNP maps is currently underway. An essential issue yet to be settled is the required marker density of such maps. Here, I use population simulations to estimate the extent of LD surrounding common gene variants in the general human population as well as in isolated populations. Two main conclusions emerge from these investigations. First, a useful level of LD is unlikely to extend beyond an average distance of roughly 3 kb in the general population, which implies that approximately 500,000 SNPs will be required for whole-genome studies. Second, the extent of LD is similar in isolated populations unless the founding bottleneck is very narrow or the frequency of the variant is low (<5%).  相似文献   

7.
Variation in the human genome sequence is key to understanding susceptibility to disease in modern populations and the history of ancestral populations. Unlocking this information requires knowledge of the patterns and underlying causes of human sequence diversity. By applying a new population-genetic framework to two genome-wide polymorphism surveys, we find that the human genome contains sizeable regions (stretching over tens of thousands of base pairs) that have intrinsically high and low rates of sequence variation. We show that the primary determinant of these patterns is shared genealogical history. Only a fraction of the variation (at most 25%) is due to the local mutation rate. By measuring the average distance over which genealogical histories are typically preserved, these data provide the first genome-wide estimate of the average extent of correlation among variants (linkage disequilibrium). The results are best explained by extreme variability in the recombination rate at a fine scale, and provide the first empirical evidence that such recombination 'hot spots' are a general feature of the human genome and have a principal role in shaping genetic variation in the human population.  相似文献   

8.
High-resolution haplotype structure in the human genome   总被引:41,自引:0,他引:41  
Linkage disequilibrium (LD) analysis is traditionally based on individual genetic markers and often yields an erratic, non-monotonic picture, because the power to detect allelic associations depends on specific properties of each marker, such as frequency and population history. Ideally, LD analysis should be based directly on the underlying haplotype structure of the human genome, but this structure has remained poorly understood. Here we report a high-resolution analysis of the haplotype structure across 500 kilobases on chromosome 5q31 using 103 single-nucleotide polymorphisms (SNPs) in a European-derived population. The results show a picture of discrete haplotype blocks (of tens to hundreds of kilobases), each with limited diversity punctuated by apparent sites of recombination. In addition, we develop an analytical model for LD mapping based on such haplotype blocks. If our observed structure is general (and published data suggest that it may be), it offers a coherent framework for creating a haplotype map of the human genome.  相似文献   

9.
There is considerable interest in understanding patterns of linkage disequilibrium (LD) in the human genome, to aid investigations of human evolution and facilitate association studies in complex disease. The relative influences of meiotic crossover distribution and population history on LD remain unclear, however. In particular, it is uncertain to what extent crossovers are clustered into 'hot spots, that might influence LD patterns. As a first step to investigating the relationship between LD and recombination, we have analyzed a 216-kb segment of the class II region of the major histocompatibility complex (MHC) already characterized for familial crossovers. High-resolution LD analysis shows the existence of extended domains of strong association interrupted by patchwork areas of LD breakdown. Sperm typing shows that these areas correspond precisely to meiotic crossover hot spots. All six hot spots defined share a remarkably similar symmetrical morphology but vary considerably in intensity, and are not obviously associated with any primary DNA sequence determinants of hot-spot activity. These hot spots occur in clusters and together account for almost all crossovers in this region of the MHC. These data show that, within the MHC at least, crossovers are far from randomly distributed at the molecular level and that recombination hot spots can profoundly affect LD patterns.  相似文献   

10.
Linkage disequilibrium (LD) mapping provides a powerful method for fine-structure localization of rare disease genes, but has not yet been widely applied to common disease. We sought to design a systematic approach for LD mapping and apply it to the localization of a gene (IBD5) conferring susceptibility to Crohn disease. The key issues are: (i) to detect a significant LD signal (ii) to rigorously bound the critical region and (iii) to identify the causal genetic variant within this region. We previously mapped the IBD5 locus to a large region spanning 18 cM of chromosome 5q31 (P<10(-4)). Using dense genetic maps of microsatellite markers and single-nucleotide polymorphisms (SNPs) across the entire region, we found strong evidence of LD. We bound the region to a common haplotype spanning 250 kb that shows strong association with the disease (P< 2 x 10(-7)) and contains the cytokine gene cluster. This finding provides overwhelming evidence that a specific common haplotype of the cytokine region in 5q31 confers susceptibility to Crohn disease. However, genetic evidence alone is not sufficient to identify the causal mutation within this region, as strong LD across the region results in multiple SNPs having equivalent genetic evidence-each consistent with the expected properties of the IBD5 locus. These results have important implications for Crohn disease in particular and LD mapping in general.  相似文献   

11.
The choice of which population to study in the mapping of common disease genes may be critical. Isolated founder populations, such as that found in Finland, have already proved extremely useful for mapping the genes for specific rare monogenic disorders and are being used in attempts to map the genes underlying common, complex diseases. But simulation results suggest that, under the common disease-common variant hypothesis, most isolated populations will prove no more useful for linkage disequilibrium (LD) mapping of common disease genes than large outbred populations. There is very little empirical data to either support or refute this conclusion at present. Therefore, we evaluated LD between 21 common microsatellite polymorphisms on chromosome 18q21 in 2 genetic isolates (Finland and Sardinia) and compared the results with those observed in two mixed populations (United Kingdom and United States of America). Mean levels of LD were similar across all four populations. Our results provide empirical support for the expectation that genetic isolates like Finland and Sardinia will not prove significantly more valuable than general populations for LD mapping of common variants underlying complex disease.  相似文献   

12.
Crossover between the human sex chromosomes during male meiosis is restricted to the terminal pseudoautosomal pairing regions. An obligatory exchange occurs in PAR1, an Xp/Yp pseudoautosomal region of 2.6 Mb, which creates a male-specific recombination 'hot domain' with a recombination rate that is about 20 times higher than the genome average. Low-resolution analysis of PAR1 suggests that crossovers are distributed fairly randomly. By contrast, linkage disequilibrium (LD) and sperm crossover analyses indicate that crossovers in autosomal regions tend to cluster into 'hot spots' of 1-2 kb that lie between islands of disequilibrium of tens to hundreds of kilobases. To determine whether at high resolution this autosomal pattern also applies to PAR1, we have examined linkage disequilibrium over an interval of 43 kb around the gene SHOX. Here we show that in northern European populations, disequilibrium decays rapidly with physical distance, which is consistent with this interval of PAR1 being recombinationally active in male meiosis. Analysis of a subregion of 9.9 kb in sperm shows, however, that crossovers are not distributed randomly, but instead cluster into an intense recombination hot spot that is very similar in morphology to autosomal hot spots. Thus, PAR1 crossover activity may be influenced by male-specific hot spots that are highly suitable for characterization by sperm DNA analysis.  相似文献   

13.
Dissecting the genetic basis of disease risk requires measuring all forms of genetic variation, including SNPs and copy number variants (CNVs), and is enabled by accurate maps of their locations, frequencies and population-genetic properties. We designed a hybrid genotyping array (Affymetrix SNP 6.0) to simultaneously measure 906,600 SNPs and copy number at 1.8 million genomic locations. By characterizing 270 HapMap samples, we developed a map of human CNV (at 2-kb breakpoint resolution) informed by integer genotypes for 1,320 copy number polymorphisms (CNPs) that segregate at an allele frequency >1%. More than 80% of the sequence in previously reported CNV regions fell outside our estimated CNV boundaries, indicating that large (>100 kb) CNVs affect much less of the genome than initially reported. Approximately 80% of observed copy number differences between pairs of individuals were due to common CNPs with an allele frequency >5%, and more than 99% derived from inheritance rather than new mutation. Most common, diallelic CNPs were in strong linkage disequilibrium with SNPs, and most low-frequency CNVs segregated on specific SNP haplotypes.  相似文献   

14.
Mutational analyses in model organisms have shown that genes affecting metabolism and stress resistance regulate life span, but the genes responsible for variation in longevity in natural populations are largely unidentified. Previously, we mapped quantitative trait loci (QTLs) affecting variation in longevity between two Drosophila melanogaster strains. Here, we show that the longevity QTL in the 36E;38B cytogenetic interval on chromosome 2 contains multiple closely linked QTLs, including the Dopa decarboxylase (Ddc) locus. Complementation tests to mutations show that Ddc is a positional candidate gene for life span in these strains. Linkage disequilibrium (LD) mapping in a sample of 173 alleles from a single population shows that three common molecular polymorphisms in Ddc account for 15.5% of the genetic contribution to variance in life span from chromosome 2. The polymorphisms are in strong LD, and the effects of the haplotypes on longevity suggest that the polymorphisms are maintained by balancing selection. DDC catalyzes the final step in the synthesis of the neurotransmitters, dopamine and serotonin. Thus, these data implicate variation in the synthesis of bioamines as a factor contributing to natural variation in individual life span.  相似文献   

15.
Characterizing genetic diversity within and between populations has broad applications in studies of human disease and evolution. We propose a new approach, spatial ancestry analysis, for the modeling of genotypes in two- or three-dimensional space. In spatial ancestry analysis (SPA), we explicitly model the spatial distribution of each SNP by assigning an allele frequency as a continuous function in geographic space. We show that the explicit modeling of the allele frequency allows individuals to be localized on the map on the basis of their genetic information alone. We apply our SPA method to a European and a worldwide population genetic variation data set and identify SNPs showing large gradients in allele frequency, and we suggest these as candidate regions under selection. These regions include SNPs in the well-characterized LCT region, as well as at loci including FOXP2, OCA2 and LRP1B.  相似文献   

16.
The fine-scale distribution of meiotic recombination events in the human genome can be inferred from patterns of haplotype diversity in human populations but directly studied only by high-resolution sperm typing. Both approaches indicate that crossovers are heavily clustered into narrow recombination hot spots. But our direct understanding of hot-spot properties and distributions is largely limited to sperm typing in the major histocompatibility complex (MHC). We now describe the analysis of an unremarkable 206-kb region on human chromosome 1, which identified localized regions of linkage disequilibrium breakdown that mark the locations of sperm crossover hot spots. The distribution, intensity and morphology of these hot spots are markedly similar to those in the MHC. But we also accidentally detected additional hot spots in regions of strong association. Coalescent analysis of genotype data detected most of the hot spots but showed significant differences between sperm crossover frequencies and historical recombination rates. This raises the possibility that some hot spots, particularly those in regions of strong association, may have evolved very recently and not left their full imprint on haplotype diversity. These results suggest that hot spots could be very abundant and possibly fluid features of the human genome.  相似文献   

17.
Substantial efforts are focused on identifying single-nucleotide polymorphisms (SNPs) throughout the human genome, particularly in coding regions (cSNPs), for both linkage disequilibrium and association studies. Less attention, however, has been directed to the clarification of evolutionary processes that are responsible for the variability in nucleotide diversity among different regions of the genome. We report here the population sequence diversity of genomic segments within a 450-kb cluster of olfactory receptor (OR) genes on human chromosome 17. We found a dichotomy in the pattern of nucleotide diversity between OR pseudogenes and introns on the one hand and the closely interspersed intact genes on the other. We suggest that weak positive selection is responsible for the observed patterns of genetic variation. This is inferred from a lower ratio of polymorphism to divergence in genes compared with pseudogenes or introns, high non-synonymous substitution rates in OR genes, and a small but significant overall reduction in variability in the entire OR gene cluster compared with other genomic regions. The dichotomy among functionally different segments within a short genomic distance requires high recombination rates within this OR cluster. Our work demonstrates the impact of weak positive selection on human nucleotide diversity, and has implications for the evolution of the olfactory repertoire.  相似文献   

18.
Recombination and linkage disequilibrium in Arabidopsis thaliana   总被引:4,自引:0,他引:4  
Linkage disequilibrium (LD) is a major aspect of the organization of genetic variation in natural populations. Here we describe the genome-wide pattern of LD in a sample of 19 Arabidopsis thaliana accessions using 341,602 non-singleton SNPs. LD decays within 10 kb on average, considerably faster than previously estimated. Tag SNP selection algorithms and 'hide-the-SNP' simulations suggest that genome-wide association mapping will require only 40%-50% of the observed SNPs, a reduction similar to estimates in a sample of African Americans. An Affymetrix genotyping array containing 250,000 SNPs has been designed based on these results; we demonstrate that it should have more than adequate coverage for genome-wide association mapping. The extent of LD is highly variable, and we find clear evidence of recombination hotspots, which seem to occur preferentially in intergenic regions. LD also reflects the action of selection, and it is more extensive between nonsynonymous polymorphisms than between synonymous polymorphisms.  相似文献   

19.
Mutations in BRCA1 (ref. 1) confer an increased risk of female breast cancer. In a genome-wide scan of linkage disequilibrium (LD), a high level of LD was detected among microsatellite markers flanking BRCA1 (ref. 3), raising the prospect that positive natural selection may have acted on this gene. We have used the predictions of evolutionary genetic theory to investigate this further. Using phylogeny-based maximum likelihood analysis of the BRCA1 sequences from primates and other mammals, we found that the ratios of replacement to silent nucleotide substitutions on the human and chimpanzee lineages were not different from one another (P=0.8), were different from those of other primate lineages (P=0.004) and were greater than 1 (P=0.04). This is consistent with the historic occurrence of positive darwinian selection pressure on the BRCA1 protein in the human and chimpanzee lineages. Analysis of genetic variation in a sample of female Australians of Northern European origin showed evidence for Hardy-Weinberg (HW) disequilibrium at polymorphic sites in BRCA1, consistent with the possibility that natural selection is affecting genotype frequencies in modern Europeans. The clustering of between-species variation in the region of the gene encoding the RAD51-interaction domain of BRCA1 suggests the maintenance of genomic integrity as a possible target of selection.  相似文献   

20.
Y chromosome sequence variation and the history of human populations   总被引:48,自引:0,他引:48  
Binary polymorphisms associated with the non-recombining region of the human Y chromosome (NRY) preserve the paternal genetic legacy of our species that has persisted to the present, permitting inference of human evolution, population affinity and demographic history. We used denaturing high-performance liquid chromatography (DHPLC; ref. 2) to identify 160 of the 166 bi-allelic and 1 tri-allelic site that formed a parsimonious genealogy of 116 haplotypes, several of which display distinct population affinities based on the analysis of 1062 globally representative individuals. A minority of contemporary East Africans and Khoisan represent the descendants of the most ancestral patrilineages of anatomically modern humans that left Africa between 35,000 and 89,000 years ago.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号