首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
Sequence variation in human genes is largely confined to single-nucleotide polymorphisms (SNPs) and is valuable in tests of association with common diseases and pharmacogenetic traits. We performed a systematic and comprehensive survey of molecular variation to assess the nature, pattern and frequency of SNPs in 75 candidate human genes for blood-pressure homeostasis and hypertension. We assayed 28 Mb (190 kb in 148 alleles) of genomic sequence, comprising the 5' and 3' untranslated regions (UTRs), introns and coding sequence of these genes, for sequence differences in individuals of African and Northern European descent using high-density variant detection arrays (VDAs). We identified 874 candidate human SNPs, of which 22% were confirmed by DNA sequencing to reveal a discordancy rate of 21% for VDA detection. The SNPs detected have an average minor allele frequency of 11%, and 387 are within the coding sequence (cSNPs). Of all cSNPs, 54% lead to a predicted change in the protein sequence, implying a high level of human protein diversity. These protein-altering SNPs are 38% of the total number of such SNPs expected, are more likely to be population-specific and are rarer in the human population, directly demonstrating the effects of natural selection on human genes. Overall, the degree of nucleotide polymorphism across these human genes, and orthologous great ape sequences, is highly variable and is correlated with the effects of functional conservation on gene sequences.  相似文献   

2.
Targeted capture combined with massively parallel exome sequencing is a promising approach to identify genetic variants implicated in human traits. We report exome sequencing of 200 individuals from Denmark with targeted capture of 18,654 coding genes and sequence coverage of each individual exome at an average depth of 12-fold. On average, about 95% of the target regions were covered by at least one read. We identified 121,870 SNPs in the sample population, including 53,081 coding SNPs (cSNPs). Using a statistical method for SNP calling and an estimation of allelic frequencies based on our population data, we derived the allele frequency spectrum of cSNPs with a minor allele frequency greater than 0.02. We identified a 1.8-fold excess of deleterious, non-syonomyous cSNPs over synonymous cSNPs in the low-frequency range (minor allele frequencies between 2% and 5%). This excess was more pronounced for X-linked SNPs, suggesting that deleterious substitutions are primarily recessive.  相似文献   

3.
Most human sequence variation is in the form of single-nucleotide polymorphisms (SNPs). It has been proposed that coding-region SNPs (cSNPs) be used for direct association studies to determine the genetic basis of complex traits. The success of such studies depends on the frequency of disease-associated alleles, and their distribution in different ethnic populations. If disease-associated alleles are frequent in most populations, then direct genotyping of candidate variants could show robust associations in manageable study samples. This approach is less feasible if the genetic risk from a given candidate gene is due to many infrequent alleles. Previous studies of several genes demonstrated that most variants are relatively infrequent (<0.05). These surveys genotyped small samples (n<75) and thus had limited ability to identify rare alleles. Here we evaluate the prevalence and distribution of such rare alleles by genotyping an ethnically diverse reference sample that is more than six times larger than those used in previous studies (n=450). We screened for variants in the complete coding sequence and intron-exon junctions of two candidate genes for neuropsychiatric phenotypes: SLC6A4, encoding the serotonin transporter; and SLC18A2, encoding the vesicular monoamine transporter. Both genes have unique roles in neuronal transmission, and variants in either gene might be associated with neurobehavioral phenotypes.  相似文献   

4.
Noncoding genetic variants are likely to influence human biology and disease, but recognizing functional noncoding variants is difficult. Approximately 3% of noncoding sequence is conserved among distantly related mammals, suggesting that these evolutionarily conserved noncoding regions (CNCs) are selectively constrained and contain functional variation. However, CNCs could also merely represent regions with lower local mutation rates. Here we address this issue and show that CNCs are selectively constrained in humans by analyzing HapMap genotype data. Specifically, new (derived) alleles of SNPs within CNCs are rarer than new alleles in nonconserved regions (P = 3 x 10(-18)), indicating that evolutionary pressure has suppressed CNC-derived allele frequencies. Intronic CNCs and CNCs near genes show greater allele frequency shifts, with magnitudes comparable to those for missense variants. Thus, conserved noncoding variants are more likely to be functional. Allele frequency distributions highlight selectively constrained genomic regions that should be intensively surveyed for functionally important variation.  相似文献   

5.
Substantial efforts are focused on identifying single-nucleotide polymorphisms (SNPs) throughout the human genome, particularly in coding regions (cSNPs), for both linkage disequilibrium and association studies. Less attention, however, has been directed to the clarification of evolutionary processes that are responsible for the variability in nucleotide diversity among different regions of the genome. We report here the population sequence diversity of genomic segments within a 450-kb cluster of olfactory receptor (OR) genes on human chromosome 17. We found a dichotomy in the pattern of nucleotide diversity between OR pseudogenes and introns on the one hand and the closely interspersed intact genes on the other. We suggest that weak positive selection is responsible for the observed patterns of genetic variation. This is inferred from a lower ratio of polymorphism to divergence in genes compared with pseudogenes or introns, high non-synonymous substitution rates in OR genes, and a small but significant overall reduction in variability in the entire OR gene cluster compared with other genomic regions. The dichotomy among functionally different segments within a short genomic distance requires high recombination rates within this OR cluster. Our work demonstrates the impact of weak positive selection on human nucleotide diversity, and has implications for the evolution of the olfactory repertoire.  相似文献   

6.
7.
Although studies suggest that SNPs derived from HapMap provide promising coverage and power for association studies, the lack of alternative variation datasets limits independent analysis. Using near-complete variation data for 76 genes resequenced in HapMap samples, we find that coverage of common variation by commercial genotyping arrays is substantially lower compared to the HapMap-based estimates. We quantify the power offered by these arrays for a range of disease models.  相似文献   

8.
The locations and properties of common deletion variants in the human genome are largely unknown. We describe a systematic method for using dense SNP genotype data to discover deletions and its application to data from the International HapMap Consortium to characterize and catalogue segregating deletion variants across the human genome. We identified 541 deletion variants (94% novel) ranging from 1 kb to 745 kb in size; 278 of these variants were observed in multiple, unrelated individuals, 120 in the homozygous state. The coding exons of ten expressed genes were found to be commonly deleted, including multiple genes with roles in sex steroid metabolism, olfaction and drug response. These common deletion polymorphisms typically represent ancestral mutations that are in linkage disequilibrium with nearby SNPs, meaning that their association to disease can often be evaluated in the course of SNP-based whole-genome association studies.  相似文献   

9.
We carried out a genome-wide association study of IgA nephropathy, a major cause of kidney failure worldwide. We studied 1,194 cases and 902 controls of Chinese Han ancestry, with targeted follow up in Chinese and European cohorts comprising 1,950 cases and 1,920 controls. We identified three independent loci in the major histocompatibility complex, as well as a common deletion of CFHR1 and CFHR3 at chromosome 1q32 and a locus at chromosome 22q12 that each surpassed genome-wide significance (P values for association between 1.59 × 10?2? and 4.84 × 10?? and minor allele odds ratios of 0.63-0.80). These five loci explain 4-7% of the disease variance and up to a tenfold variation in interindividual risk. Many of the alleles that protect against IgA nephropathy impart increased risk for other autoimmune or infectious diseases, and IgA nephropathy risk allele frequencies closely parallel the variation in disease prevalence among Asian, European and African populations, suggesting complex selective pressures.  相似文献   

10.
Osteoarthritis is the most common form of human arthritis. We investigated the potential role of asporin, an extracellular matrix component expressed abundantly in the articular cartilage of individuals with osteoarthritis, in the pathogenesis of osteoarthritis. Here we report a significant association between a polymorphism in the aspartic acid (D) repeat of the gene encoding asporin (ASPN) and osteoarthritis. In two independent populations of individuals with knee osteoarthritis, the D14 allele of ASPN is over-represented relative to the common D13 allele, and its frequency increases with disease severity. The D14 allele is also over-represented in individuals with hip osteoarthritis. Asporin suppresses TGF-beta-mediated expression of the genes aggrecan (AGC1) and type II collagen (COL2A1) and reduced proteoglycan accumulation in an in vitro model of chondrogenesis. The effect on TGF-beta activity is allele-specific, with the D14 allele resulting in greater inhibition than other alleles. In vitro binding assays showed a direct interaction between asporin and TGF-beta. Taken together, these findings provide another functional link between extracellular matrix proteins, TGF-beta activity and disease, suggesting new therapeutic strategies for osteoarthritis.  相似文献   

11.
Age-related macular degeneration (AMD) is a common, late-onset disease with seemingly typical complexity: recurrence ratios for siblings of an affected individual are three- to sixfold higher than in the general population, and family-based analysis has resulted in only modestly significant evidence for linkage. In a case-control study drawn from a US-based population of European descent, we have identified a previously unrecognized common, noncoding variant in CFH, the gene encoding complement factor H, that substantially increases the influence of this locus on AMD, and we have strongly replicated the associations of four other previously reported common alleles in three genes (P values ranging from 10(-6) to 10(-70)). Despite excellent power to detect epistasis, we observed purely additive accumulation of risk from alleles at these genes. We found no differences in association of these loci with major phenotypic categories of advanced AMD. Genotypes at these five common SNPs define a broad spectrum of interindividual disease risk and explain about half of the classical sibling risk of AMD in our study population.  相似文献   

12.
13.
Population genomics of human gene expression   总被引:1,自引:0,他引:1  
Genetic variation influences gene expression, and this variation in gene expression can be efficiently mapped to specific genomic regions and variants. Here we have used gene expression profiling of Epstein-Barr virus-transformed lymphoblastoid cell lines of all 270 individuals genotyped in the HapMap Consortium to elucidate the detailed features of genetic variation underlying gene expression variation. We find that gene expression is heritable and that differentiation between populations is in agreement with earlier small-scale studies. A detailed association analysis of over 2.2 million common SNPs per population (5% frequency in HapMap) with gene expression identified at least 1,348 genes with association signals in cis and at least 180 in trans. Replication in at least one independent population was achieved for 37% of cis signals and 15% of trans signals, respectively. Our results strongly support an abundance of cis-regulatory variation in the human genome. Detection of trans effects is limited but suggests that regulatory variation may be the key primary effect contributing to phenotypic variation in humans. We also explore several methodologies that improve the current state of analysis of gene expression variation.  相似文献   

14.
Here we report the application of high-density oligonucleotide array (DNA chip)-based analysis to determine the distant history of single nucleotide polymorphisms (SNPs) in current human populations. We analysed orthologues for 397 human SNP sites (identified in CEPH pedigrees from Amish, Venezuelan and Utah populations) from 23 common chimpanzee, 19 pygmy chimpanzee and 11 gorilla genomic DNA samples. From this data we determined 214 proposed ancestral alleles (the sequence found in the last common ancestor of humans and chimpanzees). In a diverse human population set, we found that SNP alleles with higher frequencies were more likely to be ancestral than less frequently occurring alleles. There were, however, exceptions. We also found three shared human/pygmy chimpanzee polymorphisms, all involving CpG dinucleotides, and two shared human/gorilla polymorphisms, one involving a CpG dinucleotide. We demonstrate that microarray-based assays allow rapid comparative sequence analysis of intra- and interspecies genetic variation.  相似文献   

15.
The effects of alleles in many genes are believed to contribute to common complex diseases such as hypertension. Whether risk alleles comprise a small number of common variants or many rare independent mutations at trait loci is largely unknown. We screened members of the Framingham Heart Study (FHS) for variation in three genes-SLC12A3 (NCCT), SLC12A1 (NKCC2) and KCNJ1 (ROMK)-causing rare recessive diseases featuring large reductions in blood pressure. Using comparative genomics, genetics and biochemistry, we identified subjects with mutations proven or inferred to be functional. These mutations, all heterozygous and rare, produce clinically significant blood pressure reduction and protect from development of hypertension. Our findings implicate many rare alleles that alter renal salt handling in blood pressure variation in the general population, and identify alleles with health benefit that are nonetheless under purifying selection. These findings have implications for the genetic architecture of hypertension and other common complex traits.  相似文献   

16.
More than 5 million single-nucleotide polymorphisms (SNPs) with minor-allele frequency greater than 10% are expected to exist in the human genome. Some of these SNPs may be associated with risk of developing common diseases. To assess the power of currently available SNPs to detect such associations, we resequenced 50 genes in two ethnic samples and measured patterns of linkage disequilibrium between the subset of SNPs reported in dbSNP and the complete set of common SNPs. Our results suggest that using all 2.7 million SNPs currently in the database would detect nearly 80% of all common SNPs in European populations but only 50% of those common in the African American population and that efficient selection of a minimal subset of SNPs for use in association studies requires measurement of allele frequency and linkage disequilibrium relationships for all SNPs in dbSNP.  相似文献   

17.
The proteins encoded by the classical HLA class I and class II genes in the major histocompatibility complex (MHC) are highly polymorphic and are essential in self versus non-self immune recognition. HLA variation is a crucial determinant of transplant rejection and susceptibility to a large number of infectious and autoimmune diseases. Yet identification of causal variants is problematic owing to linkage disequilibrium that extends across multiple HLA and non-HLA genes in the MHC. We therefore set out to characterize the linkage disequilibrium patterns between the highly polymorphic HLA genes and background variation by typing the classical HLA genes and >7,500 common SNPs and deletion-insertion polymorphisms across four population samples. The analysis provides informative tag SNPs that capture much of the common variation in the MHC region and that could be used in disease association studies, and it provides new insight into the evolutionary dynamics and ancestral origins of the HLA loci and their haplotypes.  相似文献   

18.
Variation in DNA sequence contributes to individual differences in quantitative traits, but in humans the specific sequence variants are known for very few traits. We characterized variation in gene expression in cells from individuals belonging to three major population groups. This quantitative phenotype differs significantly between European-derived and Asian-derived populations for 1,097 of 4,197 genes tested. For the phenotypes with the strongest evidence of cis determinants, most of the variation is due to allele frequency differences at cis-linked regulators. The results show that specific genetic variation among populations contributes appreciably to differences in gene expression phenotypes. Populations differ in prevalence of many complex genetic diseases, such as diabetes and cardiovascular disease. As some of these are probably influenced by the level of gene expression, our results suggest that allele frequency differences at regulatory polymorphisms also account for some population differences in prevalence of complex diseases.  相似文献   

19.
Many sequence variants affecting diversity of adult human height   总被引:1,自引:0,他引:1  
Adult human height is one of the classical complex human traits. We searched for sequence variants that affect height by scanning the genomes of 25,174 Icelanders, 2,876 Dutch, 1,770 European Americans and 1,148 African Americans. We then combined these results with previously published results from the Diabetes Genetics Initiative on 3,024 Scandinavians and tested a selected subset of SNPs in 5,517 Danes. We identified 27 regions of the genome with one or more sequence variants showing significant association with height. The estimated effects per allele of these variants ranged between 0.3 and 0.6 cm and, taken together, they explain around 3.7% of the population variation in height. The genes neighboring the identified loci cluster in biological processes related to skeletal development and mitosis. Association to three previously reported loci are replicated in our analyses, and the strongest association was with SNPs in the ZBTB38 gene.  相似文献   

20.
Haplotype tagging for the identification of common disease genes   总被引:61,自引:0,他引:61  
Genome-wide linkage disequilibrium (LD) mapping of common disease genes could be more powerful than linkage analysis if the appropriate density of polymorphic markers were known and if the genotyping effort and cost of producing such an LD map could be reduced. Although different metrics that measure the extent of LD have been evaluated, even the most recent studies have not placed significant emphasis on the most informative and cost-effective method of LD mapping-that based on haplotypes. We have scanned 135 kb of DNA from nine genes, genotyped 122 single-nucleotide polymorphisms (SNPs; approximately 184,000 genotypes) and determined the common haplotypes in a minimum of 384 European individuals for each gene. Here we show how knowledge of the common haplotypes and the SNPs that tag them can be used to (i) explain the often complex patterns of LD between adjacent markers, (ii) reduce genotyping significantly (in this case from 122 to 34 SNPs), (iii) scan the common variation of a gene sensitively and comprehensively and (iv) provide key fine-mapping data within regions of strong LD. Our results also indicate that, at least for the genes studied here, the current version of dbSNP would have been of limited utility for LD mapping because many common haplotypes could not be defined. A directed re-sequencing effort of the approximately 10% of the genome in or near genes in the major ethnic groups would aid the systematic evaluation of the common variant model of common disease.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号