Similar literature
 20 similar documents found (search time: 93 ms)
1.
Targeted capture combined with massively parallel exome sequencing is a promising approach to identify genetic variants implicated in human traits. We report exome sequencing of 200 individuals from Denmark with targeted capture of 18,654 coding genes and sequence coverage of each individual exome at an average depth of 12-fold. On average, about 95% of the target regions were covered by at least one read. We identified 121,870 SNPs in the sample population, including 53,081 coding SNPs (cSNPs). Using a statistical method for SNP calling and an estimation of allelic frequencies based on our population data, we derived the allele frequency spectrum of cSNPs with a minor allele frequency greater than 0.02. We identified a 1.8-fold excess of deleterious, non-synonymous cSNPs over synonymous cSNPs in the low-frequency range (minor allele frequencies between 2% and 5%). This excess was more pronounced for X-linked SNPs, suggesting that deleterious substitutions are primarily recessive.

2.
Dissecting the genetic basis of disease risk requires measuring all forms of genetic variation, including SNPs and copy number variants (CNVs), and is enabled by accurate maps of their locations, frequencies and population-genetic properties. We designed a hybrid genotyping array (Affymetrix SNP 6.0) to simultaneously measure 906,600 SNPs and copy number at 1.8 million genomic locations. By characterizing 270 HapMap samples, we developed a map of human CNV (at 2-kb breakpoint resolution) informed by integer genotypes for 1,320 copy number polymorphisms (CNPs) that segregate at an allele frequency >1%. More than 80% of the sequence in previously reported CNV regions fell outside our estimated CNV boundaries, indicating that large (>100 kb) CNVs affect much less of the genome than initially reported. Approximately 80% of observed copy number differences between pairs of individuals were due to common CNPs with an allele frequency >5%, and more than 99% derived from inheritance rather than new mutation. Most common, diallelic CNPs were in strong linkage disequilibrium with SNPs, and most low-frequency CNVs segregated on specific SNP haplotypes.

3.
A major goal in human genetics is to understand the role of common genetic variants in susceptibility to common diseases. This will require characterizing the nature of gene variation in human populations, assembling an extensive catalogue of single-nucleotide polymorphisms (SNPs) in candidate genes and performing association studies for particular diseases. At present, our knowledge of human gene variation remains rudimentary. Here we describe a systematic survey of SNPs in the coding regions of human genes. We identified SNPs in 106 genes relevant to cardiovascular disease, endocrinology and neuropsychiatry by screening an average of 114 independent alleles using 2 independent screening methods. To ensure high accuracy, all reported SNPs were confirmed by DNA sequencing. We identified 560 SNPs, including 392 coding-region SNPs (cSNPs) divided roughly equally between those causing synonymous and non-synonymous changes. We observed different rates of polymorphism among classes of sites within genes (non-coding, degenerate and non-degenerate) as well as between genes. The cSNPs most likely to influence disease, those that alter the amino acid sequence of the encoded protein, are found at a lower rate and with lower allele frequencies than silent substitutions. This likely reflects selection acting against deleterious alleles during human evolution. The lower allele frequency of missense cSNPs has implications for the compilation of a comprehensive catalogue, as well as for the subsequent application to disease association.

4.
As end-stage renal disease (ESRD) has a four times higher incidence in African Americans compared to European Americans, we hypothesized that susceptibility alleles for ESRD have a higher frequency in the West African than the European gene pool. We carried out a genome-wide admixture scan in 1,372 ESRD cases and 806 controls and found a highly significant association between excess African ancestry and nondiabetic ESRD (lod score = 5.70) but not diabetic ESRD (lod = 0.47) on chromosome 22q12. Each copy of the European ancestral allele conferred a relative risk of 0.50 (95% CI = 0.39-0.63) compared to African ancestry. Multiple common SNPs (allele frequencies ranging from 0.2 to 0.6) in the gene encoding nonmuscle myosin heavy chain type II isoform A (MYH9) were associated with two to four times greater risk of nondiabetic ESRD and accounted for a large proportion of the excess risk of ESRD observed in African compared to European Americans.

5.
We carried out a genome-wide association study (318,237 SNPs) for insulin resistance and related phenotypes in 2,684 Indian Asians, with further testing in 11,955 individuals of Indian Asian or European ancestry. We found associations of rs12970134 near MC4R with waist circumference (P = 1.7 × 10⁻⁹) and, independently, with insulin resistance. Homozygotes for the risk allele of rs12970134 have approximately 2 cm increased waist circumference. Common genetic variation near MC4R is associated with risk of adiposity and insulin resistance.

6.
We carried out a genome-wide association study of type-2 diabetes (T2D) in individuals of South Asian ancestry. Our discovery set included 5,561 individuals with T2D (cases) and 14,458 controls drawn from studies in London, Pakistan and Singapore. We identified 20 independent SNPs associated with T2D at P < 10⁻⁴ for testing in a replication sample of 13,170 cases and 25,398 controls, also all of South Asian ancestry. In the combined analysis, we identified common genetic variants at six loci (GRB14, ST6GAL1, VPS26A, HMG20A, AP3S2 and HNF4A) newly associated with T2D (P = 4.1 × 10⁻⁸ to P = 1.9 × 10⁻¹¹). SNPs at GRB14 were also associated with insulin sensitivity (P = 5.0 × 10⁻⁴), and SNPs at ST6GAL1 and HNF4A were also associated with pancreatic beta-cell function (P = 0.02 and P = 0.001, respectively). Our findings provide additional insight into mechanisms underlying T2D and show the potential for new discovery from genetic association studies in South Asians, a population with increased susceptibility to T2D.

7.
Noncoding genetic variants are likely to influence human biology and disease, but recognizing functional noncoding variants is difficult. Approximately 3% of noncoding sequence is conserved among distantly related mammals, suggesting that these evolutionarily conserved noncoding regions (CNCs) are selectively constrained and contain functional variation. However, CNCs could also merely represent regions with lower local mutation rates. Here we address this issue and show that CNCs are selectively constrained in humans by analyzing HapMap genotype data. Specifically, new (derived) alleles of SNPs within CNCs are rarer than new alleles in nonconserved regions (P = 3 × 10⁻¹⁸), indicating that evolutionary pressure has suppressed CNC-derived allele frequencies. Intronic CNCs and CNCs near genes show greater allele frequency shifts, with magnitudes comparable to those for missense variants. Thus, conserved noncoding variants are more likely to be functional. Allele frequency distributions highlight selectively constrained genomic regions that should be intensively surveyed for functionally important variation.

8.
Large data sets on human genetic variation have been collected recently, but their usefulness for learning about history and natural selection has been limited by biases in the ways polymorphisms were chosen. We report large subsets of SNPs from the International HapMap Project that allow us to overcome these biases and to provide accurate measurement of a quantity of crucial importance for understanding genetic variation: the allele frequency spectrum. Our analysis shows that East Asian and northern European ancestors shared the same population bottleneck expanding out of Africa but that both also experienced more recent genetic drift, which was greater in East Asians.

9.
Population stratification occurs in case-control association studies when allele frequencies differ between cases and controls because of ancestry. Stratification may lead to false positive associations, although this issue remains controversial. Empirical studies have found little evidence of stratification in European-derived populations, but potentially significant levels of stratification could not be ruled out. We studied a European American panel discordant for height, a heritable trait that varies widely across Europe. Genotyping 178 SNPs and applying standard analytical methods yielded no evidence of stratification. But a SNP in the gene LCT that varies widely in frequency across Europe was strongly associated with height (P < 10⁻⁶). This apparent association was largely or completely due to stratification; rematching individuals on the basis of European ancestry greatly reduced the apparent association, and no association was observed in Polish or Scandinavian individuals. The failure of standard methods to detect this stratification indicates that new methods may be required.

10.
In search of common risk alleles for prostate cancer that could contribute to high rates of the disease in men of African ancestry, we conducted a genome-wide association study, with 1,047,986 SNP markers examined in 3,425 African-Americans with prostate cancer (cases) and 3,290 African-American male controls. We followed up the most significant 17 new associations from stage 1 in 1,844 cases and 3,269 controls of African ancestry. We identified a new risk variant on chromosome 17q21 (rs7210100, odds ratio per allele = 1.51, P = 3.4 × 10⁻¹³). The frequency of the risk allele is ~5% in men of African descent, whereas it is rare in other populations (<1%). Further studies are needed to investigate the biological contribution of this allele to prostate cancer risk. These findings emphasize the importance of conducting genome-wide association studies in diverse populations.

11.
Systemic lupus erythematosus (SLE) is a common systemic autoimmune disease with complex etiology but strong clustering in families (λ_S ≈ 30). We performed a genome-wide association scan using 317,501 SNPs in 720 women of European ancestry with SLE and in 2,337 controls, and we genotyped consistently associated SNPs in two additional independent sample sets totaling 1,846 affected women and 1,825 controls. Aside from the expected strong association between SLE and the HLA region on chromosome 6p21 and the previously confirmed non-HLA locus IRF5 on chromosome 7q32, we found evidence of association with replication (1.1 × 10⁻⁷ < P_overall < 1.6 × 10⁻²³; odds ratio = 0.82-1.62) in four regions: 16p11.2 (ITGAM), 11p15.5 (KIAA1542), 3p14.3 (PXK) and 1q25.1 (rs10798269). We also found evidence for association (P < 1 × 10⁻⁵) at FCGR2A, PTPN22 and STAT4, regions previously associated with SLE and other autoimmune diseases, as well as at ≥9 other loci (P < 2 × 10⁻⁷). Our results show that numerous genes, some with known immune-related functions, predispose to SLE.

12.
Familial clustering studies indicate that breast cancer risk has a substantial genetic component. To identify new breast cancer risk variants, we genotyped approximately 300,000 SNPs in 1,600 Icelandic individuals with breast cancer and 11,563 controls using the Illumina Hap300 platform. We then tested selected SNPs in five replication sample sets. Overall, we studied 4,554 affected individuals and 17,577 controls. Two SNPs consistently associated with breast cancer: approximately 25% of individuals of European descent are homozygous for allele A of rs13387042 on chromosome 2q35 and have an estimated 1.44-fold greater risk than noncarriers, and for allele T of rs3803662 on 16q12, about 7% are homozygous and have a 1.64-fold greater risk. Risk from both alleles was confined to estrogen receptor-positive tumors. At present, no genes have been identified in the linkage disequilibrium block containing rs13387042. rs3803662 is near the 5' end of TNRC9, a high mobility group chromatin-associated protein whose expression is implicated in breast cancer metastasis to bone.

13.
We carried out a multistage genome-wide association study of type 2 diabetes mellitus in Japanese individuals, with a total of 1,612 cases and 1,424 controls and 100,000 SNPs. The most significant association was obtained with SNPs in KCNQ1, and dense mapping within the gene revealed that rs2237892 in intron 15 showed the lowest P value (6.7 × 10⁻¹³, odds ratio (OR) = 1.49). The association of KCNQ1 with type 2 diabetes was replicated in populations of Korean, Chinese and European ancestry as well as in two independent Japanese populations, and meta-analysis with a total of 19,930 individuals (9,569 cases and 10,361 controls) yielded a P value of 1.7 × 10⁻⁴² (OR = 1.40; 95% CI = 1.34-1.47) for rs2237892. Among control subjects, the risk allele of this polymorphism was associated with impairment of insulin secretion according to the homeostasis model assessment of beta-cell function or the corrected insulin response. Our data thus implicate KCNQ1 as a diabetes susceptibility gene in groups of different ancestries.

14.
Strong signatures of positive selection at newly arising genetic variants are well documented in humans, but this form of selection may not be widespread in recent human evolution. Because many human traits are highly polygenic and partly determined by common, ancient genetic variation, an alternative model for rapid genetic adaptation has been proposed: weak selection acting on many pre-existing (standing) genetic variants, or polygenic adaptation. By studying height, a classic polygenic trait, we demonstrate the first human signature of widespread selection on standing variation. We show that frequencies of alleles associated with increased height, both at known loci and genome wide, are systematically elevated in Northern Europeans compared with Southern Europeans (P < 4.3 × 10⁻⁴). This pattern mirrors intra-European height differences and is not confounded by ancestry or other ascertainment biases. The systematic frequency differences are consistent with the presence of widespread weak selection (selection coefficients ~10⁻³ to 10⁻⁵ per allele) rather than genetic drift alone (P < 10⁻¹⁵).

15.
Estrogen receptor (ER)-negative breast cancer shows a higher incidence in women of African ancestry compared to women of European ancestry. In search of common risk alleles for ER-negative breast cancer, we combined genome-wide association study (GWAS) data from women of African ancestry (1,004 ER-negative cases and 2,745 controls) and European ancestry (1,718 ER-negative cases and 3,670 controls), with replication testing conducted in an additional 2,292 ER-negative cases and 16,901 controls of European ancestry. We identified a common risk variant for ER-negative breast cancer at the TERT-CLPTM1L locus on chromosome 5p15 (rs10069690: per-allele odds ratio (OR) = 1.18, P = 1.0 × 10⁻¹⁰). The variant was also significantly associated with triple-negative (ER-negative, progesterone receptor (PR)-negative and human epidermal growth factor-2 (HER2)-negative) breast cancer (OR = 1.25, P = 1.1 × 10⁻⁹), particularly in younger women (<50 years of age) (OR = 1.48, P = 1.9 × 10⁻⁹). Our results identify a genetic locus associated with estrogen receptor-negative breast cancer subtypes in multiple populations.

16.
Sequence variation in human genes is largely confined to single-nucleotide polymorphisms (SNPs) and is valuable in tests of association with common diseases and pharmacogenetic traits. We performed a systematic and comprehensive survey of molecular variation to assess the nature, pattern and frequency of SNPs in 75 candidate human genes for blood-pressure homeostasis and hypertension. We assayed 28 Mb (190 kb in 148 alleles) of genomic sequence, comprising the 5' and 3' untranslated regions (UTRs), introns and coding sequence of these genes, for sequence differences in individuals of African and Northern European descent using high-density variant detection arrays (VDAs). We identified 874 candidate human SNPs, of which 22% were confirmed by DNA sequencing to reveal a discordancy rate of 21% for VDA detection. The SNPs detected have an average minor allele frequency of 11%, and 387 are within the coding sequence (cSNPs). Of all cSNPs, 54% lead to a predicted change in the protein sequence, implying a high level of human protein diversity. These protein-altering SNPs are 38% of the total number of such SNPs expected, are more likely to be population-specific and are rarer in the human population, directly demonstrating the effects of natural selection on human genes. Overall, the degree of nucleotide polymorphism across these human genes, and orthologous great ape sequences, is highly variable and is correlated with the effects of functional conservation on gene sequences.

17.
To identify risk variants for lung cancer, we conducted a multistage genome-wide association study. In the discovery phase, we analyzed 315,450 tagging SNPs in 1,154 current and former (ever) smoking cases of European ancestry and 1,137 frequency-matched, ever-smoking controls from Houston, Texas. For replication, we evaluated the ten SNPs most significantly associated with lung cancer in an additional 711 cases and 632 controls from Texas and 2,013 cases and 3,062 controls from the UK. Two SNPs, rs1051730 and rs8034191, mapping to a region of strong linkage disequilibrium within 15q25.1 containing PSMA4 and the nicotinic acetylcholine receptor subunit genes CHRNA3 and CHRNA5, were significantly associated with risk in both replication sets. Combined analysis yielded odds ratios of 1.32 (P < 1 × 10⁻¹⁷) for both SNPs. Haplotype analysis was consistent with there being a single risk variant in this region. We conclude that variation in a region of 15q25.1 containing nicotinic acetylcholine receptor genes contributes to lung cancer risk.

18.
Human earwax consists of wet and dry types. Dry earwax is frequent in East Asians, whereas wet earwax is common in other populations. Here we show that a SNP, 538G→A (rs17822931), in the ABCC11 gene is responsible for determination of earwax type. The AA genotype corresponds to dry earwax, and GA and GG to wet type. A 27-bp deletion in ABCC11 exon 29 was also found in a few individuals of Asian ancestry. A functional assay demonstrated that cells with allele A show a lower excretory activity for cGMP than those with allele G. The allele A frequency shows a north-south and east-west downward geographical gradient; worldwide, it is highest in Chinese and Koreans, and a common dry-type haplotype is retained among various ethnic populations. These findings suggest that allele A arose in northeast Asia and thereafter spread through the world. The 538G→A SNP is the first example of DNA polymorphism determining a visible genetic trait.

19.
More than 5 million single-nucleotide polymorphisms (SNPs) with minor-allele frequency greater than 10% are expected to exist in the human genome. Some of these SNPs may be associated with risk of developing common diseases. To assess the power of currently available SNPs to detect such associations, we resequenced 50 genes in two ethnic samples and measured patterns of linkage disequilibrium between the subset of SNPs reported in dbSNP and the complete set of common SNPs. Our results suggest that using all 2.7 million SNPs currently in the database would detect nearly 80% of all common SNPs in European populations but only 50% of those common in the African American population and that efficient selection of a minimal subset of SNPs for use in association studies requires measurement of allele frequency and linkage disequilibrium relationships for all SNPs in dbSNP.

20.
Population stratification refers to differences in allele frequencies between cases and controls due to systematic differences in ancestry rather than association of genes with disease. It has been proposed that false positive associations due to stratification can be controlled by genotyping a few dozen unlinked genetic markers. To assess stratification empirically, we analyzed data from 11 case-control and case-cohort association studies. We did not detect statistically significant evidence for stratification but did observe that assessments based on a few dozen markers lack power to rule out moderate levels of stratification that could cause false positive associations in studies designed to detect modest genetic risk factors. After increasing the number of markers and samples in a case-cohort study (the design most immune to stratification), we found that stratification was in fact present. Our results suggest that modest amounts of stratification can exist even in well-designed studies.
