首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
A central challenge in genetics is to predict phenotypic variation from individual genome sequences. Here we construct and evaluate phenotypic predictions for 19 strains of Saccharomyces cerevisiae. We use conservation-based methods to predict the impact of protein-coding variation within genes on protein function. We then rank strains using a prediction score that measures the total sum of function-altering changes in different sets of genes reported to influence over 100 phenotypes in genome-wide loss-of-function screens. We evaluate our predictions by comparing them with the observed growth rate and efficiency of 15 strains tested across 20 conditions in quantitative experiments. The median predictive performance, as measured by ROC AUC, was 0.76, and predictions were more accurate when the genes reported to influence a trait were highly connected in a functional gene network.  相似文献   

2.
Numerous types of DNA variation exist, ranging from SNPs to larger structural alterations such as copy number variants (CNVs) and inversions. Alignment of DNA sequence from different sources has been used to identify SNPs and intermediate-sized variants (ISVs). However, only a small proportion of total heterogeneity is characterized, and little is known of the characteristics of most smaller-sized (<50 kb) variants. Here we show that genome assembly comparison is a robust approach for identification of all classes of genetic variation. Through comparison of two human assemblies (Celera's R27c compilation and the Build 35 reference sequence), we identified megabases of sequence (in the form of 13,534 putative non-SNP events) that were absent, inverted or polymorphic in one assembly. Database comparison and laboratory experimentation further demonstrated overlap or validation for 240 variable regions and confirmed >1.5 million SNPs. Some differences were simple insertions and deletions, but in regions containing CNVs, segmental duplication and repetitive DNA, they were more complex. Our results uncover substantial undescribed variation in humans, highlighting the need for comprehensive annotation strategies to fully interpret genome scanning and personalized sequencing projects.  相似文献   

3.
4.
A new study reports a comprehensive survey of genetic diversity in natural populations of the nematode Caenorhabditis elegans. Their analyses suggest that recent chromosome-scale selective sweeps have reduced C. elegans genetic diversity worldwide and strongly structured genetic variation across its genome.  相似文献   

5.
Detection of large-scale variation in the human genome   总被引:26,自引:0,他引:26  
We identified 255 loci across the human genome that contain genomic imbalances among unrelated individuals. Twenty-four variants are present in > 10% of the individuals that we examined. Half of these regions overlap with genes, and many coincide with segmental duplications or gaps in the human genome assembly. This previously unappreciated heterogeneity may underlie certain human phenotypic variation and susceptibility to disease and argues for a more dynamic human genome structure.  相似文献   

6.
Complex SNP-related sequence variation in segmental genome duplications   总被引:23,自引:0,他引:23  
There is uncertainty about the true nature of predicted single-nucleotide polymorphisms (SNPs) in segmental duplications (duplicons) and whether these markers genuinely exist at increased density as indicated in public databases. We explored these issues by genotyping 157 predicted SNPs in duplicons and control regions in normal diploid genomes and fully homozygous complete hydatidiform moles. Our data identified many true SNPs in duplicon regions and few paralogous sequence variants. Twenty-eight percent of the polymorphic duplicon sequences we tested involved multisite variation, a new type of polymorphism representing the sum of the signals from many individual duplicon copies that vary in sequence content due to duplication, deletion or gene conversion. Multisite variations can masquerade as normal SNPs when genotyped. Given that duplicons comprise at least 5% of the genome and many are yet to be annotated in the genome draft, effective strategies to identify multisite variation must be established and deployed.  相似文献   

7.
One goal in sequencing the Plasmodium falciparum genome, the agent of the most lethal form of malaria, is to discover vaccine and drug targets. However, identifying those targets in a genome in which approximately 60% of genes have unknown functions is an enormous challenge. Because the majority of known malaria antigens and drug-resistant genes are highly polymorphic and under various selective pressures, genome-wide analysis for signatures of selection may lead to discovery of new vaccine and drug candidates. Here we surveyed 3,539 P. falciparum genes ( approximately 65% of the predicted genes) for polymorphisms and identified various highly polymorphic loci and genes, some of which encode new antigens that we confirmed using human immune sera. Our collections of genome-wide SNPs ( approximately 65% nonsynonymous) and polymorphic microsatellites and indels provide a high-resolution map (one marker per approximately 4 kb) for mapping parasite traits and studying parasite populations. In addition, we report new antigens, providing urgently needed vaccine candidates for disease control.  相似文献   

8.
The ratio of genetic diversity on chromosome X to that on the autosomes is sensitive to both natural selection and demography. On the basis of whole-genome sequences of 69 females, we report that whereas this ratio increases with genetic distance from genes across populations, it is lower in Europeans than in West Africans independent of proximity to genes. This relative reduction is most parsimoniously explained by differences in demographic history without the need to invoke natural selection.  相似文献   

9.
Characterizing fine-scale variation in human recombination rates is important, both to deepen understanding of the recombination process and to aid the design of disease association studies. Current genetic maps show that rates vary on a megabase scale, but studying finer-scale variation using pedigrees is difficult. Sperm-typing experiments have characterized regions where crossovers cluster into 1-2-kb hot spots, but technical difficulties limit the number of studies. An alternative is to use population variation to infer fine-scale characteristics of the recombination process. Several surveys reported 'block-like' patterns of diversity, which may reflect fine-scale recombination rate variation, but limitations of available methods made this impossible to assess. Here, we applied a new statistical method, which overcomes these limitations, to infer patterns of fine-scale recombination rate variation in 74 genes. We found extensive rate variation both within and among genes. In particular, recombination hot spots are a common feature of the human genome: 47% (35 of 74) of genes showed substantive evidence for a hot spot, and many more showed evidence for some rate variation. No primary sequence characteristics are consistently associated with precise hot-spot location, although G+C content and nucleotide diversity are correlated with local recombination rate.  相似文献   

10.
Isolates of Salmonella enterica serovar Typhi (Typhi), a human-restricted bacterial pathogen that causes typhoid, show limited genetic variation. We generated whole-genome sequences for 19 Typhi isolates using 454 (Roche) and Solexa (Illumina) technologies. Isolates, including the previously sequenced CT18 and Ty2 isolates, were selected to represent major nodes in the phylogenetic tree. Comparative analysis showed little evidence of purifying selection, antigenic variation or recombination between isolates. Rather, evolution in the Typhi population seems to be characterized by ongoing loss of gene function, consistent with a small effective population size. The lack of evidence for antigenic variation driven by immune selection is in contrast to strong adaptive selection for mutations conferring antibiotic resistance in Typhi. The observed patterns of genetic isolation and drift are consistent with the proposed key role of asymptomatic carriers of Typhi as the main reservoir of this pathogen, highlighting the need for identification and treatment of carriers.  相似文献   

11.
Recent genomic surveys have produced high-resolution haplotype information, but only in a small number of human populations. We report haplotype structure across 12 Mb of DNA sequence in 927 individuals representing 52 populations. The geographic distribution of haplotypes reflects human history, with a loss of haplotype diversity as distance increases from Africa. Although the extent of linkage disequilibrium (LD) varies markedly across populations, considerable sharing of haplotype structure exists, and inferred recombination hotspot locations generally match across groups. The four samples in the International HapMap Project contain the majority of common haplotypes found in most populations: averaging across populations, 83% of common 20-kb haplotypes in a population are also common in the most similar HapMap sample. Consequently, although the portability of tag SNPs based on the HapMap is reduced in low-LD Africans, the HapMap will be helpful for the design of genome-wide association mapping studies in nearly all human populations.  相似文献   

12.
We conducted a meta-analysis of genome-wide association studies of systolic (SBP) and diastolic (DBP) blood pressure in 19,608 subjects of east Asian ancestry from the AGEN-BP consortium followed up with de novo genotyping (n = 10,518) and further replication (n = 20,247) in east Asian samples. We identified genome-wide significant (P < 5 × 10(-8)) associations with SBP or DBP, which included variants at four new loci (ST7L-CAPZA1, FIGN-GRB14, ENPEP and NPR3) and a newly discovered variant near TBX3. Among the five newly discovered variants, we obtained significant replication in the independent samples for all of these loci except NPR3. We also confirmed seven loci previously identified in populations of European descent. Moreover, at 12q24.13 near ALDH2, we observed strong association signals (P = 7.9 × 10(-31) and P = 1.3 × 10(-35) for SBP and DBP, respectively) with ethnic specificity. These findings provide new insights into blood pressure regulation and potential targets for intervention.  相似文献   

13.
14.
Kawasaki disease is a systemic vasculitis of unknown etiology, with clinical observations suggesting a substantial genetic contribution to disease susceptibility. We conducted a genome-wide association study and replication analysis in 2,173 individuals with Kawasaki disease and 9,383 controls from five independent sample collections. Two loci exceeded the formal threshold for genome-wide significance. The first locus is a functional polymorphism in the IgG receptor gene FCGR2A (encoding an H131R substitution) (rs1801274; P = 7.35 × 10(-11), odds ratio (OR) = 1.32), with the A allele (coding for histadine) conferring elevated disease risk. The second locus is at 19q13, (P = 2.51 × 10(-9), OR = 1.42 for the rs2233152 SNP near MIA and RAB4B; P = 1.68 × 10(-12), OR = 1.52 for rs28493229 in ITPKC), which confirms previous findings(1). The involvement of the FCGR2A locus may have implications for understanding immune activation in Kawasaki disease pathogenesis and the mechanism of response to intravenous immunoglobulin, the only proven therapy for this disease.  相似文献   

15.
The incidence of melanoma is increasing more than any other cancer, and knowledge of its genetic alterations is limited. To systematically analyze such alterations, we performed whole-exome sequencing of 14 matched normal and metastatic tumor DNAs. Using stringent criteria, we identified 68 genes that appeared to be somatically mutated at elevated frequency, many of which are not known to be genetically altered in tumors. Most importantly, we discovered that TRRAP harbored a recurrent mutation that clustered in one position (p. Ser722Phe) in 6 out of 167 affected individuals (~4%), as well as a previously unidentified gene, GRIN2A, which was mutated in 33% of melanoma samples. The nature, pattern and functional evaluation of the TRRAP recurrent mutation suggest that TRRAP functions as an oncogene. Our study provides, to our knowledge, the most comprehensive map of genetic alterations in melanoma to date and suggests that the glutamate signaling pathway is involved in this disease.  相似文献   

16.
Prior studies have identified recurrent oncogenic mutations in colorectal adenocarcinoma and have surveyed exons of protein-coding genes for mutations in 11 affected individuals. Here we report whole-genome sequencing from nine individuals with colorectal cancer, including primary colorectal tumors and matched adjacent non-tumor tissues, at an average of 30.7× and 31.9× coverage, respectively. We identify an average of 75 somatic rearrangements per tumor, including complex networks of translocations between pairs of chromosomes. Eleven rearrangements encode predicted in-frame fusion proteins, including a fusion of VTI1A and TCF7L2 found in 3 out of 97 colorectal cancers. Although TCF7L2 encodes TCF4, which cooperates with β-catenin in colorectal carcinogenesis, the fusion lacks the TCF4 β-catenin-binding domain. We found a colorectal carcinoma cell line harboring the fusion gene to be dependent on VTI1A-TCF7L2 for anchorage-independent growth using RNA interference-mediated knockdown. This study shows previously unidentified levels of genomic rearrangements in colorectal carcinoma that can lead to essential gene fusions and other oncogenic events.  相似文献   

17.
We performed exome sequencing to detect somatic mutations in protein-coding regions in seven melanoma cell lines and donor-matched germline cells. All melanoma samples had high numbers of somatic mutations, which showed the hallmark of UV-induced DNA repair. Such a hallmark was absent in tumor sample-specific mutations in two metastases derived from the same individual. Two melanomas with non-canonical BRAF mutations harbored gain-of-function MAP2K1 and MAP2K2 (MEK1 and MEK2, respectively) mutations, resulting in constitutive ERK phosphorylation and higher resistance to MEK inhibitors. Screening a larger cohort of individuals with melanoma revealed the presence of recurring somatic MAP2K1 and MAP2K2 mutations, which occurred at an overall frequency of 8%. Furthermore, missense and nonsense somatic mutations were frequently found in three candidate melanoma genes, FAT4, LRP1B and DSC1.  相似文献   

18.
The minimal gene set essential for life has long been sought. We report the 860-kb genome of the obligate intracellular plant pathogen phytoplasma (Candidatus Phytoplasma asteris, OY strain). The phytoplasma genome encodes even fewer metabolic functions than do mycoplasma genomes. It lacks the pentose phosphate cycle and, more unexpectedly, ATP-synthase subunits, which are thought to be essential for life. This may be the result of reductive evolution as a consequence of life as an intracellular parasite in a nutrient-rich environment.  相似文献   

19.
Chen WJ  Lin Y  Xiong ZQ  Wei W  Ni W  Tan GH  Guo SL  He J  Chen YF  Zhang QJ  Li HF  Lin Y  Murong SX  Xu J  Wang N  Wu ZY 《Nature genetics》2011,43(12):1252-1255
Paroxysmal kinesigenic dyskinesia is the most common type of paroxysmal movement disorder and is often misdiagnosed clinically as epilepsy. Using whole-exome sequencing followed by Sanger sequencing, we identified three truncating mutations within PRRT2 (NM_145239.2) in eight Han Chinese families with histories of paroxysmal kinesigenic dyskinesia: c.514_517delTCTG (p.Ser172Argfs*3) in one family, c.649dupC (p.Arg217Profs*8) in six families and c.972delA (p.Val325Serfs*12) in one family. These truncating mutations co-segregated exactly with the disease in these families and were not observed in 1,000 control subjects of matched ancestry. PRRT2 is a newly discovered gene consisting of four exons encoding the proline-rich transmembrane protein 2, which encompasses 340 amino acids and contains two predicted transmembrane domains. PRRT2 is highly expressed in the developing nervous system, and a truncating mutation alters the subcellular localization of the PRRT2 protein. The function of PRRT2 and its role in paroxysmal kinesigenic dyskinesia should be further investigated.  相似文献   

20.
The identification of tumor-suppressor genes in solid tumors by classical cancer genetics methods is difficult and slow. We combined nonsense-mediated RNA decay microarrays and array-based comparative genomic hybridization for the genome-wide identification of genes with biallelic inactivation involving nonsense mutations and loss of the wild-type allele. This approach enabled us to identify previously unknown mutations in the receptor tyrosine kinase gene EPHB2. The DU 145 prostate cancer cell line, originating from a brain metastasis, carries a truncating mutation of EPHB2 and a deletion of the remaining allele. Additional frameshift, splice site, missense and nonsense mutations are present in clinical prostate cancer samples. Transfection of DU 145 cells, which lack functional EphB2, with wild-type EPHB2 suppresses clonogenic growth. Taken together with studies indicating that EphB2 may have an essential role in cell migration and maintenance of normal tissue architecture, our findings suggest that mutational inactivation of EPHB2 may be important in the progression and metastasis of prostate cancer.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号