Found 20 similar articles (search time: 15 ms)
1.
Cao J Schneeberger K Ossowski S Günther T Bender S Fitz J Koenig D Lanz C Stegle O Lippert C Wang X Ott F Müller J Alonso-Blanco C Borgwardt K Schmid KJ Weigel D 《Nature genetics》2011,43(10):956-963
The plant Arabidopsis thaliana occurs naturally in many different habitats throughout Eurasia. As a foundation for identifying genetic variation contributing to adaptation to diverse environments, a 1001 Genomes Project to sequence geographically diverse A. thaliana strains has been initiated. Here we present the first phase of this project, based on population-scale sequencing of 80 strains drawn from eight regions throughout the species' native range. We describe the majority of common small-scale polymorphisms as well as many larger insertions and deletions in the A. thaliana pan-genome, their effects on gene function, and the patterns of local and global linkage among these variants. The action of processes other than spontaneous mutation is identified by comparing the spectrum of mutations that have accumulated since A. thaliana diverged from its closest relative 10 million years ago with the spectrum observed in the laboratory. Recent species-wide selective sweeps are rare, and potentially deleterious mutations are more common in marginal populations.
2.
3.
Whole-genome sequencing and comprehensive variant analysis of a Japanese individual using massively parallel sequencing [Cited by: 1 (self-citations: 0, citations by others: 1)]
Fujimoto A Nakagawa H Hosono N Nakano K Abe T Boroevich KA Nagasaki M Yamaguchi R Shibuya T Kubo M Miyano S Nakamura Y Tsunoda T 《Nature genetics》2010,42(11):931-936
We report the analysis of a Japanese male using high-throughput sequencing to ×40 coverage. More than 99% of the sequence reads were mapped to the reference human genome. Using a Bayesian decision method, we identified 3,132,608 single nucleotide variations (SNVs). Comparison with six previously reported genomes revealed an excess of singleton nonsense and nonsynonymous SNVs, as well as singleton SNVs in conserved non-coding regions. We also identified 5,319 deletions smaller than 10 kb with high accuracy, in addition to copy number variations and rearrangements. De novo assembly of the unmapped sequence reads generated around 3 Mb of novel sequence, which showed high similarity to non-reference human genomes and the human herpesvirus 4 genome. Our analysis suggests that considerable variation remains undiscovered in the human genome and that whole-genome sequencing is an invaluable tool for obtaining a complete understanding of human genetic variation.
4.
Harris SR Clarke IN Seth-Smith HM Solomon AW Cutcliffe LT Marsh P Skilton RJ Holland MJ Mabey D Peeling RW Lewis DA Spratt BG Unemo M Persson K Bjartling C Brunham R de Vries HJ Morré SA Speksnijder A Bébéar CM Clerc M de Barbeyrac B Parkhill J Thomson NR 《Nature genetics》2012,44(4):413-9, S1
Chlamydia trachomatis is responsible for both trachoma and sexually transmitted infections, causing substantial morbidity and economic cost globally. Despite this, our knowledge of its population and evolutionary genetics is limited. Here we present a detailed phylogeny based on whole-genome sequencing of representative strains of C. trachomatis from both trachoma and lymphogranuloma venereum (LGV) biovars from temporally and geographically diverse sources. Our analysis shows that predicting phylogenetic structure using ompA, which is traditionally used to classify Chlamydia, is misleading because extensive recombination in this region masks any true relationships present. We show that in many instances, ompA is a chimera that can be exchanged in part or as a whole both within and between biovars. We also provide evidence for exchange of, and recombination within, the cryptic plasmid, which is another key diagnostic target. We used our phylogenetic framework to show how genetic exchange has manifested itself in ocular, urogenital and LGV C. trachomatis strains, including the epidemic LGV serotype L2b.
5.
Fujimoto A Totoki Y Abe T Boroevich KA Hosoda F Nguyen HH Aoki M Hosono N Kubo M Miya F Arai Y Takahashi H Shirakihara T Nagasaki M Shibuya T Nakano K Watanabe-Makino K Tanaka H Nakamura H Kusuda J Ojima H Shimada K Okusaka T Ueno M Shigekawa Y Kawakami Y Arihiro K Ohdan H Gotoh K Ishikawa O Ariizumi S Yamamoto M Yamada T Chayama K Kosuge T Yamaue H Kamatani N Miyano S Nakagama H Nakamura Y Tsunoda T Shibata T Nakagawa H 《Nature genetics》2012,44(7):760-764
Hepatocellular carcinoma (HCC) is the third leading cause of cancer-related death worldwide. We sequenced and analyzed the whole genomes of 27 HCCs, 25 of which were associated with hepatitis B or C virus infections, including two sets of multicentric tumors. Although no common somatic mutations were identified in the multicentric tumor pairs, their whole-genome substitution patterns were similar, suggesting that these tumors developed from independent mutations, although their shared etiological backgrounds may have strongly influenced their somatic mutation patterns. Statistical and functional analyses yielded a list of recurrently mutated genes. Multiple chromatin regulators, including ARID1A, ARID1B, ARID2, MLL and MLL3, were mutated in ~50% of the tumors. Hepatitis B virus genome integration in the TERT locus was frequently observed in a high clonal proportion. Our whole-genome sequencing analysis of HCCs identified the influence of etiological background on somatic mutation patterns and subsequent carcinogenesis, as well as recurrent mutations in chromatin regulators in HCCs.
6.
Verjovski-Almeida S DeMarco R Martins EA Guimarães PE Ojopi EP Paquola AC Piazza JP Nishiyama MY Kitajima JP Adamson RE Ashton PD Bonaldo MF Coulson PS Dillon GP Farias LP Gregorio SP Ho PL Leite RA Malaquias LC Marques RC Miyasato PA Nascimento AL Ohlweiler FP Reis EM Ribeiro MA Sá RG Stukart GC Soares MB Gargioni C Kawano T Rodrigues V Madeira AM Wilson RA Menck CF Setubal JC Leite LC Dias-Neto E 《Nature genetics》2003,35(2):148-157
Schistosoma mansoni is the primary causative agent of schistosomiasis, which affects 200 million individuals in 74 countries. We generated 163,000 expressed-sequence tags (ESTs) from normalized cDNA libraries from six selected developmental stages of the parasite, resulting in 31,000 assembled sequences and 92% sampling of an estimated 14,000 gene complement. By analyzing automated Gene Ontology assignments, we provide a detailed view of important S. mansoni biological systems, including characterization of metazoa-specific and eukarya-conserved genes. Phylogenetic analysis suggests an early divergence from other metazoa. The data set provides insights into the molecular mechanisms of tissue organization, development, signaling, sexual dimorphism, host interactions and immune evasion and identifies novel proteins to be investigated as vaccine candidates and potential drug targets.
7.
Evolutionary and biomedical implications of a Schistosoma japonicum complementary DNA resource [Cited by: 21 (self-citations: 0, citations by others: 21)]
Hu W Yan Q Shen DK Liu F Zhu ZD Song HD Xu XR Wang ZJ Rong YP Zeng LC Wu J Zhang X Wang JJ Xu XN Wang SY Fu G Zhang XL Wang ZQ Brindley PJ McManus DP Xue CL Feng Z Chen Z Han ZG 《Nature genetics》2003,35(2):139-147
Schistosoma japonicum causes schistosomiasis in humans and livestock in the Asia-Pacific region. Knowledge of the genome of this parasite should improve understanding of schistosome-host interactions, biomedical aspects of schistosomiasis and invertebrate evolution. We assigned 43,707 expressed sequence tags (ESTs) derived from adult S. japonicum and their eggs to 13,131 gene clusters. Of these, 35% shared no similarity with known genes and 75% had not been reported previously in schistosomes. Notably, S. japonicum encoded mammalian-like receptors for insulin, progesterone, cytokines and neuropeptides, suggesting that host hormones, or endogenous parasite homologs, could orchestrate schistosome development and maturation and that schistosomes modulate anti-parasite immune responses through inhibitors, molecular mimicry and other evasion strategies.
8.
The completed draft version of the human genome, comprised of multiple short contigs encompassing 85% or more of euchromatin, was announced in June of 2000 (ref. 1). The detailed findings of the sequencing consortium were reported several months later. The draft sequence has provided insight into global characteristics, such as the total number of genes and a more accurate definition of gene families. Also of importance are genome positional details such as local genome architecture, regional gene density and the location of transcribed units that are critical for disease gene identification. We carried out a series of mapping and computational experiments using a nonredundant collection of 925 expressed sequence tags (ESTs) and sections of the public draft genome sequence that were available at different timepoints between April 2000 and April 2001. We found discrepancies in both the reported coverage of the human genome and the accuracy of mapping of genomic clones, suggesting some limitations of the draft genome sequence in providing accurate positional information and detailed characterization of chromosomal subregions.
9.
Gout AM;ADPKD Gene Variant Consortium Ravine D Harris PC Rossetti S Peters D Breuning M Henske EP Koizumi A Inoue S Shimizu Y Thongnoppakhun W Yenchitsomanus PT Deltas C Sandford R Torra R Turco AE Jeffery S Fontes M Somlo S Furu LM Smulders YM Mercier B Ferec C Burtey S Pei Y Kalaydjieva L Bogdanova N McCluskey M Geon LJ Wouters CH Reiterova J Stekrová J San Millan JL Aguiari G Del Senno L 《Nature genetics》2007,39(4):427-428
10.
《Nature genetics》2011,43(3):173
The substantial $10 million purse of the Archon Genomics X PRIZE (AGXP) is being offered for the generation of rapid, accurate and complete human DNA sequences. Because so many genomics researchers have a stake, we offer to help with a process of community consultation to help evolve fair and efficient methods to validate contestant data for the competition.
11.
Analysis of expressed sequence tags indicates 35,000 human genes [Cited by: 18 (self-citations: 0, citations by others: 18)]
The number of protein-coding genes in an organism provides a useful first measure of its molecular complexity. Single-celled prokaryotes and eukaryotes typically have a few thousand genes; for example, Escherichia coli has 4,300 and Saccharomyces cerevisiae has 6,000. Evolution of multicellularity appears to have been accompanied by a several-fold increase in gene number, the invertebrates Caenorhabditis elegans and Drosophila melanogaster having 19,000 and 13,600 genes, respectively. Here we estimate the number of human genes by comparing a set of human expressed sequence tag (EST) contigs with human chromosome 22 and with a non-redundant set of mRNA sequences. The two comparisons give mutually consistent estimates of approximately 35,000 genes, substantially lower than most previous estimates. Evolution of the increased physiological complexity of vertebrates may therefore have depended more on the combinatorial diversification of regulatory networks or alternative splicing than on a substantial increase in gene number.
12.
Many sequence variants affecting diversity of adult human height [Cited by: 1 (self-citations: 0, citations by others: 1)]
Gudbjartsson DF Walters GB Thorleifsson G Stefansson H Halldorsson BV Zusmanovich P Sulem P Thorlacius S Gylfason A Steinberg S Helgadottir A Ingason A Steinthorsdottir V Olafsdottir EJ Olafsdottir GH Jonsson T Borch-Johnsen K Hansen T Andersen G Jorgensen T Pedersen O Aben KK Witjes JA Swinkels DW den Heijer M Franke B Verbeek AL Becker DM Yanek LR Becker LC Tryggvadottir L Rafnar T Gulcher J Kiemeney LA Kong A Thorsteinsdottir U Stefansson K 《Nature genetics》2008,40(5):609-615
Adult human height is one of the classical complex human traits. We searched for sequence variants that affect height by scanning the genomes of 25,174 Icelanders, 2,876 Dutch, 1,770 European Americans and 1,148 African Americans. We then combined these results with previously published results from the Diabetes Genetics Initiative on 3,024 Scandinavians and tested a selected subset of SNPs in 5,517 Danes. We identified 27 regions of the genome with one or more sequence variants showing significant association with height. The estimated effects per allele of these variants ranged between 0.3 and 0.6 cm and, taken together, they explain around 3.7% of the population variation in height. The genes neighboring the identified loci cluster in biological processes related to skeletal development and mitosis. Association to three previously reported loci are replicated in our analyses, and the strongest association was with SNPs in the ZBTB38 gene.
13.
Complex SNP-related sequence variation in segmental genome duplications [Cited by: 23 (self-citations: 0, citations by others: 23)]
There is uncertainty about the true nature of predicted single-nucleotide polymorphisms (SNPs) in segmental duplications (duplicons) and whether these markers genuinely exist at increased density as indicated in public databases. We explored these issues by genotyping 157 predicted SNPs in duplicons and control regions in normal diploid genomes and fully homozygous complete hydatidiform moles. Our data identified many true SNPs in duplicon regions and few paralogous sequence variants. Twenty-eight percent of the polymorphic duplicon sequences we tested involved multisite variation, a new type of polymorphism representing the sum of the signals from many individual duplicon copies that vary in sequence content due to duplication, deletion or gene conversion. Multisite variations can masquerade as normal SNPs when genotyped. Given that duplicons comprise at least 5% of the genome and many are yet to be annotated in the genome draft, effective strategies to identify multisite variation must be established and deployed.
14.
15.
Y chromosome sequence variation and the history of human populations [Cited by: 48 (self-citations: 0, citations by others: 48)]
Underhill PA Shen P Lin AA Jin L Passarino G Yang WH Kauffman E Bonné-Tamir B Bertranpetit J Francalacci P Ibrahim M Jenkins T Kidd JR Mehdi SQ Seielstad MT Wells RS Piazza A Davis RW Feldman MW Cavalli-Sforza LL Oefner PJ 《Nature genetics》2000,26(3):358-361
Binary polymorphisms associated with the non-recombining region of the human Y chromosome (NRY) preserve the paternal genetic legacy of our species that has persisted to the present, permitting inference of human evolution, population affinity and demographic history. We used denaturing high-performance liquid chromatography (DHPLC; ref. 2) to identify 160 of the 166 bi-allelic and 1 tri-allelic site that formed a parsimonious genealogy of 116 haplotypes, several of which display distinct population affinities based on the analysis of 1062 globally representative individuals. A minority of contemporary East Africans and Khoisan represent the descendants of the most ancestral patrilineages of anatomically modern humans that left Africa between 35,000 and 89,000 years ago.
16.
A mixed 'clone-by-clone' and 'whole-genome shotgun' strategy will be used to determine the genomic sequence of the mouse. This method will allow a phase of rapid annotation of the contemporaneous human sequence draft, through whole-genome 'sample sequence comparisons'.
17.
M Marra T Kucaba M Sekhon L Hillier R Martienssen A Chinwalla J Crockett J Fedele H Grover C Gund W R McCombie K McDonald J McPherson N Mudd L Parnell J Schein R Seim P Shelby R Waterston R Wilson 《Nature genetics》1999,22(3):265-270
Arabidopsis thaliana has emerged as a model system for studies of plant genetics and development, and its genome has been targeted for sequencing by an international consortium (the Arabidopsis Genome Initiative; http://genome-www.stanford.edu/Arabidopsis/agi.html). To support the genome-sequencing effort, we fingerprinted more than 20,000 BACs (ref. 2) from two high-quality publicly available libraries, generating an estimated 17-fold redundant coverage of the genome, and used the fingerprints to nucleate assembly of the data by computer. Subsequent manual revision of the assemblies resulted in the incorporation of 19,661 fingerprinted BACs into 169 ordered sets of overlapping clones ('contigs'), each containing at least 3 clones. These contigs are ideal for parallel selection of BACs for large-scale sequencing and have supported the generation of more than 5.8 Mb of finished genome sequence submitted to GenBank; analysis of the sequence has confirmed the integrity of contigs constructed using this fingerprint data. Placement of contigs onto chromosomes can now be performed, and is being pursued by groups involved in both sequencing and positional cloning studies. To our knowledge, these data provide the first example of whole-genome random BAC fingerprint analysis of a eucaryote, and have provided a model essential to efforts aimed at generating similar databases of fingerprint contigs to support sequencing of other complex genomes, including that of human.
18.
Reductive evolution suggested from the complete genome sequence of a plant-pathogenic phytoplasma [Cited by: 20 (self-citations: 0, citations by others: 20)]
Oshima K Kakizawa S Nishigawa H Jung HY Wei W Suzuki S Arashida R Nakata D Miyata S Ugaki M Namba S 《Nature genetics》2004,36(1):27-29
The minimal gene set essential for life has long been sought. We report the 860-kb genome of the obligate intracellular plant pathogen phytoplasma (Candidatus Phytoplasma asteris, OY strain). The phytoplasma genome encodes even fewer metabolic functions than do mycoplasma genomes. It lacks the pentose phosphate cycle and, more unexpectedly, ATP-synthase subunits, which are thought to be essential for life. This may be the result of reductive evolution as a consequence of life as an intracellular parasite in a nutrient-rich environment.
19.
Brown KM Macgregor S Montgomery GW Craig DW Zhao ZZ Iyadurai K Henders AK Homer N Campbell MJ Stark M Thomas S Schmid H Holland EA Gillanders EM Duffy DL Maskiell JA Jetann J Ferguson M Stephan DA Cust AE Whiteman D Green A Olsson H Puig S Ghiorzo P Hansson J Demenais F Goldstein AM Gruis NA Elder DE Bishop JN Kefford RF Giles GG Armstrong BK Aitken JF Hopper JL Martin NG Trent JM Mann GJ Hayward NK 《Nature genetics》2008,40(7):838-840
We conducted a genome-wide association pooling study for cutaneous melanoma and performed validation in samples totaling 2,019 cases and 2,105 controls. Using pooling, we identified a new melanoma risk locus on chromosome 20 (rs910873 and rs1885120), with replication in two further samples (combined P < 1 × 10⁻¹⁵). The per allele odds ratio was 1.75 (1.53, 2.01), with evidence for stronger association in early-onset cases.
20.
Akman L Yamashita A Watanabe H Oshima K Shiba T Hattori M Aksoy S 《Nature genetics》2002,32(3):402-407
Many insects that rely on a single food source throughout their developmental cycle harbor beneficial microbes that provide nutrients absent from their restricted diet. Tsetse flies, the vectors of African trypanosomes, feed exclusively on blood and rely on one such intracellular microbe for nutritional provisioning and fecundity. As a result of co-evolution with hosts over millions of years, these mutualists have lost the ability to survive outside the sheltered environment of their host insect cells. We present the complete annotated genome of Wigglesworthia glossinidia brevipalpis, which is composed of one chromosome of 697,724 base pairs (bp) and one small plasmid, called pWig1, of 5,200 bp. Genes involved in the biosynthesis of vitamin metabolites, apparently essential for host nutrition and fecundity, have been retained. Unexpectedly, this obligate's genome bears hallmarks of both parasitic and free-living microbes, and the gene encoding the important regulatory protein DnaA is absent.