共查询到20条相似文献,搜索用时 15 毫秒
1.
The map-based sequence of the rice genome 总被引:14,自引:0,他引:14
International Rice Genome Sequencing Project 《Nature》2005,436(7052):793-800
Rice, one of the world's most important food plants, has important syntenic relationships with the other cereal species and is a model plant for the grasses. Here we present a map-based, finished quality sequence that covers 95% of the 389 Mb genome, including virtually all of the euchromatin and two complete centromeres. A total of 37,544 non-transposable-element-related protein-coding genes were identified, of which 71% had a putative homologue in Arabidopsis. In a reciprocal analysis, 90% of the Arabidopsis proteins had a putative homologue in the predicted rice proteome. Twenty-nine per cent of the 37,544 predicted genes appear in clustered gene families. The number and classes of transposable elements found in the rice genome are consistent with the expansion of syntenic regions in the maize and sorghum genomes. We find evidence for widespread and recurrent gene transfer from the organelles to the nuclear chromosomes. The map-based sequence has proven useful for the identification of genes underlying agronomic traits. The additional single-nucleotide polymorphisms and simple sequence repeats identified in our study should accelerate improvements in rice production. 相似文献
2.
The genome sequence of the rice blast fungus Magnaporthe grisea 总被引:8,自引:0,他引:8
Dean RA Talbot NJ Ebbole DJ Farman ML Mitchell TK Orbach MJ Thon M Kulkarni R Xu JR Pan H Read ND Lee YH Carbone I Brown D Oh YY Donofrio N Jeong JS Soanes DM Djonovic S Kolomiets E Rehmeyer C Li W Harding M Kim S Lebrun MH Bohnert H Coughlan S Butler J Calvo S Ma LJ Nicol R Purcell S Nusbaum C Galagan JE Birren BW 《Nature》2005,434(7036):980-986
Magnaporthe grisea is the most destructive pathogen of rice worldwide and the principal model organism for elucidating the molecular basis of fungal disease of plants. Here, we report the draft sequence of the M. grisea genome. Analysis of the gene set provides an insight into the adaptations required by a fungus to cause disease. The genome encodes a large and diverse set of secreted proteins, including those defined by unusual carbohydrate-binding domains. This fungus also possesses an expanded family of G-protein-coupled receptors, several new virulence-associated genes and large suites of enzymes involved in secondary metabolism. Consistent with a role in fungal pathogenesis, the expression of several of these genes is upregulated during the early stages of infection-related development. The M. grisea genome has been subject to invasion and proliferation of active transposable elements, reflecting the clonal nature of this fungus imposed by widespread rice cultivation. 相似文献
3.
YU JinSheng FAN YouRong LIU Nan SHAN Yan LI XiangHua ZHANG QiFa 《科学通报(英文版)》2007,52(7):912-921
Pms1, a locus for photoperiod sensitive genic male sterility in rice, was identified and mapped to chromosome 7 in previous studies. Here we report an effort to identify the candidate genes for Pms1 by comparative sequencing of BAC clones from two cultivars Minghui 63 and Nongken 58, the parents for the initial mapping population. Annotation and comparison of the sequences of the two clones resulted in a total of five potential candidates which should be functionally tested. We also conducted com-parative analysis of sequences of these two cultivars with two other cultivars, Nipponbare and 93-11, for which sequence data were available in public databases. The analysis revealed large differences in sequence composition among the four genotypes in the Pms1 region primarily due to retroelement activity leading to rapid recent growth and divergence of the genomes. High levels of polymorphism in the forms of indels and SNPs were found both in intra- and inter-subspecific comparisons. Dating analysis using LTRs of the retroelements in this region showed that the substitution rate of LTRs was much higher than reported in the literature. The results provided strong evidence for rapid genomic evolution of this region as a consequence of natural and artificial selection. 相似文献
4.
Gregory SG Barlow KF McLay KE Kaul R Swarbreck D Dunham A Scott CE Howe KL Woodfine K Spencer CC Jones MC Gillson C Searle S Zhou Y Kokocinski F McDonald L Evans R Phillips K Atkinson A Cooper R Jones C Hall RE Andrews TD Lloyd C Ainscough R Almeida JP Ambrose KD Anderson F Andrew RW Ashwell RI Aubin K Babbage AK Bagguley CL Bailey J Beasley H Bethel G Bird CP Bray-Allen S Brown JY Brown AJ Buckley D Burton J Bye J Carder C Chapman JC Clark SY Clarke G Clee C Cobley V Collier RE Corby N 《Nature》2006,441(7091):315-321
The reference sequence for each human chromosome provides the framework for understanding genome function, variation and evolution. Here we report the finished sequence and biological annotation of human chromosome 1. Chromosome 1 is gene-dense, with 3,141 genes and 991 pseudogenes, and many coding sequences overlap. Rearrangements and mutations of chromosome 1 are prevalent in cancer and many other diseases. Patterns of sequence variation reveal signals of recent selection in specific genes that may contribute to human fitness, and also in regions where no function is evident. Fine-scale recombination occurs in hotspots of varying intensity along the sequence, and is enriched near genes. These and other studies of human biology and disease encoded within chromosome 1 are made possible with the highly accurate annotated sequence, as part of the completed set of chromosome sequences that comprise the reference human genome. 相似文献
5.
Wood V Gwilliam R Rajandream MA Lyne M Lyne R Stewart A Sgouros J Peat N Hayles J Baker S Basham D Bowman S Brooks K Brown D Brown S Chillingworth T Churcher C Collins M Connor R Cronin A Davis P Feltwell T Fraser A Gentles S Goble A Hamlin N Harris D Hidalgo J Hodgson G Holroyd S Hornsby T Howarth S Huckle EJ Hunt S Jagels K James K Jones L Jones M Leather S McDonald S McLean J Mooney P Moule S Mungall K Murphy L Niblett D Odell C Oliver K O'Neil S Pearson D Quail MA Rabbinowitsch E 《Nature》2002,415(6874):871-880
We have sequenced and annotated the genome of fission yeast (Schizosaccharomyces pombe), which contains the smallest number of protein-coding genes yet recorded for a eukaryote: 4,824. The centromeres are between 35 and 110 kilobases (kb) and contain related repeats including a highly conserved 1.8-kb element. Regions upstream of genes are longer than in budding yeast (Saccharomyces cerevisiae), possibly reflecting more-extended control regions. Some 43% of the genes contain introns, of which there are 4,730. Fifty genes have significant similarity with human disease genes; half of these are cancer related. We identify highly conserved genes important for eukaryotic cell organization including those required for the cytoskeleton, compartmentation, cell-cycle control, proteolysis, protein phosphorylation and RNA splicing. These genes may have originated with the appearance of eukaryotic life. Few similarly conserved genes that are important for multicellular organization were identified, suggesting that the transition from prokaryotes to eukaryotes required more new genes than did the transition from unicellular to multicellular organization. 相似文献
6.
7.
YU Jun HU Songnian WANG Jun LI Songgang WONG Ka-Shu Gane LIU Bin DENG Yajun DAI Li YUAN Longping YANG Huanming 《科学通报(英文版)》2001,46(23):1937-1942
The sequence of the rice genome holds fundamental information for its biology, including physiology, genetics, development,
and evolution, as well as information on many beneficial phenotypes of economic significance. Using a “whole genome shotgun”
approach, we have produced a draft rice genome sequence ofOryza sativa ssp.indica, the major crop rice subspecies in China and many other regions of Asia. The draft genome sequence is constructed from over
4.3 million successful sequencing traces with an accumulative total length of 2214.9 Mb. The initial assembly of the non-redundant
sequences reached 409.76 Mb in length, based on 3.30 million successful sequencing traces with a total length of 1797.4 Mb
from anindica variant cultivar93-11, giving an estimated coverage of 95.29% of the rice genome with an average base accuracy of higher than 99%. The coverage
of the draft sequence, the randomness of the sequence distribution, and the consistency of BIG-ASSEMBLER, a custom-designed
software package used for the initial assembly, were verified rigorously by comparisons against finished BAC clone sequences
from bothindica andjapanica strains, available from the public databases. Over all, 96.3% of full-length cDNAs, 96.4% of STS, STR, RFLP markers, 94.0%
of ESTs and 94.9% unigene clusters were identified from the draft sequence. Our preliminary analysis on the data set shows
that our rice draft sequence is consistent with the comman standard accepted by the genome sequencing community. The unconditional
release of the draft to the public also undoubtedly provides a fundamental resource to the international scientific communities
to facilitate genomic and genetic studies on rice biology.
These authors contributed equally to this work. 相似文献
8.
Hillier LW Fulton RS Fulton LA Graves TA Pepin KH Wagner-McPherson C Layman D Maas J Jaeger S Walker R Wylie K Sekhon M Becker MC O'Laughlin MD Schaller ME Fewell GA Delehaunty KD Miner TL Nash WE Cordes M Du H Sun H Edwards J Bradshaw-Cordum H Ali J Andrews S Isak A Vanbrunt A Nguyen C Du F Lamar B Courtney L Kalicki J Ozersky P Bielicki L Scott K Holmes A Harkins R Harris A Strong CM Hou S Tomlinson C Dauphin-Kohlberg S Kozlowicz-Reilly A Leonard S Rohlfing T Rock SM Tin-Wollam AM Abbott A 《Nature》2003,424(6945):157-164
9.
The DNA sequence of human chromosome 22 总被引:75,自引:0,他引:75
Dunham I Shimizu N Roe BA Chissoe S Hunt AR Collins JE Bruskiewich R Beare DM Clamp M Smink LJ Ainscough R Almeida JP Babbage A Bagguley C Bailey J Barlow K Bates KN Beasley O Bird CP Blakey S Bridgeman AM Buck D Burgess J Burrill WD O'Brien KP 《Nature》1999,402(6761):489-495
Knowledge of the complete genomic DNA sequence of an organism allows a systematic approach to defining its genetic components. The genomic sequence provides access to the complete structures of all genes, including those without known function, their control elements, and, by inference, the proteins they encode, as well as all other biologically important sequences. Furthermore, the sequence is a rich and permanent source of information for the design of further biological studies of the organism and for the study of evolution through cross-species sequence comparison. The power of this approach has been amply demonstrated by the determination of the sequences of a number of microbial and model organisms. The next step is to obtain the complete sequence of the entire human genome. Here we report the sequence of the euchromatic part of human chromosome 22. The sequence obtained consists of 12 contiguous segments spanning 33.4 megabases, contains at least 545 genes and 134 pseudogenes, and provides the first view of the complex chromosomal landscapes that will be found in the rest of the genome. 相似文献
10.
Hattori M Fujiyama A Taylor TD Watanabe H Yada T Park HS Toyoda A Ishii K Totoki Y Choi DK Groner Y Soeda E Ohki M Takagi T Sakaki Y Taudien S Blechschmidt K Polley A Menzel U Delabar J Kumpf K Lehmann R Patterson D Reichwald K Rump A Schillhabel M Schudy A Zimmermann W Rosenthal A Kudoh J Schibuya K Kawasaki K Asakawa S Shintani A Sasaki T Nagamine K Mitsuyama S Antonarakis SE Minoshima S Shimizu N Nordsiek G Hornischer K Brant P Scharfe M Schon O Desario A Reichelt J Kauer G Blocker H 《Nature》2000,405(6784):311-319
Chromosome 21 is the smallest human autosome. An extra copy of chromosome 21 causes Down syndrome, the most frequent genetic cause of significant mental retardation, which affects up to 1 in 700 live births. Several anonymous loci for monogenic disorders and predispositions for common complex disorders have also been mapped to this chromosome, and loss of heterozygosity has been observed in regions associated with solid tumours. Here we report the sequence and gene catalogue of the long arm of chromosome 21. We have sequenced 33,546,361 base pairs (bp) of DNA with very high accuracy, the largest contig being 25,491,867 bp. Only three small clone gaps and seven sequencing gaps remain, comprising about 100 kilobases. Thus, we achieved 99.7% coverage of 21q. We also sequenced 281,116 bp from the short arm. The structural features identified include duplications that are probably involved in chromosomal abnormalities and repeat structures in the telomeric and pericentromeric regions. Analysis of the chromosome revealed 127 known genes, 98 predicted genes and 59 pseudogenes. 相似文献
11.
12.
Grimwood J Gordon LA Olsen A Terry A Schmutz J Lamerdin J Hellsten U Goodstein D Couronne O Tran-Gyamfi M Aerts A Altherr M Ashworth L Bajorek E Black S Branscomb E Caenepeel S Carrano A Caoile C Chan YM Christensen M Cleland CA Copeland A Dalin E Dehal P Denys M Detter JC Escobar J Flowers D Fotopulos D Garcia C Georgescu AM Glavina T Gomez M Gonzales E Groza M Hammon N Hawkins T Haydu L Ho I Huang W Israni S Jett J Kadner K Kimball H Kobayashi A Larionov V Leem SH Lopez F Lou Y Lowry S 《Nature》2004,428(6982):529-535
Chromosome 19 has the highest gene density of all human chromosomes, more than double the genome-wide average. The large clustered gene families, corresponding high G + C content, CpG islands and density of repetitive DNA indicate a chromosome rich in biological and evolutionary significance. Here we describe 55.8 million base pairs of highly accurate finished sequence representing 99.9% of the euchromatin portion of the chromosome. Manual curation of gene loci reveals 1,461 protein-coding genes and 321 pseudogenes. Among these are genes directly implicated in mendelian disorders, including familial hypercholesterolaemia and insulin-resistant diabetes. Nearly one-quarter of these genes belong to tandemly arranged families, encompassing more than 25% of the chromosome. Comparative analyses show a fascinating picture of conservation and divergence, revealing large blocks of gene orthology with rodents, scattered regions with more recent gene family expansions and deletions, and segments of coding and non-coding conservation with the distant fish species Takifugu. 相似文献
13.
Martin J Han C Gordon LA Terry A Prabhakar S She X Xie G Hellsten U Chan YM Altherr M Couronne O Aerts A Bajorek E Black S Blumer H Branscomb E Brown NC Bruno WJ Buckingham JM Callen DF Campbell CS Campbell ML Campbell EW Caoile C Challacombe JF Chasteen LA Chertkov O Chi HC Christensen M Clark LM Cohn JD Denys M Detter JC Dickson M Dimitrijevic-Bussod M Escobar J Fawcett JJ Flowers D Fotopulos D Glavina T Gomez M Gonzales E Goodstein D Goodwin LA Grady DL Grigoriev I Groza M Hammon N Hawkins T 《Nature》2004,432(7020):988-994
14.
Wang J Wang W Li R Li Y Tian G Goodman L Fan W Zhang J Li J Zhang J Guo Y Feng B Li H Lu Y Fang X Liang H Du Z Li D Zhao Y Hu Y Yang Z Zheng H Hellmann I Inouye M Pool J Yi X Zhao J Duan J Zhou Y Qin J Ma L Li G Yang Z Zhang G Yang B Yu C Liang F Li W Li S Li D Ni P Ruan J Li Q Zhu H Liu D Lu Z Li N Guo G Zhang J Ye J Fang L Hao Q Chen Q Liang Y Su Y San A Ping C Yang S Chen F Li L Zhou K Zheng H Ren Y Yang L Gao Y Yang G Li Z Feng X Kristiansen K Wong GK Nielsen R Durbin R Bolund L Zhang X 《Nature》2008,456(7218):60-65
Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics. 相似文献
15.
Dunham A Matthews LH Burton J Ashurst JL Howe KL Ashcroft KJ Beare DM Burford DC Hunt SE Griffiths-Jones S Jones MC Keenan SJ Oliver K Scott CE Ainscough R Almeida JP Ambrose KD Andrews DT Ashwell RI Babbage AK Bagguley CL Bailey J Bannerjee R Barlow KF Bates K Beasley H Bird CP Bray-Allen S Brown AJ Brown JY Burrill W Carder C Carter NP Chapman JC Clamp ME Clark SY Clarke G Clee CM Clegg SC Cobley V Collins JE Corby N Coville GJ Deloukas P Dhami P Dunham I Dunn M Earthrowl ME Ellington AG 《Nature》2004,428(6982):522-528
Chromosome 13 is the largest acrocentric human chromosome. It carries genes involved in cancer including the breast cancer type 2 (BRCA2) and retinoblastoma (RB1) genes, is frequently rearranged in B-cell chronic lymphocytic leukaemia, and contains the DAOA locus associated with bipolar disorder and schizophrenia. We describe completion and analysis of 95.5 megabases (Mb) of sequence from chromosome 13, which contains 633 genes and 296 pseudogenes. We estimate that more than 95.4% of the protein-coding genes of this chromosome have been identified, on the basis of comparison with other vertebrate genome sequences. Additionally, 105 putative non-coding RNA genes were found. Chromosome 13 has one of the lowest gene densities (6.5 genes per Mb) among human chromosomes, and contains a central region of 38 Mb where the gene density drops to only 3.1 genes per Mb. 相似文献
16.
Mungall AJ Palmer SA Sims SK Edwards CA Ashurst JL Wilming L Jones MC Horton R Hunt SE Scott CE Gilbert JG Clamp ME Bethel G Milne S Ainscough R Almeida JP Ambrose KD Andrews TD Ashwell RI Babbage AK Bagguley CL Bailey J Banerjee R Barker DJ Barlow KF Bates K Beare DM Beasley H Beasley O Bird CP Blakey S Bray-Allen S Brook J Brown AJ Brown JY Burford DC Burrill W Burton J Carder C Carter NP Chapman JC Clark SY Clark G Clee CM Clegg S Cobley V Collier RE Collins JE Colman LK Corby NR Coville GJ 《Nature》2003,425(6960):805-811
17.
Heilig R Eckenberg R Petit JL Fonknechten N Da Silva C Cattolico L Levy M Barbe V de Berardinis V Ureta-Vidal A Pelletier E Vico V Anthouard V Rowen L Madan A Qin S Sun H Du H Pepin K Artiguenave F Robert C Cruaud C Brüls T Jaillon O Friedlander L Samson G Brottier P Cure S Ségurens B Anière F Samain S Crespeau H Abbasi N Aiach N Boscus D Dickhoff R Dors M Dubois I Friedman C Gouyvenoux M James R Madan A Mairey-Estrada B Mangenot S Martins N Ménard M Oztas S Ratcliffe A Shaffer T Trask B 《Nature》2003,421(6923):601-607
Chromosome 14 is one of five acrocentric chromosomes in the human genome. These chromosomes are characterized by a heterochromatic short arm that contains essentially ribosomal RNA genes, and a euchromatic long arm in which most, if not all, of the protein-coding genes are located. The finished sequence of human chromosome 14 comprises 87,410,661 base pairs, representing 100% of its euchromatic portion, in a single continuous segment covering the entire long arm with no gaps. Two loci of crucial importance for the immune system, as well as more than 60 disease genes, have been localized so far on chromosome 14. We identified 1,050 genes and gene fragments, and 393 pseudogenes. On the basis of comparisons with other vertebrate genomes, we estimate that more than 96% of the chromosome 14 genes have been annotated. From an analysis of the CpG island occurrences, we estimate that 70% of these annotated genes are complete at their 5' end. 相似文献
18.
We have applied the newly developed hierarchical structure theory for complex systems to analyze the multi-scaling structures
of the nucleotide density distribution along a linear DNA sequence from the completeEscherichia coli genome. The hierarchical symmetry in the nucleotide density distribution was demonstrated. In particular, we have shown that
the G, C density distribution that represents a strong H-bonding between the two DNA chains is more coherent with smaller
similarity parameter compared to that of A, T density distribution, indicating a better organized multi-scaling fluctuation
field for G, C density distribution along the genome sequence. The biological significance of these findings is under investigation. 相似文献
19.
Scherer SE Muzny DM Buhay CJ Chen R Cree A Ding Y Dugan-Rocha S Gill R Gunaratne P Harris RA Hawes AC Hernandez J Hodgson AV Hume J Jackson A Khan ZM Kovar-Smith C Lewis LR Lozado RJ Metzker ML Milosavljevic A Miner GR Montgomery KT Morgan MB Nazareth LV Scott G Sodergren E Song XZ Steffen D Lovering RC Wheeler DA Worley KC Yuan Y Zhang Z Adams CQ Ansari-Lari MA Ayele M Brown MJ Chen G Chen Z Clerc-Blankenburg KP Davis C Delgado O Dinh HH Draper H Gonzalez-Garay ML Havlak P Jackson LR Jacob LS 《Nature》2006,440(7082):346-351
Human chromosome 12 contains more than 1,400 coding genes and 487 loci that have been directly implicated in human disease. The q arm of chromosome 12 contains one of the largest blocks of linkage disequilibrium found in the human genome. Here we present the finished sequence of human chromosome 12, which has been finished to high quality and spans approximately 132 megabases, representing approximately 4.5% of the human genome. Alignment of the human chromosome 12 sequence across vertebrates reveals the origin of individual segments in chicken, and a unique history of rearrangement through rodent and primate lineages. The rate of base substitutions in recent evolutionary history shows an overall slowing in hominids compared with primates and rodents. 相似文献
20.
Ross MT Grafham DV Coffey AJ Scherer S McLay K Muzny D Platzer M Howell GR Burrows C Bird CP Frankish A Lovell FL Howe KL Ashurst JL Fulton RS Sudbrak R Wen G Jones MC Hurles ME Andrews TD Scott CE Searle S Ramser J Whittaker A Deadman R Carter NP Hunt SE Chen R Cree A Gunaratne P Havlak P Hodgson A Metzker ML Richards S Scott G Steffen D Sodergren E Wheeler DA Worley KC Ainscough R Ambrose KD Ansari-Lari MA Aradhya S Ashwell RI Babbage AK Bagguley CL Ballabio A Banerjee R Barker GE Barlow KF 《Nature》2005,434(7031):325-337
The human X chromosome has a unique biology that was shaped by its evolution as the sex chromosome shared by males and females. We have determined 99.3% of the euchromatic sequence of the X chromosome. Our analysis illustrates the autosomal origin of the mammalian sex chromosomes, the stepwise process that led to the progressive loss of recombination between X and Y, and the extent of subsequent degradation of the Y chromosome. LINE1 repeat elements cover one-third of the X chromosome, with a distribution that is consistent with their proposed role as way stations in the process of X-chromosome inactivation. We found 1,098 genes in the sequence, of which 99 encode proteins expressed in testis and in various tumour types. A disproportionately high number of mendelian diseases are documented for the X chromosome. Of this number, 168 have been explained by mutations in 113 X-linked genes, which in many cases were characterized with the aid of the DNA sequence. 相似文献