首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 281 毫秒
1.
Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana   总被引:21,自引:0,他引:21  
Arabidopsis thaliana (Arabidopsis) is unique among plant model organisms in having a small genome (130-140 Mb), excellent physical and genetic maps, and little repetitive DNA. Here we report the sequence of chromosome 2 from the Columbia ecotype in two gap-free assemblies (contigs) of 3.6 and 16 megabases (Mb). The latter represents the longest published stretch of uninterrupted DNA sequence assembled from any organism to date. Chromosome 2 represents 15% of the genome and encodes 4,037 genes, 49% of which have no predicted function. Roughly 250 tandem gene duplications were found in addition to large-scale duplications of about 0.5 and 4.5 Mb between chromosomes 2 and 1 and between chromosomes 2 and 4, respectively. Sequencing of nearly 2 Mb within the genetically defined centromere revealed a low density of recognizable genes, and a high density and diverse range of vestigial and presumably inactive mobile elements. More unexpected is what appears to be a recent insertion of a continuous stretch of 75% of the mitochondrial genome into chromosome 2.  相似文献   

2.
The genome of the flowering plant Arabidopsis thaliana has five chromosomes. Here we report the sequence of the largest, chromosome 1, in two contigs of around 14.2 and 14.6 megabases. The contigs extend from the telomeres to the centromeric borders, regions rich in transposons, retrotransposons and repetitive elements such as the 180-base-pair repeat. The chromosome represents 25% of the genome and contains about 6,850 open reading frames, 236 transfer RNAs (tRNAs) and 12 small nuclear RNAs. There are two clusters of tRNA genes at different places on the chromosome. One consists of 27 tRNA(Pro) genes and the other contains 27 tandem repeats of tRNA(Tyr)-tRNA(Tyr)-tRNA(Ser) genes. Chromosome 1 contains about 300 gene families with clustered duplications. There are also many repeat elements, representing 8% of the sequence.  相似文献   

3.
The genome of the model plant Arabidopsis thaliana has been sequenced by an international collaboration, The Arabidopsis Genome Initiative. Here we report the complete sequence of chromosome 5. This chromosome is 26 megabases long; it is the second largest Arabidopsis chromosome and represents 21% of the sequenced regions of the genome. The sequence of chromosomes 2 and 4 have been reported previously and that of chromosomes 1 and 3, together with an analysis of the complete genome sequence, are reported in this issue. Analysis of the sequence of chromosome 5 yields further insights into centromere structure and the sequence determinants of heterochromatin condensation. The 5,874 genes encoded on chromosome 5 reveal several new functions in plants, and the patterns of gene organization provide insights into the mechanisms and extent of genome evolution in plants.  相似文献   

4.
The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.  相似文献   

5.
Sequence and analysis of rice chromosome 4   总被引:1,自引:0,他引:1  
Feng Q  Zhang Y  Hao P  Wang S  Fu G  Huang Y  Li Y  Zhu J  Liu Y  Hu X  Jia P  Zhang Y  Zhao Q  Ying K  Yu S  Tang Y  Weng Q  Zhang L  Lu Y  Mu J  Lu Y  Zhang LS  Yu Z  Fan D  Liu X  Lu T  Li C  Wu Y  Sun T  Lei H  Li T  Hu H  Guan J  Wu M  Zhang R  Zhou B  Chen Z  Chen L  Jin Z  Wang R  Yin H  Cai Z  Ren S  Lv G  Gu W  Zhu G  Tu Y  Jia J  Zhang Y  Chen J  Kang H  Chen X  Shao C  Sun Y  Hu Q  Zhang X  Zhang W  Wang L  Ding C  Sheng H  Gu J  Chen S  Ni L  Zhu F  Chen W  Lan L  Lai Y  Cheng Z  Gu M  Jiang J  Li J  Hong G  Xue Y  Han B 《Nature》2002,420(6913):316-320
Rice is the principal food for over half of the population of the world. With its genome size of 430 megabase pairs (Mb), the cultivated rice species Oryza sativa is a model plant for genome research. Here we report the sequence analysis of chromosome 4 of O. sativa, one of the first two rice chromosomes to be sequenced completely. The finished sequence spans 34.6 Mb and represents 97.3% of the chromosome. In addition, we report the longest known sequence for a plant centromere, a completely sequenced contig of 1.16 Mb corresponding to the centromeric region of chromosome 4. We predict 4,658 protein coding genes and 70 transfer RNA genes. A total of 1,681 predicted genes match available unique rice expressed sequence tags. Transposable elements have a pronounced bias towards the euchromatic regions, indicating a close correlation of their distributions to genes along the chromosome. Comparative genome analysis between cultivated rice subspecies shows that there is an overall syntenic relationship between the chromosomes and divergence at the level of single-nucleotide polymorphisms and insertions and deletions. By contrast, there is little conservation in gene order between rice and Arabidopsis.  相似文献   

6.
After the completion of a draft human genome sequence, the International Human Genome Sequencing Consortium has proceeded to finish and annotate each of the 24 chromosomes comprising the human genome. Here we describe the sequencing and analysis of human chromosome 3, one of the largest human chromosomes. Chromosome 3 comprises just four contigs, one of which currently represents the longest unbroken stretch of finished DNA sequence known so far. The chromosome is remarkable in having the lowest rate of segmental duplication in the genome. It also includes a chemokine receptor gene cluster as well as numerous loci involved in multiple human cancers such as the gene encoding FHIT, which contains the most common constitutive fragile site in the genome, FRA3B. Using genomic sequence from chimpanzee and rhesus macaque, we were able to characterize the breakpoints defining a large pericentric inversion that occurred some time after the split of Homininae from Ponginae, and propose an evolutionary history of the inversion.  相似文献   

7.
The map-based sequence of the rice genome   总被引:14,自引:0,他引:14  
Rice, one of the world's most important food plants, has important syntenic relationships with the other cereal species and is a model plant for the grasses. Here we present a map-based, finished quality sequence that covers 95% of the 389 Mb genome, including virtually all of the euchromatin and two complete centromeres. A total of 37,544 non-transposable-element-related protein-coding genes were identified, of which 71% had a putative homologue in Arabidopsis. In a reciprocal analysis, 90% of the Arabidopsis proteins had a putative homologue in the predicted rice proteome. Twenty-nine per cent of the 37,544 predicted genes appear in clustered gene families. The number and classes of transposable elements found in the rice genome are consistent with the expansion of syntenic regions in the maize and sorghum genomes. We find evidence for widespread and recurrent gene transfer from the organelles to the nuclear chromosomes. The map-based sequence has proven useful for the identification of genes underlying agronomic traits. The additional single-nucleotide polymorphisms and simple sequence repeats identified in our study should accelerate improvements in rice production.  相似文献   

8.
A physical map of the mouse genome   总被引:1,自引:0,他引:1  
A physical map of a genome is an essential guide for navigation, allowing the location of any gene or other landmark in the chromosomal DNA. We have constructed a physical map of the mouse genome that contains 296 contigs of overlapping bacterial clones and 16,992 unique markers. The mouse contigs were aligned to the human genome sequence on the basis of 51,486 homology matches, thus enabling use of the conserved synteny (correspondence between chromosome blocks) of the two genomes to accelerate construction of the mouse map. The map provides a framework for assembly of whole-genome shotgun sequence data, and a tile path of clones for generation of the reference sequence. Definition of the human-mouse alignment at this level of resolution enables identification of a mouse clone that corresponds to almost any position in the human genome. The human sequence may be used to facilitate construction of other mammalian genome maps using the same strategy.  相似文献   

9.
In eukaryotes, the ubiquitin-mediated protein degradation pathway has been shown to control several key biological processes such as cell division, development, metabolism and immune response. F-box proteins, as a part of SCF (Skp1-Cullin (or Cdc53)-F-box) complex, functioned by interacting with substrate proteins, leading to their subsequent degradation by the 26S proteasome. To date, several F-box proteins identified in Arabidopsis and Antirrhinum have been shown to play important roles in auxin signal transduction, floral organ formation, flowering and leaf senescence. Arabidopsis genome sequence analysis revealed that it encodes over 1000 predicted F-box proteins accounting for about 5% of total predicted proteins. These results indicate that the ubiquitin-mediated protein degradation involving the F-box proteins is an important mechanism controlling plant gene expression. Here, we review the known F-box proteins and their functionsin flowering plants.  相似文献   

10.
Species of malaria parasite that infect rodents have long been used as models for malaria disease research. Here we report the whole-genome shotgun sequence of one species, Plasmodium yoelii yoelii, and comparative studies with the genome of the human malaria parasite Plasmodium falciparum clone 3D7. A synteny map of 2,212 P. y. yoelii contiguous DNA sequences (contigs) aligned to 14 P. falciparum chromosomes reveals marked conservation of gene synteny within the body of each chromosome. Of about 5,300 P. falciparum genes, more than 3,300 P. y. yoelii orthologues of predominantly metabolic function were identified. Over 800 copies of a variant antigen gene located in subtelomeric regions were found. This is the first genome sequence of a model eukaryotic parasite, and it provides insight into the use of such systems in the modelling of Plasmodium biology and disease.  相似文献   

11.
The genome sequence and structure of rice chromosome 1   总被引:2,自引:0,他引:2  
The rice species Oryza sativa is considered to be a model plant because of its small genome size, extensive genetic map, relative ease of transformation and synteny with other cereal crops. Here we report the essentially complete sequence of chromosome 1, the longest chromosome in the rice genome. We summarize characteristics of the chromosome structure and the biological insight gained from the sequence. The analysis of 43.3 megabases (Mb) of non-overlapping sequence reveals 6,756 protein coding genes, of which 3,161 show homology to proteins of Arabidopsis thaliana, another model plant. About 30% (2,073) of the genes have been functionally categorized. Rice chromosome 1 is (G + C)-rich, especially in its coding regions, and is characterized by several gene families that are dispersed or arranged in tandem repeats. Comparison with a draft sequence indicates the importance of a high-quality finished sequence.  相似文献   

12.
Hyman RW  Fung E  Conway A  Kurdi O  Mao J  Miranda M  Nakao B  Rowley D  Tamaki T  Wang F  Davis RW 《Nature》2002,419(6906):534-537
The human malaria parasite Plasmodium falciparum is responsible for the death of more than a million people every year. To stimulate basic research on the disease, and to promote the development of effective drugs and vaccines against the parasite, the complete genome of P. falciparum clone 3D7 has been sequenced, using a chromosome-by-chromosome shotgun strategy. Here we report the nucleotide sequence of the third largest of the parasite's 14 chromosomes, chromosome 12, which comprises about 10% of the 23-megabase genome. As the most (A + T)-rich (80.6%) genome sequenced to date, the P. falciparum genome presented severe problems during the assembly of primary sequence reads. We discuss the methodology that yielded a finished and fully contiguous sequence for chromosome 12. The biological implications of the sequence data are more thoroughly discussed in an accompanying Article (ref. 3).  相似文献   

13.
钙调磷酸酶B类似蛋白互作蛋白激酶(CIPKs)在植物生长发育和抗逆过程中发挥着重要作用。为了对桃中CIPK家族基因进行系统分析,利用桃基因组数据库,通过生物信息学手段,鉴定桃CIPK家族基因的基因结构、染色体定位和编码蛋白,通过序列比对进行进化和分类分析。结果表明,桃基因组中含有18个CIPK基因,分布于桃的6条染色体上。MEME和Pfam保守结构域分析显示,桃CIPK蛋白均含有2个保守的PKinase和NAF结构域。进化树分析表明CIPKs可分为2个亚家族。Net Phos 2.0 Server结果显示Pp CIPKs存在着大量的丝氨酸(Ser)、苏氨酸(Thr)及酪氨酸(Tyr)潜在磷酸化位点。以上结果将为今后揭示桃CIPK蛋白的功能提供重要的理论基础。  相似文献   

14.
We constructed maps for eight chromosomes (1, 6, 9, 10, 13, 20, X and (previously) 22), representing one-third of the genome, by building landmark maps, isolating bacterial clones and assembling contigs. By this approach, we could establish the long-range organization of the maps early in the project, and all contig extension, gap closure and problem-solving was simplified by containment within local regions. The maps currently represent more than 94% of the euchromatic (gene-containing) regions of these chromosomes in 176 contigs, and contain 96% of the chromosome-specific markers in the human gene map. By measuring the remaining gaps, we can assess chromosome length and coverage in sequenced clones.  相似文献   

15.
Sequence and analysis of chromosome 2 of Dictyostelium discoideum   总被引:1,自引:0,他引:1  
The genome of the lower eukaryote Dictyostelium discoideum comprises six chromosomes. Here we report the sequence of the largest, chromosome 2, which at 8 megabases (Mb) represents about 25% of the genome. Despite an A + T content of nearly 80%, the chromosome codes for 2,799 predicted protein coding genes and 73 transfer RNA genes. This gene density, about 1 gene per 2.6 kilobases (kb), is surpassed only by Saccharomyces cerevisiae (one per 2 kb) and is similar to that of Schizosaccharomyces pombe (one per 2.5 kb). If we assume that the other chromosomes have a similar gene density, we can expect around 11,000 genes in the D. discoideum genome. A significant number of the genes show higher similarities to genes of vertebrates than to those of other fully sequenced eukaryotes. This analysis strengthens the view that the evolutionary position of D. discoideum is located before the branching of metazoa and fungi but after the divergence of the plant kingdom, placing it close to the base of metazoan evolution.  相似文献   

16.
A primary physical map of rice chromosome 12 was constructed using marker-based chromosome landing and chromosome walking. A BAC library from IR64 was screened using 84 RFLP markers, 4 STS markers and 6 microsatellite markers on chromosome 12 by colony hybridization and polymerase chain reaction (PCR) amplification. A total of 59 contigs consisting of 419 BAC clones including 5 single-clones were physically aligned on rice chromosome 12 with the largest BAC contig covering 855 kb. The whole physical map had a size of ∼16 Mb and covered about 52% of rice chromosome 12. This physical map will be certainly helpful for map-based gene cloning of agronomically and biological important genes and understanding the genome structure of the chromosome. Foundation item: Supported by Rockefeller Foundation Biography: FU Bin-Ying (1965-), male, Ph. D. candidate, Reseach direction: plant molecular genetics.  相似文献   

17.
Since the sequencing of the first two chromosomes of the malaria parasite, Plasmodium falciparum, there has been a concerted effort to sequence and assemble the entire genome of this organism. Here we report the sequence of chromosomes 1, 3-9 and 13 of P. falciparum clone 3D7--these chromosomes account for approximately 55% of the total genome. We describe the methods used to map, sequence and annotate these chromosomes. By comparing our assemblies with the optical map, we indicate the completeness of the resulting sequence. During annotation, we assign Gene Ontology terms to the predicted gene products, and observe clustering of some malaria-specific terms to specific chromosomes. We identify a highly conserved sequence element found in the intergenic region of internal var genes that is not associated with their telomeric counterparts.  相似文献   

18.
Genome linking with yeast artificial chromosomes   总被引:76,自引:0,他引:76  
A Coulson  R Waterston  J Kiff  J Sulston  Y Kohara 《Nature》1988,335(6186):184-186
The haploid genome of Caenorhabditis elegans consists of some 80 x 10(6) base pairs of DNA contained in six chromosomes. The large number of interesting loci that have been recognized by mutation, and the accuracy of the genetic map, mean that a physical map of the genome is highly desirable, because it will facilitate the molecular cloning of chosen loci. The first steps towards such a map used a fingerprinting method to link cosmid clones together. This approach reached its practical limit last year, when 90-95% of the genome had been cloned into 17,500 cosmids assembled into some 700 clusters (contigs), but the linking clones needed were either non-existent or extremely rare. Anticipating this, we had planned to link by physical means--probably by hybridization to NotI fragments separated by pulse field gel electrophoresis. NotI recognizes an eight base sequence of GC pairs; thus the fragments should be large enough to bridge regions that clone poorly in cosmids, and, with no selective step involved, would necessarily be fully representative. However, with the availability of a yeast artificial chromosome (YAC) vector, we decided to use this alternative source of large DNA fragments to obtain linkage. The technique involves the ligation of large (50-1,000 kilobase) genomic fragments into a vector that provides centromeric, telomeric and selective functions; the constructs are then introduced into Saccharomyces cerevisiae, and replicate in the same manner as the host chromosomes.  相似文献   

19.
Small peptides function as key signals in processes, such as plant cell differentiation, organ development and defenses to biotic stresses. A large number of small peptide precursor genes have been predicted from the analysis of the soybean (Glycine max) whole genome DNA sequence. However, most of these genes have unknown characteristics and functions. In this report, we systemically searched for the gene families of small peptide precursors that are up-regulated in soybean nitrogen-fixing root nodules. We found 212 genes (encoding peptides shorter than 150 amino acids) that were up-regulated, and among them, 79 genes belong to 38 multiple-gene families, but the other 133 genes are unique. Twenty-eight of 38 families are conserved in Arabidopsis, but the other 10 only exist in legumes. We also identified 16 out of the 38 members of the wound-induced polypeptide (WIP) gene family to be upregulated in nitrogen-fixing nodules. We further analyzed homologs of WIP genes in Medicago, Lotus, Arabidopsis and Oryza species and found that a few homologous genes from Medicago truncatula and Lotus japonicus were also upregulated in their nodules and some WIP genes were induced by specific fungal pathogens on soybean and rice. Structure prediction indicated that all WIP prepropeptides contain a conserved DUF3774 domain (including two hydrophobic regions) and most of them have an N-terminal signal sequence. Fluorescence microscopy analysis of two WIP prepropeptides fused to GFP revealed that these proteins are located on the plasma membrane of tobacco leaf cells. Interestingly, 34 soybean WIP genes are clustered onto three soybean chromosomes, different from known peptide gene families (such as CLE). Among them, 11 highly identical genes are aligned on the 6th chromosome, 12 on the 12th, and 11 on the 13th chromosomes. Most of WIP genes from the 12th chromosome share the highest identities with their homologs on the 13th chromosome, suggesting that ancestral WIP genes could have originated from the 13th chromosome, then spread onto the 12th chromosome by chromosome homologous recombination; the new WIP genes could have existed in multiple copies by gene duplication which then spread onto the 6th chromosome. In Arabidopsis and Oryza species, half of the WIP genes are also aligned on one chromosome and showed higher identity with those from the soybean 12th and 13th chromosomes, suggesting that WIP genes originated from one common ancestor.  相似文献   

20.
The large genome size (~17000 Mb) and complicated DNA structures of common wheat (Triticum aestivum) hamper its genome sequencing.By means of flow cytometry,systematic investigations on individual chromosome sorting have been carried out to construct chromosome-specific bacterial artificial chromosome (BAC) libraries since the 1980s.Several wheat chromosome-specific BAC libraries,such as chromosome 3B,three D genome chromosomes (1D,4D and 6D),and the short arm of chromosome 1B,have been developed,and the ph...  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号