Similar literature
1.
Arabidopsis thaliana is an important model system for plant biologists. In 1996 an international collaboration (the Arabidopsis Genome Initiative) was formed to sequence the whole genome of Arabidopsis and in 1999 the sequence of the first two chromosomes was reported. The sequence of the last three chromosomes and an analysis of the whole genome are reported in this issue. Here we present the sequence of chromosome 3, organized into four sequence segments (contigs). The two largest (13.5 and 9.2 Mb) correspond to the top (long) and the bottom (short) arms of chromosome 3, and the two small contigs are located in the genetically defined centromere. This chromosome encodes 5,220 of the roughly 25,500 predicted protein-coding genes in the genome. About 20% of the predicted proteins have significant homology to proteins in eukaryotic genomes for which the complete sequence is available, pointing to important conserved cellular functions among eukaryotes.

2.
Generation and annotation of the DNA sequences of human chromosomes 2 and 4 (total citations: 1; self-citations: 0; citations by others: 1)
Human chromosome 2 is unique to the human lineage in being the product of a head-to-head fusion of two intermediate-sized ancestral chromosomes. Chromosome 4 has received attention primarily related to the search for the Huntington's disease gene, but also for genes associated with Wolf-Hirschhorn syndrome, polycystic kidney disease and a form of muscular dystrophy. Here we present approximately 237 million base pairs of sequence for chromosome 2, and 186 million base pairs for chromosome 4, representing more than 99.6% of their euchromatic sequences. Our initial analyses have identified 1,346 protein-coding genes and 1,239 pseudogenes on chromosome 2, and 796 protein-coding genes and 778 pseudogenes on chromosome 4. Extensive analyses confirm the underlying construction of the sequence, and expand our understanding of the structure and evolution of mammalian chromosomes, including gene deserts, segmental duplications and highly variant regions.

3.
The use of comparative genomics to infer genome function relies on the understanding of how different components of the genome change over evolutionary time. The aim of such comparative analysis is to identify conserved, functionally transcribed sequences such as protein-coding genes and non-coding RNA genes, and other functional sequences such as regulatory regions, as well as other genomic features. Here, we have compared the entire human chromosome 21 with syntenic regions of the mouse genome, and have identified a large number of conserved blocks of unknown function. Although previous studies have made similar observations, it is unknown whether these conserved sequences are genes or not. Here we present an extensive experimental and computational analysis of human chromosome 21 in an effort to assign function to sequences conserved between human chromosome 21 (ref. 8) and the syntenic mouse regions. Our data support the presence of a large number of potentially functional non-genic sequences, probably regulatory and structural. Integrating the properties of the conserved components of human chromosome 21 with the rapidly accumulating functional data for this chromosome will considerably improve our understanding of the role of sequence conservation in mammalian genomes.

4.
The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein-coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.

5.
Chromosome 14 is one of five acrocentric chromosomes in the human genome. These chromosomes are characterized by a heterochromatic short arm that contains essentially ribosomal RNA genes, and a euchromatic long arm in which most, if not all, of the protein-coding genes are located. The finished sequence of human chromosome 14 comprises 87,410,661 base pairs, representing 100% of its euchromatic portion, in a single continuous segment covering the entire long arm with no gaps. Two loci of crucial importance for the immune system, as well as more than 60 disease genes, have been localized so far on chromosome 14. We identified 1,050 genes and gene fragments, and 393 pseudogenes. On the basis of comparisons with other vertebrate genomes, we estimate that more than 96% of the chromosome 14 genes have been annotated. From an analysis of the CpG island occurrences, we estimate that 70% of these annotated genes are complete at their 5' end.

6.
Chromosome 18 appears to have the lowest gene density of any human chromosome and is one of only three chromosomes for which trisomic individuals survive to term. There are also a number of genetic disorders stemming from chromosome 18 trisomy and aneuploidy. Here we report the finished sequence and gene annotation of human chromosome 18, which will allow a better understanding of the normal and disease biology of this chromosome. Despite the low density of protein-coding genes on chromosome 18, we find that the proportion of non-protein-coding sequences evolutionarily conserved among mammals is close to the genome-wide average. Extending this analysis to the entire human genome, we find that the density of conserved non-protein-coding sequences is largely uncorrelated with gene density. This has important implications for the nature and roles of non-protein-coding sequence elements.

7.
Complete genomic sequence is known for two multicellular eukaryotes, the nematode Caenorhabditis elegans and the fruit fly Drosophila melanogaster, and it will soon be known for humans. However, biological function has been assigned to only a small proportion of the predicted genes in any animal. Here we have used RNA-mediated interference (RNAi) to target nearly 90% of predicted genes on C. elegans chromosome I by feeding worms with bacteria that express double-stranded RNA. We have assigned function to 13.9% of the genes analysed, increasing the number of sequenced genes with known phenotypes on chromosome I from 70 to 378. Although most genes with sterile or embryonic lethal RNAi phenotypes are involved in basal cell metabolism, many genes giving post-embryonic phenotypes have conserved sequences but unknown function. In addition, conserved genes are significantly more likely to have an RNAi phenotype than are genes with no conservation. We have constructed a reusable library of bacterial clones that will permit unlimited RNAi screens in the future; this should help develop a more complete view of the relationships between the genome, gene function and the environment.

8.
D H Hall, Y Liu, D A Shub. Nature, 1989, 340(6234): 575-576
The organization of genes into exons separated by introns may permit rapid evolution of protein-coding sequences by exon shuffling. Introns could provide non-coding targets for recombination, which would then give rise to novel combinations of exons. Evidence to support this theory is indirect and consists of examples of homologous domains of protein structure encoded in different genes, with introns in conserved positions at the boundaries of these domains. Here, we report the first direct evidence for exon shuffling. Two spontaneous deletion mutations of phage T4 have been characterized by sequencing, and they are clearly the result of recombination between homologous regions of two self-splicing group I introns. As a result of the recombination, exons of different genes are transcribed together, with a hybrid intron between them. One of these introns is proficient in self-splicing.

9.
V Lindgren, M Ares, A M Weiner, U Francke. Nature, 1985, 314(6006): 115-116
U2 RNA is one of the abundant, highly conserved species of small nuclear RNA (snRNA) molecules implicated in RNA processing. As is typical of mammalian snRNAs, human U1 and U2 are each encoded by a multigene family. In the human genome, defective copies of the genes (pseudogenes) far outnumber the authentic genes. The majority or all of the 35 to 100 bona fide U1 genes have at least 20 kilobases (kb) of nearly perfect 5' and 3' flanking homology in common with each other; these U1 genes are clustered loosely in chromosome band 1p36 (refs 5, 7) with intergenic distances exceeding 44 kb. In contrast, the 10 to 20 U2 genes are clustered tightly in a virtually perfect tandem array which has a strict 6-kb repeating unit. We report here the assignment, by in situ hybridization, of the U2 gene cluster to chromosome 17, bands q21-q22. Surprisingly, this region is one of three major adenovirus 12 modification sites which undergo chromosome decondensation ('uncoiling') in permissive human cells infected by highly oncogenic strains of adenovirus. The two other major modification sites, 1p36 and 1q21, coincide with the locations of U1 genes and class I U1 pseudogenes, respectively. We suggest that snRNA genes are the major targets of viral chromosome modification.

10.
Maintenance of functional equivalence during paralogous Hox gene evolution (total citations: 15; self-citations: 0; citations by others: 15)
Greer JM, Puetz J, Thomas KR, Capecchi MR. Nature, 2000, 403(6770): 661-665
Biological diversity is driven mainly by gene duplication followed by mutation and selection. This divergence in either regulatory or protein-coding sequences can result in quite different biological functions for even closely related genes. This concept is exemplified by the mammalian Hox gene complex, a group of 39 genes which are located on 4 linkage groups, dispersed on 4 chromosomes. The evolution of this complex began with amplification in cis of a primordial Hox gene to produce 13 members, followed by duplications in trans of much of the entire unit. As a consequence, Hox genes that occupy the same relative position along the 5' to 3' chromosomal coordinate (trans-paralogous genes) share more similarity in sequence and expression pattern than do adjacent Hox genes on the same chromosome. Studies in mice indicate that although individual family members may have unique biological roles, they also share overlapping functions with their paralogues. Here we show that the proteins encoded by the paralogous genes, Hoxa3 and Hoxd3, can carry out identical biological functions, and that the different roles attributed to these genes are the result of quantitative modulations in gene expression.

11.
Chromosome 13 is the largest acrocentric human chromosome. It carries genes involved in cancer including the breast cancer type 2 (BRCA2) and retinoblastoma (RB1) genes, is frequently rearranged in B-cell chronic lymphocytic leukaemia, and contains the DAOA locus associated with bipolar disorder and schizophrenia. We describe completion and analysis of 95.5 megabases (Mb) of sequence from chromosome 13, which contains 633 genes and 296 pseudogenes. We estimate that more than 95.4% of the protein-coding genes of this chromosome have been identified, on the basis of comparison with other vertebrate genome sequences. Additionally, 105 putative non-coding RNA genes were found. Chromosome 13 has one of the lowest gene densities (6.5 genes per Mb) among human chromosomes, and contains a central region of 38 Mb where the gene density drops to only 3.1 genes per Mb.

12.
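Screening for expressed sequences in the human genome is one of the important routes to finding candidate genes. Exon trapping and direct cDNA selection screen on the basis of the structure and the expression characteristics of expressed sequences, respectively. Expressed sequence tags (ESTs) are the landmarks of the expression map: they are site-specific markers of expressed sequences. Drawing on these features of ESTs, we established, for the first time in China, a new method for screening candidate genes that starts from ESTs. Using this method, a novel cDNA was obtained from the Xq13 region of the human X chromosome; the 1,398 bp sequenced in total include the complete 3' end.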

13.
We report a high-quality draft of the genome sequence of the grey, short-tailed opossum (Monodelphis domestica). As the first metatherian ('marsupial') species to be sequenced, the opossum provides a unique perspective on the organization and evolution of mammalian genomes. Distinctive features of the opossum chromosomes provide support for recent theories about genome evolution and function, including a strong influence of biased gene conversion on nucleotide sequence composition, and a relationship between chromosomal characteristics and X chromosome inactivation. Comparison of opossum and eutherian genomes also reveals a sharp difference in evolutionary innovation between protein-coding and non-coding functional elements. True innovation in protein-coding genes seems to be relatively rare, with lineage-specific differences being largely due to diversification and rapid turnover in gene families involved in environmental interactions. In contrast, about 20% of eutherian conserved non-coding elements (CNEs) are recent inventions that postdate the divergence of Eutheria and Metatheria. A substantial proportion of these eutherian-specific CNEs arose from sequence inserted by transposable elements, pointing to transposons as a major creative force in the evolution of mammalian gene regulation.

14.
S Augustin, M W Müller, R J Schweyen. Nature, 1990, 343(6256): 383-386
Group II introns, which are classed together on the basis of a conserved secondary structure, are found in organellar genes of lower eukaryotes and plants. Like introns in nuclear pre-messenger RNA, they are excised by a two-step splicing reaction to generate branched circular RNAs, the so-called lariats. A remarkable feature of group II introns is their self-splicing activity in vitro. In the absence of a nucleotide cofactor, the intron RNAs catalyse two successive transesterification reactions which lead to autocatalytic excision of the lariat IVS from pre-mRNA and concomitantly to exon ligation. By virtue of its ability to specifically bind the 5' exon, the intron can also catalyse such reactions on exogenous RNA substrates. This sequence-specific attachment could enable group II introns to integrate into unrelated RNAs by reverse splicing, in a process similar to that described for the self-splicing Tetrahymena group I intron. Here we report that group II lariat IVS can indeed reintegrate itself into an RNA composed of the ligated exons in vitro. This occurs by a process of self-splicing that completely reverses both transesterification steps of the forward reaction: it involves a transition of the 2'-5' phosphodiester bond of the lariat RNA into the 3'-5' bond of the reconstituted 5' splice junction.

15.
16.
17.
18.
Hundreds of highly conserved distal cis-regulatory elements have been characterized so far in vertebrate genomes. Many thousands more are predicted on the basis of comparative genomics. However, in stark contrast to the genes that they regulate, in invertebrates virtually none of these regions can be traced by using sequence similarity, leaving their evolutionary origins obscure. Here we show that a class of conserved, primarily non-coding regions in tetrapods originated from a previously unknown short interspersed repetitive element (SINE) retroposon family that was active in the Sarcopterygii (lobe-finned fishes and terrestrial vertebrates) in the Silurian period at least 410 million years ago (ref. 4), and seems to be recently active in the 'living fossil' Indonesian coelacanth, Latimeria menadoensis. Using a mouse enhancer assay we show that one copy, 0.5 million bases from the neuro-developmental gene ISL1, is an enhancer that recapitulates multiple aspects of Isl1 expression patterns. Several other copies represent new, possibly regulatory, alternatively spliced exons in the middle of pre-existing Sarcopterygian genes. One of these, a more than 200-base-pair ultraconserved region, 100% identical in mammals, and 80% identical to the coelacanth SINE, contains a 31-amino-acid-residue alternatively spliced exon of the messenger RNA processing gene PCBP2 (ref. 6). These add to a growing list of examples in which relics of transposable elements have acquired a function that serves their host, a process termed 'exaptation', and provide an origin for at least some of the many highly conserved vertebrate-specific genomic sequences.

19.
Chromosomes 1, 3, 5, 6, 7, 10 and 12 of the rice field eel (Monopterus albus Zuiew) were successfully microdissected from meiosis I diakinesis spreads using a glass microneedle under an inverted microscope. The DOP-PCR products of each single chromosome, dotted on a nylon membrane as a “specific chromosomal DNA pool”, were hybridized with 6 probes to map the corresponding genes. The mapping results show that Zfa maps to chromosome 1, rDNA to chromosomes 3 and 7, both Gh and Pdegg to chromosome 10, and Hsl to chromosome 5, while Hox genes were detected on chromosomes 1, 3, 6 and 10. It is tentatively suggested that chromosome 10 of the rice field eel might share conserved synteny with human chromosome 17, mouse chromosome 11, pig chromosome 12 and bovine chromosome 19, and that chromosome 3 of the rice field eel might likewise share conserved synteny with zebrafish chromosome 2. Our study is an attempt to establish a new and feasible method to advance the study of gene mapping and chromosome evolution in fish, and also to provide a new approach to distinguishing individual fish chromosomes on the basis of molecular markers.

20.