首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
Genome evolution studies for the phylum Nematoda have been limited by focusing on comparisons involving Caenorhabditis elegans. We report a draft genome sequence of Trichinella spiralis, a food-borne zoonotic parasite, which is the most common cause of human trichinellosis. This parasitic nematode is an extant member of a clade that diverged early in the evolution of the phylum, enabling identification of archetypical genes and molecular signatures exclusive to nematodes. We sequenced the 64-Mb nuclear genome, which is estimated to contain 15,808 protein-coding genes, at ~35-fold coverage using whole-genome shotgun and hierarchal map-assisted sequencing. Comparative genome analyses support intrachromosomal rearrangements across the phylum, disproportionate numbers of protein family deaths over births in parasitic compared to a non-parasitic nematode and a preponderance of gene-loss and -gain events in nematodes relative to Drosophila melanogaster. This genome sequence and the identified pan-phylum characteristics will contribute to genome evolution studies of Nematoda as well as strategies to combat global parasites of humans, food animals and crops.  相似文献   

2.
The approach to annotating a genome critically affects the number and accuracy of genes identified in the genome sequence. Genome annotation based on stringent gene identification is prone to underestimate the complement of genes encoded in a genome. In contrast, over-prediction of putative genes followed by exhaustive computational sequence, motif and structural homology search will find rarely expressed, possibly unique, new genes at the risk of including non-functional genes. We developed a two-stage approach that combines the merits of stringent genome annotation with the benefits of over-prediction. First we identify plausible genes regardless of matches with EST, cDNA or protein sequences from the organism (stage 1). In the second stage, proteins predicted from the plausible genes are compared at the protein level with EST, cDNA and protein sequences, and protein structures from other organisms (stage 2). Remote but biologically meaningful protein sequence or structure homologies provide supporting evidence for genuine genes. The method, applied to the Drosophila melanogaster genome, validated 1,042 novel candidate genes after filtering 19,410 plausible genes, of which 12,124 matched the original 13,601 annotated genes. This annotation strategy is applicable to genomes of all organisms, including human.  相似文献   

3.
Cloning procedures aided by homology searches of EST databases have accelerated the pace of discovery of new genes, but EST database searching remains an involved and onerous task. More than 1.6 million human EST sequences have been deposited in public databases, making it difficult to identify ESTs that represent new genes. Compounding the problems of scale are difficulties in detection associated with a high sequencing error rate and low sequence similarity between distant homologues. We have developed a new method, coupling BLAST-based searches with a domain identification protocol, that filters candidate homologues. Application of this method in a large-scale analysis of 100 signalling domain families has led to the identification of ESTs representing more than 1,000 novel human signalling genes. The 4,206 publicly available ESTs representing these genes are a valuable resource for rapid cloning of novel human signalling proteins. For example, we were able to identify ESTs of at least 106 new small GTPases, of which 6 are likely to belong to new subfamilies. In some cases, further analyses of genomic DNA led to the discovery of previously unidentified full-length protein sequences. This is exemplified by the in silico cloning (prediction of a gene product sequence using only genomic and EST sequence data) of a new type of GTPase with two catalytic domains.  相似文献   

4.
Genetic variation allows the malaria parasite Plasmodium falciparum to overcome chemotherapeutic agents, vaccines and vector control strategies and remain a leading cause of global morbidity and mortality. Here we describe an initial survey of genetic variation across the P. falciparum genome. We performed extensive sequencing of 16 geographically diverse parasites and identified 46,937 SNPs, demonstrating rich diversity among P. falciparum parasites (pi = 1.16 x 10(-3)) and strong correlation with gene function. We identified multiple regions with signatures of selective sweeps in drug-resistant parasites, including a previously unidentified 160-kb region with extremely low polymorphism in pyrimethamine-resistant parasites. We further characterized 54 worldwide isolates by genotyping SNPs across 20 genomic regions. These data begin to define population structure among African, Asian and American groups and illustrate the degree of linkage disequilibrium, which extends over relatively short distances in African parasites but over longer distances in Asian parasites. We provide an initial map of genetic diversity in P. falciparum and demonstrate its potential utility in identifying genes subject to recent natural selection and in understanding the population genetics of this parasite.  相似文献   

5.
The mammalian Y chromosome has unique characteristics compared with the autosomes or X chromosomes. Here we report the finished sequence of the chimpanzee Y chromosome (PTRY), including 271 kb of the Y-specific pseudoautosomal region 1 and 12.7 Mb of the male-specific region of the Y chromosome. Greater sequence divergence between the human Y chromosome (HSAY) and PTRY (1.78%) than between their respective whole genomes (1.23%) confirmed the accelerated evolutionary rate of the Y chromosome. Each of the 19 PTRY protein-coding genes analyzed had at least one nonsynonymous substitution, and 11 genes had higher nonsynonymous substitution rates than synonymous ones, suggesting relaxation of selective constraint, positive selection or both. We also identified lineage-specific changes, including deletion of a 200-kb fragment from the pericentromeric region of HSAY, expansion of young Alu families in HSAY and accumulation of young L1 elements and long terminal repeat retrotransposons in PTRY. Reconstruction of the common ancestral Y chromosome reflects the dynamic changes in our genomes in the 5-6 million years since speciation.  相似文献   

6.
DNA mismatch repair is important because of its role in maintaining genomic integrity and its association with hereditary non-polyposis colon cancer (HNPCC). To identify new human mismatch repair proteins, we probed nuclear extracts with the conserved carboxy-terminal MLH1 interaction domain. Here we describe the cloning and complete genomic sequence of MLH3, which encodes a new DNA mismatch repair protein that interacts with MLH1. MLH3 is more similar to mismatch repair proteins from yeast, plants, worms and bacteria than to any known mammalian protein, suggesting that its conserved sequence may confer unique functions in mice and humans. Cells in culture stably expressing a dominant-negative MLH3 protein exhibit microsatellite instability. Mlh3 is highly expressed in gastrointestinal epithelium and physically maps to the mouse complex trait locus colon cancer susceptibility I (Ccs1). Although we were unable to identify a mutation in the protein-coding region of Mlh3 in the susceptible mouse strain, colon tumours from congenic Ccs1 mice exhibit microsatellite instability. Functional redundancy among Mlh3, Pms1 and Pms2 may explain why neither Pms1 nor Pms2 mutant mice develop colon cancer, and why PMS1 and PMS2 mutations are only rarely found in HNPCC families.  相似文献   

7.
Hereditary inclusion body myopathy (HIBM; OMIM 600737) is a unique group of neuromuscular disorders characterized by adult onset, slowly progressive distal and proximal weakness and a typical muscle pathology including rimmed vacuoles and filamentous inclusions. The autosomal recessive form described in Jews of Persian descent is the HIBM prototype. This myopathy affects mainly leg muscles, but with an unusual distribution that spares the quadriceps. This particular pattern of weakness distribution, termed quadriceps-sparing myopathy (QSM), was later found in Jews originating from other Middle Eastern countries as well as in non-Jews. We previously localized the gene causing HIBM in Middle Eastern Jews on chromosome 9p12-13 (ref. 5) within a genomic interval of about 700 kb (ref. 6). Haplotype analysis around the HIBM gene region of 104 affected people from 47 Middle Eastern families indicates one unique ancestral founder chromosome in this community. By contrast, single non-Jewish families from India, Georgia (USA) and the Bahamas, with QSM and linkage to the same 9p12-13 region, show three distinct haplotypes. After excluding other potential candidate genes, we eventually identified mutations in the UDP-N-acetylglucosamine-2-epimerase/N-acetylmannosamine kinase (GNE) gene in the HIBM families: all patients from Middle Eastern descent shared a single homozygous missense mutation, whereas distinct compound heterozygotes were identified in affected individuals of families of other ethnic origins. Our findings indicate that GNE is the gene responsible for recessive HIBM.  相似文献   

8.
9.
10.
We performed a high-throughput retroviral insertional mutagenesis screen in mouse mammary tumor virus (MMTV)-induced mammary tumors and identified 33 common insertion sites, of which 17 genes were previously not known to be associated with mammary cancer and 13 had not previously been linked to cancer in general. Although members of the Wnt and fibroblast growth factors (Fgf) families were frequently tagged, our exhaustive screening for MMTV insertion sites uncovered a new repertoire of candidate breast cancer oncogenes. We validated one of these genes, Rspo3, as an oncogene by overexpression in a p53-deficient mammary epithelial cell line. The human orthologs of the candidate oncogenes were frequently deregulated in human breast cancers and associated with several tumor parameters. Computational analysis of all MMTV-tagged genes uncovered specific gene families not previously associated with cancer and showed a significant overrepresentation of protein domains and signaling pathways mainly associated with development and growth factor signaling. Comparison of all tagged genes in MMTV and Moloney murine leukemia virus-induced malignancies showed that both viruses target mostly different genes that act predominantly in distinct pathways.  相似文献   

11.
Francisella tularensis is one of the most infectious human pathogens known. In the past, both the former Soviet Union and the US had programs to develop weapons containing the bacterium. We report the complete genome sequence of a highly virulent isolate of F. tularensis (1,892,819 bp). The sequence uncovers previously uncharacterized genes encoding type IV pili, a surface polysaccharide and iron-acquisition systems. Several virulence-associated genes were located in a putative pathogenicity island, which was duplicated in the genome. More than 10% of the putative coding sequences contained insertion-deletion or substitution mutations and seemed to be deteriorating. The genome is rich in IS elements, including IS630 Tc-1 mariner family transposons, which are not expected in a prokaryote. We used a computational method for predicting metabolic pathways and found an unexpectedly high proportion of disrupted pathways, explaining the fastidious nutritional requirements of the bacterium. The loss of biosynthetic pathways indicates that F. tularensis is an obligate host-dependent bacterium in its natural life cycle. Our results have implications for our understanding of how highly virulent human pathogens evolve and will expedite strategies to combat them.  相似文献   

12.
During evolution different genes evolve at unequal rates, reflecting the varying functional constraints on phenotype. An important contributor to this variation is genetic buffering, which reduces the potential detrimental effects of mutations. We studied whether gene duplication and redundant metabolic networks affect genetic buffering by comparing the evolutionary rate of 242 human and mouse orthologous genes involved in metabolic pathways. A gene with a redundant network is defined as one for which the structural layout of metabolic pathways provides an alternative metabolic route that can, in principle, compensate for the loss of a protein function encoded by the gene. We found that genes with redundant networks evolve at similar rates as did genes without redundant networks, [corrected] but no significant difference was detected between single-copy genes and gene families. This implies that redundancy in metabolic networks provides significantly more genetic buffering than do gene families. We also found that genes encoding proteins involved in glycolysis and gluconeogenesis showed as a group a distinct pattern of variation, in contrast to genes involved in other pathways. These results suggest that redundant networks provide genetic buffering and contribute to the functional diversification of metabolic pathways.  相似文献   

13.
Bardet-Biedl syndrome (BBS) is a genetically heterogeneous ciliopathy. Although nine BBS genes have been cloned, they explain only 40-50% of the total mutational load. Here we report a major new BBS locus, BBS10, that encodes a previously unknown, rapidly evolving vertebrate-specific chaperonin-like protein. We found BBS10 to be mutated in about 20% of an unselected cohort of families of various ethnic origins, including some families with mutations in other BBS genes, consistent with oligogenic inheritance. In zebrafish, mild suppression of bbs10 exacerbated the phenotypes of other bbs morphants.  相似文献   

14.
Sequence variation in human genes is largely confined to single-nucleotide polymorphisms (SNPs) and is valuable in tests of association with common diseases and pharmacogenetic traits. We performed a systematic and comprehensive survey of molecular variation to assess the nature, pattern and frequency of SNPs in 75 candidate human genes for blood-pressure homeostasis and hypertension. We assayed 28 Mb (190 kb in 148 alleles) of genomic sequence, comprising the 5' and 3' untranslated regions (UTRs), introns and coding sequence of these genes, for sequence differences in individuals of African and Northern European descent using high-density variant detection arrays (VDAs). We identified 874 candidate human SNPs, of which 22% were confirmed by DNA sequencing to reveal a discordancy rate of 21% for VDA detection. The SNPs detected have an average minor allele frequency of 11%, and 387 are within the coding sequence (cSNPs). Of all cSNPs, 54% lead to a predicted change in the protein sequence, implying a high level of human protein diversity. These protein-altering SNPs are 38% of the total number of such SNPs expected, are more likely to be population-specific and are rarer in the human population, directly demonstrating the effects of natural selection on human genes. Overall, the degree of nucleotide polymorphism across these human genes, and orthologous great ape sequences, is highly variable and is correlated with the effects of functional conservation on gene sequences.  相似文献   

15.
A gene expression map of Arabidopsis thaliana development   总被引:3,自引:0,他引:3  
  相似文献   

16.
17.
18.
19.
Hermansky-Pudlak syndrome (HPS) is a rare autosomal recessive disorder characterized by oculocutaneous albinism and a storage pool deficiency due to an absence of platelet dense bodies. Lysosomal ceroid lipofuscinosis, pulmonary fibrosis and granulomatous colitis are occasional manifestations of the disease. HPS occurs with a frequency of one in 1800 in north-west Puerto Rico due to a founder effect. Several non-Puerto Rican patients also have mutations in HPS1, which produces a protein of unknown function. Another gene, ADTB3A, causes HPS in the pearl mouse and in two brothers with HPS-2 (refs. 11,12). ADTB3A encodes a coat protein involved in vesicle formation, implicating HPS as a disorder of membrane trafficking. We sought to identify other HPS-causing genes. Using homozygosity mapping on pooled DNA of 6 families from central Puerto Rico, we localized a new HPS susceptibility gene to a 1.6-cM interval on chromosome 3q24. The gene, HPS3, has 17 exons, and a putative 113.7-kD product expected to reveal how new vesicles form in specialized cells. The homozygous, disease-causing mutation is a large deletion and represents the second example of a founder mutation causing HPS on the small island of Puerto Rico. We also present an allele-specific assay for diagnosing individuals heterozygous or homozygous for this mutation.  相似文献   

20.
Williams-Beuren syndrome (WBS) is most often caused by hemizygous deletion of a 1.5-Mb interval encompassing at least 17 genes at 7q11.23 (refs. 1,2). As with many other haploinsufficiency diseases, the mechanism underlying the WBS deletion is thought to be unequal meiotic recombination, probably mediated by the highly homologous DNA that flanks the commonly deleted region. Here, we report the use of interphase fluorescence in situ hybridization (FISH) and pulsed-field gel electrophoresis (PFGE) to identify a genomic polymorphism in families with WBS, consisting of an inversion of the WBS region. We have observed that the inversion is hemizygous in 3 of 11 (27%) atypical affected individuals who show a subset of the WBS phenotypic spectrum but do not carry the typical WBS microdeletion. Two of these individuals also have a parent who carries the inversion. In addition, in 4 of 12 (33%) families with a proband carrying the WBS deletion, we observed the inversion exclusively in the parent transmitting the disease-related chromosome. These results suggest the presence of a newly identified genomic variant within the population that may be associated with the disease. It may result in predisposition to primarily WBS-causing microdeletions, but may also cause translocations and inversions.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号