Similar literature
 20 similar documents found
1.
A database containing mapped partial cDNA sequences from Caenorhabditis elegans will provide a ready starting point for identifying nematode homologues of important human genes and determining their functions in C. elegans. A total of 720 expressed sequence tags (ESTs) have been generated from 585 clones randomly selected from a mixed-stage C. elegans cDNA library. Comparison of these ESTs with sequence databases identified 422 new C. elegans genes, of which 317 are not similar to any sequences in the database. Twenty-six new genes have been mapped by YAC clone hybridization. Members of several gene families, including cuticle collagens, GTP-binding proteins, and RNA helicases were discovered. Many of the new genes are similar to known or potential human disease genes, including CFTR and the LDL receptor.

2.
The genome sequences of Caenorhabditis elegans, Drosophila melanogaster and Arabidopsis thaliana have been predicted to contain 19,000, 13,600 and 25,500 genes, respectively. Before this information can be fully used for evolutionary and functional studies, several issues need to be addressed. First, the gene number estimates obtained in silico and not yet supported by any experimental data need to be verified. For example, it seems biologically paradoxical that C. elegans would have 50% more genes than Drosophila. Second, intron/exon predictions need to be tested experimentally. Third, complete sets of open reading frames (ORFs), or "ORFeomes," need to be cloned into various expression vectors. To address these issues simultaneously, we have designed and applied to C. elegans the following strategy. Predicted ORFs are amplified by PCR from a highly representative cDNA library using ORF-specific primers, cloned by Gateway recombination cloning and then sequenced to generate ORF sequence tags (OSTs) as a way to verify identity and splicing. In a sample (n=1,222) of the nearly 10,000 genes predicted ab initio (that is, for which no expressed sequence tag (EST) is available so far), at least 70% were verified by OSTs. We also observed that 27% of these experimentally confirmed genes have a structure different from that predicted by GeneFinder. We now have experimental evidence that supports the existence of at least 17,300 genes in C. elegans. Hence we suggest that gene counts based primarily on ESTs may underestimate the number of genes in human and in other organisms.

3.
An abundance of X-linked genes expressed in spermatogonia (total citations: 22; self-citations: 0; citations by others: 22)
Spermatogonia are the self-renewing, mitotic germ cells of the testis from which sperm arise by means of the differentiation pathway known as spermatogenesis. By contrast with hematopoietic and other mammalian stem-cell populations, which have been subjects of intense molecular genetic investigation, spermatogonia have remained largely unexplored at the molecular level. Here we describe a systematic search for genes expressed in mouse spermatogonia, but not in somatic tissues. We identified 25 genes (19 of which are novel) that are expressed in only male germ cells. Of the 25 genes, 3 are Y-linked and 10 are X-linked. If these genes had been distributed randomly in the genome, one would have expected zero to two of the genes to be X-linked. Our findings indicate that the X chromosome has a predominant role in pre-meiotic stages of mammalian spermatogenesis. We hypothesize that the X chromosome acquired this prominent role in male germ-cell development as it evolved from an ordinary, unspecialized autosome.
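The expectation quoted in the abstract above ("zero to two" X-linked genes out of 25) can be illustrated with a simple binomial model. The following is only a sketch: the fraction of genes assumed to lie on the X chromosome (`X_FRACTION = 0.05`) is a hypothetical placeholder that does not come from the abstract, and the function name `binomial_tail` is likewise illustrative. It is not the authors' statistical method.

```python
from math import comb


def binomial_tail(n: int, k: int, p: float) -> float:
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


N_GENES = 25       # germ-cell-specific genes reported in the abstract
N_X_LINKED = 10    # X-linked genes reported in the abstract
X_FRACTION = 0.05  # ASSUMPTION: hypothetical share of genes on the X chromosome

# Expected number of X-linked genes if the 25 genes were placed at random.
expected_x = N_GENES * X_FRACTION

# Probability of seeing at least 10 X-linked genes under that random model.
p_tail = binomial_tail(N_GENES, N_X_LINKED, X_FRACTION)

print(f"expected X-linked genes: {expected_x:.2f}")
print(f"P(at least {N_X_LINKED} X-linked): {p_tail:.3g}")
```

Under this assumed fraction the expected count falls inside the abstract's "zero to two" range, and the tail probability shows how unlikely ten X-linked genes would be by chance. A different assumed fraction changes the numbers but not the shape of the argument.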

4.
5.
Selection for short introns in highly expressed genes (total citations: 1; self-citations: 0; citations by others: 1)

6.
A survey of expressed genes in Caenorhabditis elegans (total citations: 29; self-citations: 0; citations by others: 29)
As an adjunct to the genomic sequencing of Caenorhabditis elegans, we have investigated a representative cDNA library of 1,517 clones. A single sequence read has been obtained from the 5' end of each clone, allowing its characterization with respect to the public databases, and the clones are being localized on the genome map. The result is the identification of about 1,200 of the estimated 15,000 genes of C. elegans. More than 30% of the inferred protein sequences have significant similarity to existing sequences in the databases, providing a route towards in vivo analysis of known genes in the nematode. These clones also provide material for assessing the accuracy of predicted exons and splicing patterns and will lead to a more accurate estimate of the total number of genes in the organism than has hitherto been available.

7.
Single-nucleotide polymorphisms (SNPs) have been explored as a high-resolution marker set for accelerating the mapping of disease genes. Here we report 48,196 candidate SNPs detected by statistical analysis of human expressed sequence tags (ESTs), associated primarily with coding regions of genes. We used Bayesian inference to weigh evidence for true polymorphism versus sequencing error, misalignment or ambiguity, misclustering or chimaeric EST sequences, assessing data such as raw chromatogram height, sharpness, overlap and spacing, sequencing error rates, context-sensitivity and cDNA library origin. Three separate validations (comparison with 54 genes screened for SNPs independently, verification of HLA-A polymorphisms, and restriction fragment length polymorphism (RFLP) testing) verified 70%, 89% and 71% of our predicted SNPs, respectively. Our method detects tenfold more true HLA-A SNPs than previous analyses of the EST data. We found SNPs in a large fraction of known disease genes, including some disease-causing mutations (for example, the HbS sickle-cell mutation). Our comprehensive analysis of human coding region polymorphism provides a public resource for mapping of disease genes (available at http://www.bioinformatics.ucla.edu/snp).

8.
The completed draft version of the human genome, composed of multiple short contigs encompassing 85% or more of euchromatin, was announced in June of 2000 (ref. 1). The detailed findings of the sequencing consortium were reported several months later. The draft sequence has provided insight into global characteristics, such as the total number of genes and a more accurate definition of gene families. Also of importance are genome positional details such as local genome architecture, regional gene density and the location of transcribed units that are critical for disease gene identification. We carried out a series of mapping and computational experiments using a nonredundant collection of 925 expressed sequence tags (ESTs) and sections of the public draft genome sequence that were available at different timepoints between April 2000 and April 2001. We found discrepancies in both the reported coverage of the human genome and the accuracy of mapping of genomic clones, suggesting some limitations of the draft genome sequence in providing accurate positional information and detailed characterization of chromosomal subregions.

9.
10.
Nature Genetics, 2011, 43(3): 173
The substantial $10 million purse of the Archon Genomics X PRIZE (AGXP) is being offered for the generation of rapid, accurate and complete human DNA sequences. Because so many genomics researchers have a stake, we offer to help with a process of community consultation to help evolve fair and efficient methods to validate contestant data for the competition.

11.
Milk from domestic cows has been a valuable food source for over 8,000 years, especially in lactose-tolerant human societies that exploit dairy breeds. We studied geographic patterns of variation in genes encoding the six most important milk proteins in 70 native European cattle breeds. We found substantial geographic coincidence between high diversity in cattle milk genes, locations of the European Neolithic cattle farming sites (>5,000 years ago) and present-day lactose tolerance in Europeans. This suggests a gene-culture coevolution between cattle and humans.

12.
Many sequence variants affecting diversity of adult human height (total citations: 1; self-citations: 0; citations by others: 1)
Adult human height is one of the classical complex human traits. We searched for sequence variants that affect height by scanning the genomes of 25,174 Icelanders, 2,876 Dutch, 1,770 European Americans and 1,148 African Americans. We then combined these results with previously published results from the Diabetes Genetics Initiative on 3,024 Scandinavians and tested a selected subset of SNPs in 5,517 Danes. We identified 27 regions of the genome with one or more sequence variants showing significant association with height. The estimated effects per allele of these variants ranged between 0.3 and 0.6 cm and, taken together, they explain around 3.7% of the population variation in height. The genes neighboring the identified loci cluster in biological processes related to skeletal development and mitosis. Associations with three previously reported loci are replicated in our analyses, and the strongest association was with SNPs in the ZBTB38 gene.

13.
14.
Y chromosome sequence variation and the history of human populations (total citations: 48; self-citations: 0; citations by others: 48)
Binary polymorphisms associated with the non-recombining region of the human Y chromosome (NRY) preserve the paternal genetic legacy of our species that has persisted to the present, permitting inference of human evolution, population affinity and demographic history. We used denaturing high-performance liquid chromatography (DHPLC; ref. 2) to identify 160 of the 166 bi-allelic and 1 tri-allelic site that formed a parsimonious genealogy of 116 haplotypes, several of which display distinct population affinities based on the analysis of 1062 globally representative individuals. A minority of contemporary East Africans and Khoisan represent the descendants of the most ancestral patrilineages of anatomically modern humans that left Africa between 35,000 and 89,000 years ago.

15.
Systematic screen for human disease genes in yeast (total citations: 19; self-citations: 0; citations by others: 19)
High similarity between yeast and human mitochondria allows functional genomic study of Saccharomyces cerevisiae to be used to identify human genes involved in disease. So far, 102 heritable disorders have been attributed to defects in a quarter of the known nuclear-encoded mitochondrial proteins in humans. Many mitochondrial diseases remain unexplained, however, in part because only 40-60% of the presumed 700-1,000 proteins involved in mitochondrial function and biogenesis have been identified. Here we apply a systematic functional screen using the pre-existing whole-genome pool of yeast deletion mutants to identify mitochondrial proteins. Three million measurements of strain fitness identified 466 genes whose deletions impaired mitochondrial respiration, of which 265 were new. Our approach gave higher selection than other systematic approaches, including fivefold greater selection than gene expression analysis. To apply these advantages to human disorders involving mitochondria, human orthologs were identified and linked to heritable diseases using genomic map positions.

16.
Shotgun sample sequence comparisons between mouse and human genomes (total citations: 3; self-citations: 0; citations by others: 3)
A mixed 'clone-by-clone' and 'whole-genome shotgun' strategy will be used to determine the genomic sequence of the mouse. This method will allow a phase of rapid annotation of the contemporaneous human sequence draft, through whole-genome 'sample sequence comparisons'.

17.
18.
19.
A major goal in human genetics is to understand the role of common genetic variants in susceptibility to common diseases. This will require characterizing the nature of gene variation in human populations, assembling an extensive catalogue of single-nucleotide polymorphisms (SNPs) in candidate genes and performing association studies for particular diseases. At present, our knowledge of human gene variation remains rudimentary. Here we describe a systematic survey of SNPs in the coding regions of human genes. We identified SNPs in 106 genes relevant to cardiovascular disease, endocrinology and neuropsychiatry by screening an average of 114 independent alleles using 2 independent screening methods. To ensure high accuracy, all reported SNPs were confirmed by DNA sequencing. We identified 560 SNPs, including 392 coding-region SNPs (cSNPs) divided roughly equally between those causing synonymous and non-synonymous changes. We observed different rates of polymorphism among classes of sites within genes (non-coding, degenerate and non-degenerate) as well as between genes. The cSNPs most likely to influence disease, those that alter the amino acid sequence of the encoded protein, are found at a lower rate and with lower allele frequencies than silent substitutions. This likely reflects selection acting against deleterious alleles during human evolution. The lower allele frequency of missense cSNPs has implications for the compilation of a comprehensive catalogue, as well as for the subsequent application to disease association.

20.