共查询到20条相似文献,搜索用时 15 毫秒
1.
Star B Nederbragt AJ Jentoft S Grimholt U Malmstrøm M Gregers TF Rounge TB Paulsen J Solbakken MH Sharma A Wetten OF Lanzén A Winer R Knight J Vogel JH Aken B Andersen O Lagesen K Tooming-Klunderud A Edvardsen RB Tina KG Espelund M Nepal C Previti C Karlsen BO Moum T Skage M Berg PR Gjøen T Kuhl H Thorsen J Malde K Reinhardt R Du L Johansen SD Searle S Lien S Nilsen F Jonassen I Omholt SW Stenseth NC Jakobsen KS 《Nature》2011,477(7363):207-210
Atlantic cod (Gadus morhua) is a large, cold-adapted teleost that sustains long-standing commercial fisheries and incipient aquaculture. Here we present the genome sequence of Atlantic cod, showing evidence for complex thermal adaptations in its haemoglobin gene cluster and an unusual immune architecture compared to other sequenced vertebrates. The genome assembly was obtained exclusively by 454 sequencing of shotgun and paired-end libraries, and automated annotation identified 22,154 genes. The major histocompatibility complex (MHC)?II is a conserved feature of the adaptive immune system of jawed vertebrates, but we show that Atlantic cod has lost the genes for MHC?II, CD4 and invariant chain (Ii) that are essential for the function of this pathway. Nevertheless, Atlantic cod is not exceptionally susceptible to disease under natural conditions. We find a highly expanded number of MHC?I genes and a unique composition of its Toll-like receptor (TLR) families. This indicates how the Atlantic cod immune system has evolved compensatory mechanisms in both adaptive and innate immunity in the absence of MHC?II. These observations affect fundamental assumptions about the evolution of the adaptive immune system and its components in vertebrates. 相似文献
2.
Integration of telomere sequences with the draft human genome sequence 总被引:15,自引:0,他引:15
Riethman HC Xiang Z Paul S Morse E Hu XL Flint J Chi HC Grady DL Moyzis RK 《Nature》2001,409(6822):948-951
Telomeres are the ends of linear eukaryotic chromosomes. To ensure that no large stretches of uncharacterized DNA remain between the ends of the human working draft sequence and the ends of each chromosome, we would need to connect the sequences of the telomeres to the working draft sequence. But telomeres have an unusual DNA sequence composition and organization that makes them particularly difficult to isolate and analyse. Here we use specialized linear yeast artificial chromosome clones, each carrying a large telomere-terminal fragment of human DNA, to integrate most human telomeres with the working draft sequence. Subtelomeric sequence structure appears to vary widely, mainly as a result of large differences in subtelomeric repeat sequence abundance and organization at individual telomeres. Many subtelomeric regions appear to be gene-rich, matching both known and unknown expressed genes. This indicates that human subtelomeric regions are not simply buffers of nonfunctional 'junk DNA' next to the molecular telomere, but are instead functional parts of the expressed genome. 相似文献
3.
Complete genome sequence of Pseudomonas aeruginosa PAO1, an opportunistic pathogen 总被引:45,自引:0,他引:45
Stover CK Pham XQ Erwin AL Mizoguchi SD Warrener P Hickey MJ Brinkman FS Hufnagle WO Kowalik DJ Lagrou M Garber RL Goltry L Tolentino E Westbrock-Wadman S Yuan Y Brody LL Coulter SN Folger KR Kas A Larbig K Lim R Smith K Spencer D Wong GK Wu Z Paulsen IT Reizer J Saier MH Hancock RE Lory S Olson MV 《Nature》2000,406(6799):959-964
Pseudomonas aeruginosa is a ubiquitous environmental bacterium that is one of the top three causes of opportunistic human infections. A major factor in its prominence as a pathogen is its intrinsic resistance to antibiotics and disinfectants. Here we report the complete sequence of P. aeruginosa strain PAO1. At 6.3 million base pairs, this is the largest bacterial genome sequenced, and the sequence provides insights into the basis of the versatility and intrinsic drug resistance of P. aeruginosa. Consistent with its larger genome size and environmental adaptability, P. aeruginosa contains the highest proportion of regulatory genes observed for a bacterial genome and a large number of genes involved in the catabolism, transport and efflux of organic compounds as well as four potential chemotaxis systems. We propose that the size and complexity of the P. aeruginosa genome reflect an evolutionary adaptation permitting it to thrive in diverse environments and resist the effects of a variety of antimicrobial substances. 相似文献
4.
The map-based sequence of the rice genome 总被引:14,自引:0,他引:14
International Rice Genome Sequencing Project 《Nature》2005,436(7052):793-800
Rice, one of the world's most important food plants, has important syntenic relationships with the other cereal species and is a model plant for the grasses. Here we present a map-based, finished quality sequence that covers 95% of the 389 Mb genome, including virtually all of the euchromatin and two complete centromeres. A total of 37,544 non-transposable-element-related protein-coding genes were identified, of which 71% had a putative homologue in Arabidopsis. In a reciprocal analysis, 90% of the Arabidopsis proteins had a putative homologue in the predicted rice proteome. Twenty-nine per cent of the 37,544 predicted genes appear in clustered gene families. The number and classes of transposable elements found in the rice genome are consistent with the expansion of syntenic regions in the maize and sorghum genomes. We find evidence for widespread and recurrent gene transfer from the organelles to the nuclear chromosomes. The map-based sequence has proven useful for the identification of genes underlying agronomic traits. The additional single-nucleotide polymorphisms and simple sequence repeats identified in our study should accelerate improvements in rice production. 相似文献
5.
Simpson AJ Reinach FC Arruda P Abreu FA Acencio M Alvarenga R Alves LM Araya JE Baia GS Baptista CS Barros MH Bonaccorsi ED Bordin S Bové JM Briones MR Bueno MR Camargo AA Camargo LE Carraro DM Carrer H Colauto NB Colombo C Costa FF Costa MC Costa-Neto CM Coutinho LL Cristofani M Dias-Neto E Docena C El-Dorry H Facincani AP Ferreira AJ Ferreira VC Ferro JA Fraga JS França SC Franco MC Frohme M Furlan LR Garnier M Goldman GH Goldman MH Gomes SL Gruber A Ho PL Hoheisel JD Junqueira ML Kemper EL 《Nature》2000,406(6792):151-159
Xylella fastidiosa is a fastidious, xylem-limited bacterium that causes a range of economically important plant diseases. Here we report the complete genome sequence of X. fastidiosa clone 9a5c, which causes citrus variegated chlorosis--a serious disease of orange trees. The genome comprises a 52.7% GC-rich 2,679,305-base-pair (bp) circular chromosome and two plasmids of 51,158 bp and 1,285 bp. We can assign putative functions to 47% of the 2,904 predicted coding regions. Efficient metabolic functions are predicted, with sugars as the principal energy and carbon source, supporting existence in the nutrient-poor xylem sap. The mechanisms associated with pathogenicity and virulence involve toxins, antibiotics and ion sequestration systems, as well as bacterium-bacterium and bacterium-host interactions mediated by a range of proteins. Orthologues of some of these proteins have only been identified in animal and human pathogens; their presence in X. fastidiosa indicates that the molecular basis for bacterial pathogenicity is both conserved and independent of host. At least 83 genes are bacteriophage-derived and include virulence-associated genes from other bacteria, providing direct evidence of phage-mediated horizontal gene transfer. 相似文献
6.
Wood V Gwilliam R Rajandream MA Lyne M Lyne R Stewart A Sgouros J Peat N Hayles J Baker S Basham D Bowman S Brooks K Brown D Brown S Chillingworth T Churcher C Collins M Connor R Cronin A Davis P Feltwell T Fraser A Gentles S Goble A Hamlin N Harris D Hidalgo J Hodgson G Holroyd S Hornsby T Howarth S Huckle EJ Hunt S Jagels K James K Jones L Jones M Leather S McDonald S McLean J Mooney P Moule S Mungall K Murphy L Niblett D Odell C Oliver K O'Neil S Pearson D Quail MA Rabbinowitsch E 《Nature》2002,415(6874):871-880
We have sequenced and annotated the genome of fission yeast (Schizosaccharomyces pombe), which contains the smallest number of protein-coding genes yet recorded for a eukaryote: 4,824. The centromeres are between 35 and 110 kilobases (kb) and contain related repeats including a highly conserved 1.8-kb element. Regions upstream of genes are longer than in budding yeast (Saccharomyces cerevisiae), possibly reflecting more-extended control regions. Some 43% of the genes contain introns, of which there are 4,730. Fifty genes have significant similarity with human disease genes; half of these are cancer related. We identify highly conserved genes important for eukaryotic cell organization including those required for the cytoskeleton, compartmentation, cell-cycle control, proteolysis, protein phosphorylation and RNA splicing. These genes may have originated with the appearance of eukaryotic life. Few similarly conserved genes that are important for multicellular organization were identified, suggesting that the transition from prokaryotes to eukaryotes required more new genes than did the transition from unicellular to multicellular organization. 相似文献
7.
The complete sequence of the mucosal pathogen Ureaplasma urealyticum 总被引:17,自引:0,他引:17
The comparison of the genomes of two very closely related human mucosal pathogens, Mycoplasma genitalium and Mycoplasma pneumoniae, has helped define the essential functions of a self-replicating minimal cell, as well as what constitutes a mycoplasma. Here we report the complete sequence of a more distant phylogenetic relative of those bacteria, Ureaplasma urealyticum (parvum biovar), which is also a mucosal pathogen of humans. It is the third mycoplasma to be sequenced, and has the smallest sequenced prokaryotic genome except for M. genitalium. Although the U. urealyticum genome is similar to the two sequenced mycoplasma genomes, features make this organism unique among mycoplasmas and all bacteria. Almost all ATP synthesis is the result of urea hydrolysis, which generates an energy-producing electrochemical gradient. Some highly conserved eubacterial enzymes appear not to be encoded by U. urealyticum, including the cell-division protein FtsZ, chaperonins GroES and GroEL, and ribonucleoside-diphosphate reductase. U. urealyticum has six closely related iron transporters, which apparently arose through gene duplication, suggesting that it has a kind of respiration system not present in other small genome bacteria The genome is only 25.5% G+C in nucleotide content, and the G+C content of individual genes may predict how essential those genes are to ureaplasma survival. 相似文献
8.
Bowler C Allen AE Badger JH Grimwood J Jabbari K Kuo A Maheswari U Martens C Maumus F Otillar RP Rayko E Salamov A Vandepoele K Beszteri B Gruber A Heijde M Katinka M Mock T Valentin K Verret F Berges JA Brownlee C Cadoret JP Chiovitti A Choi CJ Coesel S De Martino A Detter JC Durkin C Falciatore A Fournet J Haruta M Huysman MJ Jenkins BD Jiroutova K Jorgensen RE Joubert Y Kaplan A Kröger N Kroth PG La Roche J Lindquist E Lommer M Martin-Jézéquel V Lopez PJ Lucas S Mangogna M McGinnis K 《Nature》2008,456(7219):239-244
Diatoms are photosynthetic secondary endosymbionts found throughout marine and freshwater environments, and are believed to be responsible for around one-fifth of the primary productivity on Earth. The genome sequence of the marine centric diatom Thalassiosira pseudonana was recently reported, revealing a wealth of information about diatom biology. Here we report the complete genome sequence of the pennate diatom Phaeodactylum tricornutum and compare it with that of T. pseudonana to clarify evolutionary origins, functional significance and ubiquity of these features throughout diatoms. In spite of the fact that the pennate and centric lineages have only been diverging for 90 million years, their genome structures are dramatically different and a substantial fraction of genes ( approximately 40%) are not shared by these representatives of the two lineages. Analysis of molecular divergence compared with yeasts and metazoans reveals rapid rates of gene diversification in diatoms. Contributing factors include selective gene family expansions, differential losses and gains of genes and introns, and differential mobilization of transposable elements. Most significantly, we document the presence of hundreds of genes from bacteria. More than 300 of these gene transfers are found in both diatoms, attesting to their ancient origins, and many are likely to provide novel possibilities for metabolite management and for perception of environmental signals. These findings go a long way towards explaining the incredible diversity and success of the diatoms in contemporary oceans. 相似文献
9.
Ruepp A Graml W Santos-Martinez ML Koretke KK Volker C Mewes HW Frishman D Stocker S Lupas AN Baumeister W 《Nature》2000,407(6803):508-513
Thermoplasma acidophilum is a thermoacidophilic archaeon that thrives at 59 degrees C and pH 2, which was isolated from self-heating coal refuse piles and solfatara fields. Species of the genus Thermoplasma do not possess a rigid cell wall, but are only delimited by a plasma membrane. Many macromolecular assemblies from Thermoplasma, primarily proteases and chaperones, have been pivotal in elucidating the structure and function of their more complex eukaryotic homologues. Our interest in protein folding and degradation led us to seek a more complete representation of the proteins involved in these pathways by determining the genome sequence of the organism. Here we have sequenced the 1,564,905-base-pair genome in just 7,855 sequencing reactions by using a new strategy. The 1,509 open reading frames identify Thermoplasma as a typical euryarchaeon with a substantial complement of bacteria-related genes; however, evidence indicates that there has been much lateral gene transfer between Thermoplasma and Sulfolobus solfataricus, a phylogenetically distant crenarchaeon inhabiting the same environment. At least 252 open reading frames, including a complete protein degradation pathway and various transport proteins, resemble Sulfolobus proteins most closely. 相似文献
10.
The genome sequence of the filamentous fungus Neurospora crassa 总被引:1,自引:0,他引:1
Galagan JE Calvo SE Borkovich KA Selker EU Read ND Jaffe D FitzHugh W Ma LJ Smirnov S Purcell S Rehman B Elkins T Engels R Wang S Nielsen CB Butler J Endrizzi M Qui D Ianakiev P Bell-Pedersen D Nelson MA Werner-Washburne M Selitrennikoff CP Kinsey JA Braun EL Zelter A Schulte U Kothe GO Jedd G Mewes W Staben C Marcotte E Greenberg D Roy A Foley K Naylor J Stange-Thomann N Barrett R Gnerre S Kamal M Kamvysselis M Mauceli E Bielke C Rudd S Frishman D Krystofova S Rasmussen C Metzenberg RL 《Nature》2003,422(6934):859-868
Neurospora crassa is a central organism in the history of twentieth-century genetics, biochemistry and molecular biology. Here, we report a high-quality draft sequence of the N. crassa genome. The approximately 40-megabase genome encodes about 10,000 protein-coding genes--more than twice as many as in the fission yeast Schizosaccharomyces pombe and only about 25% fewer than in the fruitfly Drosophila melanogaster. Analysis of the gene set yields insights into unexpected aspects of Neurospora biology including the identification of genes potentially associated with red light photobiology, genes implicated in secondary metabolism, and important differences in Ca2+ signalling as compared with plants and animals. Neurospora possesses the widest array of genome defence mechanisms known for any eukaryotic organism, including a process unique to fungi called repeat-induced point mutation (RIP). Genome analysis suggests that RIP has had a profound impact on genome evolution, greatly slowing the creation of new genes through genomic duplication and resulting in a genome with an unusually low proportion of closely related genes. 相似文献
11.
Grbić M Van Leeuwen T Clark RM Rombauts S Rouzé P Grbić V Osborne EJ Dermauw W Ngoc PC Ortego F Hernández-Crespo P Diaz I Martinez M Navajas M Sucena É Magalhães S Nagy L Pace RM Djuranović S Smagghe G Iga M Christiaens O Veenstra JA Ewer J Villalobos RM Hutter JL Hudson SD Velez M Yi SV Zeng J Pires-daSilva A Roch F Cazaux M Navarro M Zhurov V Acevedo G Bjelica A Fawcett JA Bonnet E Martens C Baele G Wissler L Sanchez-Rodriguez A Tirry L Blais C Demeestere K Henz SR Gregory TR Mathieu J 《Nature》2011,479(7374):487-492
12.
用黄芪,苦参,狼毒提取物为阳性对照对紫茎泽兰提取物进行抑菌活性比较.并利用液液萃取、色谱法对紫荆泽兰醇提物进行分离精制.结果表明,紫茎泽兰醇提物具有明显抑菌效果,且效果优于狼毒提取物和黄芪提取物;对紫茎泽兰醇提物进行分离精制及抑菌试验,结果显示其乙酸乙酯萃取物柱层析组分SF4-2抑菌活性最强,能显著抑制沙门氏菌, 蜡样芽孢杆菌, 金黄色葡萄球菌和枯草芽孢杆菌生长, 可用于食源性抗菌制剂的开发利用. 相似文献
13.
Wang J Wang W Li R Li Y Tian G Goodman L Fan W Zhang J Li J Zhang J Guo Y Feng B Li H Lu Y Fang X Liang H Du Z Li D Zhao Y Hu Y Yang Z Zheng H Hellmann I Inouye M Pool J Yi X Zhao J Duan J Zhou Y Qin J Ma L Li G Yang Z Zhang G Yang B Yu C Liang F Li W Li S Li D Ni P Ruan J Li Q Zhu H Liu D Lu Z Li N Guo G Zhang J Ye J Fang L Hao Q Chen Q Liang Y Su Y San A Ping C Yang S Chen F Li L Zhou K Zheng H Ren Y Yang L Gao Y Yang G Li Z Feng X Kristiansen K Wong GK Nielsen R Durbin R Bolund L Zhang X 《Nature》2008,456(7218):60-65
Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics. 相似文献
14.
The genome sequence of the rice blast fungus Magnaporthe grisea 总被引:8,自引:0,他引:8
Dean RA Talbot NJ Ebbole DJ Farman ML Mitchell TK Orbach MJ Thon M Kulkarni R Xu JR Pan H Read ND Lee YH Carbone I Brown D Oh YY Donofrio N Jeong JS Soanes DM Djonovic S Kolomiets E Rehmeyer C Li W Harding M Kim S Lebrun MH Bohnert H Coughlan S Butler J Calvo S Ma LJ Nicol R Purcell S Nusbaum C Galagan JE Birren BW 《Nature》2005,434(7036):980-986
Magnaporthe grisea is the most destructive pathogen of rice worldwide and the principal model organism for elucidating the molecular basis of fungal disease of plants. Here, we report the draft sequence of the M. grisea genome. Analysis of the gene set provides an insight into the adaptations required by a fungus to cause disease. The genome encodes a large and diverse set of secreted proteins, including those defined by unusual carbohydrate-binding domains. This fungus also possesses an expanded family of G-protein-coupled receptors, several new virulence-associated genes and large suites of enzymes involved in secondary metabolism. Consistent with a role in fungal pathogenesis, the expression of several of these genes is upregulated during the early stages of infection-related development. The M. grisea genome has been subject to invasion and proliferation of active transposable elements, reflecting the clonal nature of this fungus imposed by widespread rice cultivation. 相似文献
15.
Genome sequence of the plant pathogen Ralstonia solanacearum 总被引:49,自引:0,他引:49
Salanoubat M Genin S Artiguenave F Gouzy J Mangenot S Arlat M Billault A Brottier P Camus JC Cattolico L Chandler M Choisne N Claudel-Renard C Cunnac S Demange N Gaspin C Lavie M Moisan A Robert C Saurin W Schiex T Siguier P Thébault P Whalen M Wincker P Levy M Weissenbach J Boucher CA 《Nature》2002,415(6871):497-502
Ralstonia solanacearum is a devastating, soil-borne plant pathogen with a global distribution and an unusually wide host range. It is a model system for the dissection of molecular determinants governing pathogenicity. We present here the complete genome sequence and its analysis of strain GMI1000. The 5.8-megabase (Mb) genome is organized into two replicons: a 3.7-Mb chromosome and a 2.1-Mb megaplasmid. Both replicons have a mosaic structure providing evidence for the acquisition of genes through horizontal gene transfer. Regions containing genetically mobile elements associated with the percentage of G+C bias may have an important function in genome evolution. The genome encodes many proteins potentially associated with a role in pathogenicity. In particular, many putative attachment factors were identified. The complete repertoire of type III secreted effector proteins can be studied. Over 40 candidates were identified. Comparison with other genomes suggests that bacterial plant pathogens and animal pathogens harbour distinct arrays of specialized type III-dependent effectors. 相似文献
16.
17.
Here we present a draft genome sequence of the common chimpanzee (Pan troglodytes). Through comparison with the human genome, we have generated a largely complete catalogue of the genetic differences that have accumulated since the human and chimpanzee species diverged from our common ancestor, constituting approximately thirty-five million single-nucleotide changes, five million insertion/deletion events, and various chromosomal rearrangements. We use this catalogue to explore the magnitude and regional variation of mutational forces shaping these two genomes, and the strength of positive and negative selection acting on their genes. In particular, we find that the patterns of evolution in human and chimpanzee protein-coding genes are highly correlated and dominated by the fixation of neutral and slightly deleterious alleles. We also use the chimpanzee genome as an outgroup to investigate human population genetics and identify signatures of selective sweeps in recent human evolution. 相似文献
18.
Schmutz J Wheeler J Grimwood J Dickson M Yang J Caoile C Bajorek E Black S Chan YM Denys M Escobar J Flowers D Fotopulos D Garcia C Gomez M Gonzales E Haydu L Lopez F Ramirez L Retterer J Rodriguez A Rogers S Salazar A Tsai M Myers RM 《Nature》2004,429(6990):365-368
As the final sequencing of the human genome has now been completed, we present the results of the largest examination of the quality of the finished DNA sequence. The completed study covers the major contributing sequencing centres and is based on a rigorous combination of laboratory experiments and computational analysis. 相似文献
19.
The genome sequence and structure of rice chromosome 1 总被引:2,自引:0,他引:2
Sasaki T Matsumoto T Yamamoto K Sakata K Baba T Katayose Y Wu J Niimura Y Cheng Z Nagamura Y Antonio BA Kanamori H Hosokawa S Masukawa M Arikawa K Chiden Y Hayashi M Okamoto M Ando T Aoki H Arita K Hamada M Harada C Hijishita S Honda M Ichikawa Y Idonuma A Iijima M Ikeda M Ikeno M Ito S Ito T Ito Y Ito Y Iwabuchi A Kamiya K Karasawa W Katagiri S Kikuta A Kobayashi N Kono I Machita K Maehara T Mizuno H Mizubayashi T Mukai Y Nagasaki H Nakashima M Nakama Y Nakamichi Y Nakamura M Namiki N 《Nature》2002,420(6913):312-316
The rice species Oryza sativa is considered to be a model plant because of its small genome size, extensive genetic map, relative ease of transformation and synteny with other cereal crops. Here we report the essentially complete sequence of chromosome 1, the longest chromosome in the rice genome. We summarize characteristics of the chromosome structure and the biological insight gained from the sequence. The analysis of 43.3 megabases (Mb) of non-overlapping sequence reveals 6,756 protein coding genes, of which 3,161 show homology to proteins of Arabidopsis thaliana, another model plant. About 30% (2,073) of the genes have been functionally categorized. Rice chromosome 1 is (G + C)-rich, especially in its coding regions, and is characterized by several gene families that are dispersed or arranged in tandem repeats. Comparison with a draft sequence indicates the importance of a high-quality finished sequence. 相似文献
20.
Finishing the euchromatic sequence of the human genome 总被引:3,自引:0,他引:3
International Human Genome Sequencing Consortium 《Nature》2004,431(7011):931-945
The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers approximately 99% of the euchromatic genome and is accurate to an error rate of approximately 1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human genome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead. 相似文献