共查询到20条相似文献,搜索用时 46 毫秒
1.
Guide to the draft human genome 总被引:5,自引:0,他引:5
There are a number of ways to investigate the structure, function and evolution of the human genome. These include examining the morphology of normal and abnormal chromosomes, constructing maps of genomic landmarks, following the genetic transmission of phenotypes and DNA sequence variations, and characterizing thousands of individual genes. To this list we can now add the elucidation of the genomic DNA sequence, albeit at 'working draft' accuracy. The current challenge is to weave together these disparate types of data to produce the information infrastructure needed to support the next generation of biomedical research. Here we provide an overview of the different sources of information about the human genome and how modern information technology, in particular the internet, allows us to link them together. 相似文献
2.
A physical map of the mouse genome 总被引:1,自引:0,他引:1
Gregory SG Sekhon M Schein J Zhao S Osoegawa K Scott CE Evans RS Burridge PW Cox TV Fox CA Hutton RD Mullenger IR Phillips KJ Smith J Stalker J Threadgold GJ Birney E Wylie K Chinwalla A Wallis J Hillier L Carter J Gaige T Jaeger S Kremitzki C Layman D Maas J McGrane R Mead K Walker R Jones S Smith M Asano J Bosdet I Chan S Chittaranjan S Chiu R Fjell C Fuhrmann D Girn N Gray C Guin R Hsiao L Krzywinski M Kutsche R Lee SS Mathewson C McLeavy C Messervier S Ness S Pandoh P Prabhu AL Saeedi P 《Nature》2002,418(6899):743-750
A physical map of a genome is an essential guide for navigation, allowing the location of any gene or other landmark in the chromosomal DNA. We have constructed a physical map of the mouse genome that contains 296 contigs of overlapping bacterial clones and 16,992 unique markers. The mouse contigs were aligned to the human genome sequence on the basis of 51,486 homology matches, thus enabling use of the conserved synteny (correspondence between chromosome blocks) of the two genomes to accelerate construction of the mouse map. The map provides a framework for assembly of whole-genome shotgun sequence data, and a tile path of clones for generation of the reference sequence. Definition of the human-mouse alignment at this level of resolution enables identification of a mouse clone that corresponds to almost any position in the human genome. The human sequence may be used to facilitate construction of other mammalian genome maps using the same strategy. 相似文献
3.
Integration of telomere sequences with the draft human genome sequence 总被引:15,自引:0,他引:15
Riethman HC Xiang Z Paul S Morse E Hu XL Flint J Chi HC Grady DL Moyzis RK 《Nature》2001,409(6822):948-951
Telomeres are the ends of linear eukaryotic chromosomes. To ensure that no large stretches of uncharacterized DNA remain between the ends of the human working draft sequence and the ends of each chromosome, we would need to connect the sequences of the telomeres to the working draft sequence. But telomeres have an unusual DNA sequence composition and organization that makes them particularly difficult to isolate and analyse. Here we use specialized linear yeast artificial chromosome clones, each carrying a large telomere-terminal fragment of human DNA, to integrate most human telomeres with the working draft sequence. Subtelomeric sequence structure appears to vary widely, mainly as a result of large differences in subtelomeric repeat sequence abundance and organization at individual telomeres. Many subtelomeric regions appear to be gene-rich, matching both known and unknown expressed genes. This indicates that human subtelomeric regions are not simply buffers of nonfunctional 'junk DNA' next to the molecular telomere, but are instead functional parts of the expressed genome. 相似文献
4.
Carlton JM Angiuoli SV Suh BB Kooij TW Pertea M Silva JC Ermolaeva MD Allen JE Selengut JD Koo HL Peterson JD Pop M Kosack DS Shumway MF Bidwell SL Shallom SJ van Aken SE Riedmuller SB Feldblyum TV Cho JK Quackenbush J Sedegah M Shoaibi A Cummings LM Florens L Yates JR Raine JD Sinden RE Harris MA Cunningham DA Preiser PR Bergman LW Vaidya AB van Lin LH Janse CJ Waters AP Smith HO White OR Salzberg SL Venter JC Fraser CM Hoffman SL Gardner MJ Carucci DJ 《Nature》2002,419(6906):512-519
Species of malaria parasite that infect rodents have long been used as models for malaria disease research. Here we report the whole-genome shotgun sequence of one species, Plasmodium yoelii yoelii, and comparative studies with the genome of the human malaria parasite Plasmodium falciparum clone 3D7. A synteny map of 2,212 P. y. yoelii contiguous DNA sequences (contigs) aligned to 14 P. falciparum chromosomes reveals marked conservation of gene synteny within the body of each chromosome. Of about 5,300 P. falciparum genes, more than 3,300 P. y. yoelii orthologues of predominantly metabolic function were identified. Over 800 copies of a variant antigen gene located in subtelomeric regions were found. This is the first genome sequence of a model eukaryotic parasite, and it provides insight into the use of such systems in the modelling of Plasmodium biology and disease. 相似文献
5.
Comparison of human genetic and sequence-based physical maps 总被引:40,自引:0,他引:40
Yu A Zhao C Fan Y Jang W Mungall AJ Deloukas P Olsen A Doggett NA Ghebranious N Broman KW Weber JL 《Nature》2001,409(6822):951-953
Recombination is the exchange of information between two homologous chromosomes during meiosis. The rate of recombination per nucleotide, which profoundly affects the evolution of chromosomal segments, is calculated by comparing genetic and physical maps. Human physical maps have been constructed using cytogenetics, overlapping DNA clones and radiation hybrids; but the ultimate and by far the most accurate physical map is the actual nucleotide sequence. The completion of the draft human genomic sequence provides us with the best opportunity yet to compare the genetic and physical maps. Here we describe our estimates of female, male and sex-average recombination rates for about 60% of the genome. Recombination rates varied greatly along each chromosome, from 0 to at least 9 centiMorgans per megabase (cM Mb(-1)). Among several sequence and marker parameters tested, only relative marker position along the metacentric chromosomes in males correlated strongly with recombination rate. We identified several chromosomal regions up to 6 Mb in length with particularly low (deserts) or high (jungles) recombination rates. Linkage disequilibrium was much more common and extended for greater distances in the deserts than in the jungles. 相似文献
6.
Genome linking with yeast artificial chromosomes 总被引:76,自引:0,他引:76
The haploid genome of Caenorhabditis elegans consists of some 80 x 10(6) base pairs of DNA contained in six chromosomes. The large number of interesting loci that have been recognized by mutation, and the accuracy of the genetic map, mean that a physical map of the genome is highly desirable, because it will facilitate the molecular cloning of chosen loci. The first steps towards such a map used a fingerprinting method to link cosmid clones together. This approach reached its practical limit last year, when 90-95% of the genome had been cloned into 17,500 cosmids assembled into some 700 clusters (contigs), but the linking clones needed were either non-existent or extremely rare. Anticipating this, we had planned to link by physical means--probably by hybridization to NotI fragments separated by pulse field gel electrophoresis. NotI recognizes an eight base sequence of GC pairs; thus the fragments should be large enough to bridge regions that clone poorly in cosmids, and, with no selective step involved, would necessarily be fully representative. However, with the availability of a yeast artificial chromosome (YAC) vector, we decided to use this alternative source of large DNA fragments to obtain linkage. The technique involves the ligation of large (50-1,000 kilobase) genomic fragments into a vector that provides centromeric, telomeric and selective functions; the constructs are then introduced into Saccharomyces cerevisiae, and replicate in the same manner as the host chromosomes. 相似文献
7.
An SNP map of human chromosome 22 总被引:35,自引:0,他引:35
Mullikin JC Hunt SE Cole CG Mortimore BJ Rice CM Burton J Matthews LH Pavitt R Plumb RW Sims SK Ainscough RM Attwood J Bailey JM Barlow K Bruskiewich RM Butcher PN Carter NP Chen Y Clee CM Coggill PC Davies J Davies RM Dawson E Francis MD Joy AA Lamble RG Langford CF Macarthy J Mall V Moreland A Overton-Larty EK Ross MT Smith LC Steward CA Sulston JE Tinsley EJ Turney KJ Willey DL Wilson GD McMurray AA Dunham I Rogers J Bentley DR 《Nature》2000,407(6803):516-520
The human genome sequence will provide a reference for measuring DNA sequence variation in human populations. Sequence variants are responsible for the genetic component of individuality, including complex characteristics such as disease susceptibility and drug response. Most sequence variants are single nucleotide polymorphisms (SNPs), where two alternate bases occur at one position. Comparison of any two genomes reveals around 1 SNP per kilobase. A sufficiently dense map of SNPs would allow the detection of sequence variants responsible for particular characteristics on the basis that they are associated with a specific SNP allele. Here we have evaluated large-scale sequencing approaches to obtaining SNPs, and have constructed a map of 2,730 SNPs on human chromosome 22. Most of the SNPs are within 25 kilobases of a transcribed exon, and are valuable for association studies. We have scaled up the process, detecting over 65,000 SNPs in the genome as part of The SNP Consortium programme, which is on target to build a map of 1 SNP every 5 kilobases that is integrated with the human genome sequence and that is freely available in the public domain. 相似文献
8.
Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype 总被引:2,自引:0,他引:2
Jaillon O Aury JM Brunet F Petit JL Stange-Thomann N Mauceli E Bouneau L Fischer C Ozouf-Costaz C Bernot A Nicaud S Jaffe D Fisher S Lutfalla G Dossat C Segurens B Dasilva C Salanoubat M Levy M Boudet N Castellano S Anthouard V Jubin C Castelli V Katinka M Vacherie B Biémont C Skalli Z Cattolico L Poulain J De Berardinis V Cruaud C Duprat S Brottier P Coutanceau JP Gouzy J Parra G Lardier G Chapple C McKernan KJ McEwan P Bosak S Kellis M Volff JN Guigó R Zody MC Mesirov J Lindblad-Toh K 《Nature》2004,431(7011):946-957
Tetraodon nigroviridis is a freshwater puffer fish with the smallest known vertebrate genome. Here, we report a draft genome sequence with long-range linkage and substantial anchoring to the 21 Tetraodon chromosomes. Genome analysis provides a greatly improved fish gene catalogue, including identifying key genes previously thought to be absent in fish. Comparison with other vertebrates and a urochordate indicates that fish proteins have diverged markedly faster than their mammalian homologues. Comparison with the human genome suggests approximately 900 previously unannotated human genes. Analysis of the Tetraodon and human genomes shows that whole-genome duplication occurred in the teleost fish lineage, subsequent to its divergence from mammals. The analysis also makes it possible to infer the basic structure of the ancestral bony vertebrate genome, which was composed of 12 chromosomes, and to reconstruct much of the evolutionary history of ancient and recent chromosome rearrangements leading to the modern human karyotype. 相似文献
9.
Cloning defined regions of the human genome by microdissection of banded chromosomes and enzymatic amplification 总被引:48,自引:0,他引:48
The molecular analysis of many genetic diseases requires the isolation of probes for defined human chromosome regions. Existing techniques such as the screening of chromosome-specific libraries, subtractive DNA cloning and chromosome jumping are either tedious or not generally applicable. Microdissection and microcloning has successfully been applied to various chromosome regions in Drosophila and mouse, but conventional microtechniques are too coarse and inefficient for analysis of the human genome. Because microdissection has previously been used on unbanded chromosomes only, cell lines in which the chromosome of interest could be identified without banding had to be used. At least one hundred chromosomes were needed for dissection and lambda vectors used to achieve maximum cloning efficiency. Recombinant phage clones are, however, more difficult to characterize than plasmid clones. Here we describe the dissection of the Langer-Giedion syndrome region on chromosome 8 from GTG-banded metaphase chromosomes (G-banding with trypsin-Giemsa) and the universal enzymatic amplification of the dissected DNA. Eighty per cent of clones from this library (total yield 20,000) identify single-copy DNA sequences. Fifty per cent of clones detect deletions in two patients with Langer-Giedion syndrome. Although the other clones have not yet been mapped, this result demonstrates that thousands of region-specific probes can be isolated within ten days. 相似文献
10.
CpG island methylation plays important role in various biological processes. To investigate methylation landscape of all CpG islands on the human genome, we develop a model for predicting the CpG island methylation status. This model outperforms other existing methods. We apply the model on the whole human genome and predict the landscape of DNA methylation of all CpG islands. Based on the methylation profile, we find that about 31% of CpG islands are methylation-prone and CpG islands located in promoter regions are seldom methylated. There is no significant difference in the CpG island methylation level between R and G bands among the chromosomes. The occupancy of RNA polymerase II is significantly higher in methylation-resistant promoter CpG islands, indicating that genes with such promoter CpG islands tend to be more active. 相似文献
11.
Hyman RW Fung E Conway A Kurdi O Mao J Miranda M Nakao B Rowley D Tamaki T Wang F Davis RW 《Nature》2002,419(6906):534-537
The human malaria parasite Plasmodium falciparum is responsible for the death of more than a million people every year. To stimulate basic research on the disease, and to promote the development of effective drugs and vaccines against the parasite, the complete genome of P. falciparum clone 3D7 has been sequenced, using a chromosome-by-chromosome shotgun strategy. Here we report the nucleotide sequence of the third largest of the parasite's 14 chromosomes, chromosome 12, which comprises about 10% of the 23-megabase genome. As the most (A + T)-rich (80.6%) genome sequenced to date, the P. falciparum genome presented severe problems during the assembly of primary sequence reads. We discuss the methodology that yielded a finished and fully contiguous sequence for chromosome 12. The biological implications of the sequence data are more thoroughly discussed in an accompanying Article (ref. 3). 相似文献
12.
Wallis JW Aerts J Groenen MA Crooijmans RP Layman D Graves TA Scheer DE Kremitzki C Fedele MJ Mudd NK Cardenas M Higginbotham J Carter J McGrane R Gaige T Mead K Walker J Albracht D Davito J Yang SP Leong S Chinwalla A Sekhon M Wylie K Dodgson J Romanov MN Cheng H de Jong PJ Osoegawa K Nefedov M Zhang H McPherson JD Krzywinski M Schein J Hillier L Mardis ER Wilson RK Warren WC 《Nature》2004,432(7018):761-764
Strategies for assembling large, complex genomes have evolved to include a combination of whole-genome shotgun sequencing and hierarchal map-assisted sequencing. Whole-genome maps of all types can aid genome assemblies, generally starting with low-resolution cytogenetic maps and ending with the highest resolution of sequence. Fingerprint clone maps are based upon complete restriction enzyme digests of clones representative of the target genome, and ultimately comprise a near-contiguous path of clones across the genome. Such clone-based maps are used to validate sequence assembly order, supply long-range linking information for assembled sequences, anchor sequences to the genetic map and provide templates for closing gaps. Fingerprint maps are also a critical resource for subsequent functional genomic studies, because they provide a redundant and ordered sampling of the genome with clones. In an accompanying paper we describe the draft genome sequence of the chicken, Gallus gallus, the first species sequenced that is both a model organism and a global food source. Here we present a clone-based physical map of the chicken genome at 20-fold coverage, containing 260 contigs of overlapping clones. This map represents approximately 91% of the chicken genome and enables identification of chicken clones aligned to positions in other sequenced genomes. 相似文献
13.
Yuan Guan Qi Chen Junsong Pan Zheng Li Huanle He Aizhong Wu Rentao Song Run Cai 《自然科学进展》2008,18(2):143-148
A bacterial artificial chromosome (BAC) library consisting of 19,200 clones with an average insert size of 105 kb has been constructed from a cucumber (Cucumis sativus L.) inbred line S94; derived from a cultivar in North China. The entire library was equivalent to approximately 5 haploid cucumber genomes. To facilitate chromosome engineering and anchor the cucumber genetic linkage map to its chromosomes, 15 sequence-characterized amplified regions (SCAR) and seven simple sequence repeats (SSR) markers from each linkage group of cucumber were used to screen an ordered array of pooled BAC DNA with polymerase chain reaction (PCR). Fifteen markers gave at least two positive clones. As a result, 32 BAC clones representing 7 linkage groups of cucumber were identified, which further validated the genome coverage and utility of the library. This BAC library and linkage group specific clones provide essential resources for future research of the cucumber genome. 相似文献
14.
Tabata S Kaneko T Nakamura Y Kotani H Kato T Asamizu E Miyajima N Sasamoto S Kimura T Hosouchi T Kawashima K Kohara M Matsumoto M Matsuno A Muraki A Nakayama S Nakazaki N Naruo K Okumura S Shinpo S Takeuchi C Wada T Watanabe A Yamada M Yasuda M Sato S de la Bastide M Huang E Spiegel L Gnoj L O'Shaughnessy A Preston R Habermann K Murray J Johnson D Rohlfing T Nelson J Stoneking T Pepin K Spieth J Sekhon M Armstrong J Becker M Belter E Cordum H Cordes M Courtney L Courtney W Dante M Du H 《Nature》2000,408(6814):823-826
The genome of the model plant Arabidopsis thaliana has been sequenced by an international collaboration, The Arabidopsis Genome Initiative. Here we report the complete sequence of chromosome 5. This chromosome is 26 megabases long; it is the second largest Arabidopsis chromosome and represents 21% of the sequenced regions of the genome. The sequence of chromosomes 2 and 4 have been reported previously and that of chromosomes 1 and 3, together with an analysis of the complete genome sequence, are reported in this issue. Analysis of the sequence of chromosome 5 yields further insights into centromere structure and the sequence determinants of heterochromatin condensation. The 5,874 genes encoded on chromosome 5 reveal several new functions in plants, and the patterns of gene organization provide insights into the mechanisms and extent of genome evolution in plants. 相似文献
15.
Yuan Guan Qi Chen Junsong Pan Zheng Li Huanle He Aizhong Wu Rentao Song Run Cai 《自然科学进展(英文版)》2008,18(2):143-148
A bacterial artificial chromosome (BAC) library consisting of 19,200 clones with an average insert size of 105 kb has been constructed from a cucumber (Cucumis sativus L.) inbred line S94; derived from a cultivar in North China. The entire library was equivalent to approximately 5 haploid cucumber genomes. To facilitate chromosome engineering and anchor the cucumber genetic linkage map to its chromosomes, 15 sequence-characterized amplified regions (SCAR) and seven simple sequence repeats (SSR) markers from each linkage group of cucumber were used to screen an ordered array of pooled BAC DNA with polymerase chain reaction (PCR). Fifteen markers gave at least two positive clones. As a result, 32 BAC clones representing 7 linkage groups of cucumber were identified, which further validated the genome coverage and utility of the library. This BAC library and linkage group specific clones provide essential resources for future research of the cucumber genome. 相似文献
16.
McPherson JD Marra M Hillier L Waterston RH Chinwalla A Wallis J Sekhon M Wylie K Mardis ER Wilson RK Fulton R Kucaba TA Wagner-McPherson C Barbazuk WB Gregory SG Humphray SJ French L Evans RS Bethel G Whittaker A Holden JL McCann OT Dunham A Soderlund C Scott CE Bentley DR Schuler G Chen HC Jang W Green ED Idol JR Maduro VV Montgomery KT Lee E Miller A Emerling S Kucherlapati Gibbs R Scherer S Gorrell JH Sodergren E Clerc-Blankenburg K Tabor P Naylor S Garcia D de Jong PJ Catanese JJ Nowak N 《Nature》2001,409(6822):934-941
The human genome is by far the largest genome to be sequenced, and its size and complexity present many challenges for sequence assembly. The International Human Genome Sequencing Consortium constructed a map of the whole genome to enable the selection of clones for sequencing and for the accurate assembly of the genome sequence. Here we report the construction of the whole-genome bacterial artificial chromosome (BAC) map and its integration with previous landmark maps and information from mapping efforts focused on specific chromosomal regions. We also describe the integration of sequence data with the map. 相似文献
17.
Nusbaum C Zody MC Borowsky ML Kamal M Kodira CD Taylor TD Whittaker CA Chang JL Cuomo CA Dewar K FitzGerald MG Yang X Abouelleil A Allen NR Anderson S Bloom T Bugalter B Butler J Cook A DeCaprio D Engels R Garber M Gnirke A Hafez N Hall JL Norman CH Itoh T Jaffe DB Kuroki Y Lehoczky J Lui A Macdonald P Mauceli E Mikkelsen TS Naylor JW Nicol R Nguyen C Noguchi H O'Leary SB O'Neill K Piqani B Smith CL Talamas JA Topham K Totoki Y Toyoda A Wain HM Young SK Zeng Q Zimmer AR Fujiyama A Hattori M 《Nature》2005,437(7058):551-555
Chromosome 18 appears to have the lowest gene density of any human chromosome and is one of only three chromosomes for which trisomic individuals survive to term. There are also a number of genetic disorders stemming from chromosome 18 trisomy and aneuploidy. Here we report the finished sequence and gene annotation of human chromosome 18, which will allow a better understanding of the normal and disease biology of this chromosome. Despite the low density of protein-coding genes on chromosome 18, we find that the proportion of non-protein-coding sequences evolutionarily conserved among mammals is close to the genome-wide average. Extending this analysis to the entire human genome, we find that the density of conserved non-protein-coding sequences is largely uncorrelated with gene density. This has important implications for the nature and roles of non-protein-coding sequence elements. 相似文献
18.
Gardner MJ Hall N Fung E White O Berriman M Hyman RW Carlton JM Pain A Nelson KE Bowman S Paulsen IT James K Eisen JA Rutherford K Salzberg SL Craig A Kyes S Chan MS Nene V Shallom SJ Suh B Peterson J Angiuoli S Pertea M Allen J Selengut J Haft D Mather MW Vaidya AB Martin DM Fairlamb AH Fraunholz MJ Roos DS Ralph SA McFadden GI Cummings LM Subramanian GM Mungall C Venter JC Carucci DJ Hoffman SL Newbold C Davis RW Fraser CM Barrell B 《Nature》2002,419(6906):498-511
The parasite Plasmodium falciparum is responsible for hundreds of millions of cases of malaria, and kills more than one million African children annually. Here we report an analysis of the genome sequence of P. falciparum clone 3D7. The 23-megabase nuclear genome consists of 14 chromosomes, encodes about 5,300 genes, and is the most (A + T)-rich genome sequenced to date. Genes involved in antigenic variation are concentrated in the subtelomeric regions of the chromosomes. Compared to the genomes of free-living eukaryotic microbes, the genome of this intracellular parasite encodes fewer enzymes and transporters, but a large proportion of genes are devoted to immune evasion and host-parasite interactions. Many nuclear-encoded proteins are targeted to the apicoplast, an organelle involved in fatty-acid and isoprenoid metabolism. The genome sequence provides the foundation for future studies of this organism, and is being exploited in the search for new drugs and vaccines to fight malaria. 相似文献
19.
Construction of a BAC library from cucumber (Cucumis sativus L.) and identification of linkage group specific clones 总被引:2,自引:0,他引:2
Yuan Guan ;Qi Chen ;Junsong Pan ;Zheng Li ;Huanle He ;Aizhong Wu ;Rentao Song ;Run Cai 《自然科学进展(英文版)》2008,18(2):143-147
A bacterial artificial chromosome (BAC) library consisting of 19,200 clones with an average insert size of 105 kb has been constructed from a cucumber (Cucumis sativus L.) inbred line S94, derived from a cultivar in North China. The entire library was equivalent to approximately 5 haploid cucumber genomes. To facilitate chromosome engineering and anchor the cucumber genetic linkage map to its chromosomes, 15 sequence-characterized amplified regions (SCAR) and seven simple sequence repeats (SSR) markers from each linkage group of cucumber were used to screen an ordered array of pooled BAC DNA with polymerase chain reaction (PCR). Fifteen markers gave at least two positive clones. As a result, 22 BAC clones representing 7 linkage groups of cucumber were identified, which further validated the genome coverage and utility of the library. This BAC library and linkage group specific clones provide essential resources for future research of the cucumber genome. 相似文献
20.
Zody MC Garber M Sharpe T Young SK Rowen L O'Neill K Whittaker CA Kamal M Chang JL Cuomo CA Dewar K FitzGerald MG Kodira CD Madan A Qin S Yang X Abbasi N Abouelleil A Arachchi HM Baradarani L Birditt B Bloom S Bloom T Borowsky ML Burke J Butler J Cook A DeArellano K DeCaprio D Dorris L Dors M Eichler EE Engels R Fahey J Fleetwood P Friedman C Gearin G Hall JL Hensley G Johnson E Jones C Kamat A Kaur A Locke DP Madan A Munson G Jaffe DB Lui A Macdonald P Mauceli E Naylor JW Nesbitt R Nicol R 《Nature》2006,440(7084):671-675
Here we present a finished sequence of human chromosome 15, together with a high-quality gene catalogue. As chromosome 15 is one of seven human chromosomes with a high rate of segmental duplication, we have carried out a detailed analysis of the duplication structure of the chromosome. Segmental duplications in chromosome 15 are largely clustered in two regions, on proximal and distal 15q; the proximal region is notable because recombination among the segmental duplications can result in deletions causing Prader-Willi and Angelman syndromes. Sequence analysis shows that the proximal and distal regions of 15q share extensive ancient similarity. Using a simple approach, we have been able to reconstruct many of the events by which the current duplication structure arose. We find that most of the intrachromosomal duplications seem to share a common ancestry. Finally, we demonstrate that some remaining gaps in the genome sequence are probably due to structural polymorphisms between haplotypes; this may explain a significant fraction of the gaps remaining in the human genome. 相似文献