共查询到20条相似文献,搜索用时 15 毫秒
1.
As consortium plans free SNP map of human genome 总被引:5,自引:0,他引:5
2.
An SNP map of human chromosome 22 总被引:35,自引:0,他引:35
Mullikin JC Hunt SE Cole CG Mortimore BJ Rice CM Burton J Matthews LH Pavitt R Plumb RW Sims SK Ainscough RM Attwood J Bailey JM Barlow K Bruskiewich RM Butcher PN Carter NP Chen Y Clee CM Coggill PC Davies J Davies RM Dawson E Francis MD Joy AA Lamble RG Langford CF Macarthy J Mall V Moreland A Overton-Larty EK Ross MT Smith LC Steward CA Sulston JE Tinsley EJ Turney KJ Willey DL Wilson GD McMurray AA Dunham I Rogers J Bentley DR 《Nature》2000,407(6803):516-520
The human genome sequence will provide a reference for measuring DNA sequence variation in human populations. Sequence variants are responsible for the genetic component of individuality, including complex characteristics such as disease susceptibility and drug response. Most sequence variants are single nucleotide polymorphisms (SNPs), where two alternate bases occur at one position. Comparison of any two genomes reveals around 1 SNP per kilobase. A sufficiently dense map of SNPs would allow the detection of sequence variants responsible for particular characteristics on the basis that they are associated with a specific SNP allele. Here we have evaluated large-scale sequencing approaches to obtaining SNPs, and have constructed a map of 2,730 SNPs on human chromosome 22. Most of the SNPs are within 25 kilobases of a transcribed exon, and are valuable for association studies. We have scaled up the process, detecting over 65,000 SNPs in the genome as part of The SNP Consortium programme, which is on target to build a map of 1 SNP every 5 kilobases that is integrated with the human genome sequence and that is freely available in the public domain. 相似文献
3.
A map of human genome variation from population-scale sequencing 总被引:2,自引:0,他引:2
Genomes Project Consortium 《Nature》2010,467(7319):1061-1073
The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype. Here we present results of the pilot phase of the project, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms. We undertook three projects: low-coverage whole-genome sequencing of 179 individuals from four populations; high-coverage sequencing of two mother-father-child trios; and exon-targeted sequencing of 697 individuals from seven populations. We describe the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants, most of which were previously undescribed. We show that, because we have catalogued the vast majority of common variation, over 95% of the currently accessible variants found in any individual are present in this data set. On average, each person is found to carry approximately 250 to 300 loss-of-function variants in annotated genes and 50 to 100 variants previously implicated in inherited disorders. We demonstrate how these results can be used to inform association and functional studies. From the two trios, we directly estimate the rate of de novo germline base substitution mutations to be approximately 10(-8) per base pair per generation. We explore the data with regard to signatures of natural selection, and identify a marked reduction of genetic variation in the neighbourhood of genes, due to selection at linked sites. These methods and public data will support the next phase of human genetic research. 相似文献
4.
McPherson JD Marra M Hillier L Waterston RH Chinwalla A Wallis J Sekhon M Wylie K Mardis ER Wilson RK Fulton R Kucaba TA Wagner-McPherson C Barbazuk WB Gregory SG Humphray SJ French L Evans RS Bethel G Whittaker A Holden JL McCann OT Dunham A Soderlund C Scott CE Bentley DR Schuler G Chen HC Jang W Green ED Idol JR Maduro VV Montgomery KT Lee E Miller A Emerling S Kucherlapati Gibbs R Scherer S Gorrell JH Sodergren E Clerc-Blankenburg K Tabor P Naylor S Garcia D de Jong PJ Catanese JJ Nowak N 《Nature》2001,409(6822):934-941
The human genome is by far the largest genome to be sequenced, and its size and complexity present many challenges for sequence assembly. The International Human Genome Sequencing Consortium constructed a map of the whole genome to enable the selection of clones for sequencing and for the accurate assembly of the genome sequence. Here we report the construction of the whole-genome bacterial artificial chromosome (BAC) map and its integration with previous landmark maps and information from mapping efforts focused on specific chromosomal regions. We also describe the integration of sequence data with the map. 相似文献
5.
A haplotype map of the human genome 总被引:2,自引:0,他引:2
International HapMap Consortium 《Nature》2005,437(7063):1299-1320
Inherited genetic variation has a critical but as yet largely uncharacterized role in human disease. Here we report a public database of common variation in the human genome: more than one million single nucleotide polymorphisms (SNPs) for which accurate and complete genotypes have been obtained in 269 DNA samples from four populations, including ten 500-kilobase regions in which essentially all information about common DNA variation has been extracted. These data document the generality of recombination hotspots, a block-like structure of linkage disequilibrium and low haplotype diversity, leading to substantial correlations of SNPs with many of their neighbours. We show how the HapMap resource can guide the design and analysis of genetic association studies, shed light on structural variation and recombination, and identify loci that may have been subject to natural selection during human evolution. 相似文献
6.
Initial sequencing and analysis of the human genome 总被引:11,自引:0,他引:11
Lander ES Linton LM Birren B Nusbaum C Zody MC Baldwin J Devon K Dewar K Doyle M FitzHugh W Funke R Gage D Harris K Heaford A Howland J Kann L Lehoczky J LeVine R McEwan P McKernan K Meldrim J Mesirov JP Miranda C Morris W Naylor J Raymond C Rosetti M Santos R Sheridan A Sougnez C Stange-Thomann N Stojanovic N Subramanian A Wyman D Rogers J Sulston J Ainscough R Beck S Bentley D Burton J Clee C Carter N Coulson A Deadman R Deloukas P Dunham A Dunham I Durbin R French L Grafham D Gregory S 《Nature》2001,409(6822):860-921
The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence. 相似文献
7.
The sequence of the human genome has dramatically accelerated biomedical research. Here I explore its impact, in the decade since its publication, on our understanding of the biological functions encoded in the genome, on the biological basis of inherited diseases and cancer, and on the evolution and history of the human species. I also discuss the road ahead in fulfilling the promise of genomics for medicine. 相似文献
8.
A second-generation linkage map of the human genome. 总被引:283,自引:0,他引:283
J Weissenbach G Gyapay C Dib A Vignal J Morissette P Millasseau G Vaysseix M Lathrop 《Nature》1992,359(6398):794-801
A linkage map of the human genome has been constructed based on the segregation analysis of 814 newly characterized polymorphic loci containing short tracts of (C-A)n repeats in a panel of DNAs from eight large families. Statistical linkage analysis placed 813 of the markers into 23 linkage groups corresponding to the 22 autosomes and the X chromosome; 605 show a heterozygosity above 0.7 and 553 could be ordered with odds ratios above 1,000:1. The distance spanned corresponds to approximately 90% of the estimated length of the human genome. 相似文献
9.
A high-resolution map of active promoters in the human genome 总被引:1,自引:0,他引:1
Kim TH Barrera LO Zheng M Qu C Singer MA Richmond TA Wu Y Green RD Ren B 《Nature》2005,436(7052):876-880
10.
Rothberg JM Hinz W Rearick TM Schultz J Mileski W Davey M Leamon JH Johnson K Milgrew MJ Edwards M Hoon J Simons JF Marran D Myers JW Davidson JF Branting A Nobile JR Puc BP Light D Clark TA Huber M Branciforte JT Stoner IB Cawley SE Lyons M Fu Y Homer N Sedova M Miao X Reed B Sabina J Feierstein E Schorn M Alanjary M Dimalanta E Dressman D Kasinskas R Sokolsky T Fidanza JA Namsaraev E McKernan KJ Williams A Roth GT Bustillo J 《Nature》2011,475(7356):348-352
The seminal importance of DNA sequencing to the life sciences, biotechnology and medicine has driven the search for more scalable and lower-cost solutions. Here we describe a DNA sequencing technology in which scalable, low-cost semiconductor manufacturing techniques are used to make an integrated circuit able to directly perform non-optical DNA sequencing of genomes. Sequence data are obtained by directly sensing the ions produced by template-directed DNA polymerase synthesis using all-natural nucleotides on this massively parallel semiconductor-sensing device or ion chip. The ion chip contains ion-sensitive, field-effect transistor-based sensors in perfect register with 1.2 million wells, which provide confinement and allow parallel, simultaneous detection of independent sequencing reactions. Use of the most widely used technology for constructing integrated circuits, the complementary metal-oxide semiconductor (CMOS) process, allows for low-cost, large-scale production and scaling of the device to higher densities and larger array sizes. We show the performance of the system by sequencing three bacterial genomes, its robustness and scalability by producing ion chips with up to 10 times as many sensors and sequencing a human genome. 相似文献
11.
Bentley DR Balasubramanian S Swerdlow HP Smith GP Milton J Brown CG Hall KP Evers DJ Barnes CL Bignell HR Boutell JM Bryant J Carter RJ Keira Cheetham R Cox AJ Ellis DJ Flatbush MR Gormley NA Humphray SJ Irving LJ Karbelashvili MS Kirk SM Li H Liu X Maisinger KS Murray LJ Obradovic B Ost T Parkinson ML Pratt MR Rasolonjatovo IM Reed MT Rigatti R Rodighiero C Ross MT Sabot A Sankar SV Scally A Schroth GP Smith ME Smith VP Spiridou A Torrance PE Tzonev SS Vermaas EH Walter K Wu X Zhang L Alam MD 《Nature》2008,456(7218):53-59
DNA sequence information underpins genetic research, enabling discoveries of important biological or medical benefit. Sequencing projects have traditionally used long (400-800 base pair) reads, but the existence of reference sequences for the human and many other genomes makes it possible to develop new, fast approaches to re-sequencing, whereby shorter reads are compared to a reference to identify intraspecies genetic variation. Here we report an approach that generates several billion bases of accurate nucleotide sequence per experiment at low cost. Single molecules of DNA are attached to a flat surface, amplified in situ and used as templates for synthetic sequencing with fluorescent reversible terminator deoxyribonucleotides. Images of the surface are analysed to generate high-quality sequence. We demonstrate application of this approach to human genome sequencing on flow-sorted X chromosomes and then scale the approach to determine the genome sequence of a male Yoruba from Ibadan, Nigeria. We build an accurate consensus sequence from >30x average depth of paired 35-base reads. We characterize four million single-nucleotide polymorphisms and four hundred thousand structural variants, many of which were previously unknown. Our approach is effective for accurate, rapid and economical whole-genome re-sequencing and many other biomedical applications. 相似文献
12.
Wallis JW Aerts J Groenen MA Crooijmans RP Layman D Graves TA Scheer DE Kremitzki C Fedele MJ Mudd NK Cardenas M Higginbotham J Carter J McGrane R Gaige T Mead K Walker J Albracht D Davito J Yang SP Leong S Chinwalla A Sekhon M Wylie K Dodgson J Romanov MN Cheng H de Jong PJ Osoegawa K Nefedov M Zhang H McPherson JD Krzywinski M Schein J Hillier L Mardis ER Wilson RK Warren WC 《Nature》2004,432(7018):761-764
Strategies for assembling large, complex genomes have evolved to include a combination of whole-genome shotgun sequencing and hierarchal map-assisted sequencing. Whole-genome maps of all types can aid genome assemblies, generally starting with low-resolution cytogenetic maps and ending with the highest resolution of sequence. Fingerprint clone maps are based upon complete restriction enzyme digests of clones representative of the target genome, and ultimately comprise a near-contiguous path of clones across the genome. Such clone-based maps are used to validate sequence assembly order, supply long-range linking information for assembled sequences, anchor sequences to the genetic map and provide templates for closing gaps. Fingerprint maps are also a critical resource for subsequent functional genomic studies, because they provide a redundant and ordered sampling of the genome with clones. In an accompanying paper we describe the draft genome sequence of the chicken, Gallus gallus, the first species sequenced that is both a model organism and a global food source. Here we present a clone-based physical map of the chicken genome at 20-fold coverage, containing 260 contigs of overlapping clones. This map represents approximately 91% of the chicken genome and enables identification of chicken clones aligned to positions in other sequenced genomes. 相似文献
13.
Mills RE Walter K Stewart C Handsaker RE Chen K Alkan C Abyzov A Yoon SC Ye K Cheetham RK Chinwalla A Conrad DF Fu Y Grubert F Hajirasouliha I Hormozdiari F Iakoucheva LM Iqbal Z Kang S Kidd JM Konkel MK Korn J Khurana E Kural D Lam HY Leng J Li R Li Y Lin CY Luo R Mu XJ Nemesh J Peckham HE Rausch T Scally A Shi X Stromberg MP Stütz AM Urban AE Walker JA Wu J Zhang Y Zhang ZD Batzer MA Ding L Marth GT McVean G Sebat J Snyder M Wang J Ye K Eichler EE Gerstein MB Hurles ME Lee C McCarroll SA 《Nature》2011,470(7332):59-65
Genomic structural variants (SVs) are abundant in humans, differing from other forms of variation in extent, origin and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (that is, copy number variants) based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations. Our map encompassed 22,025 deletions and 6,000 additional SVs, including insertions and tandem duplications. Most SVs (53%) were mapped to nucleotide resolution, which facilitated analysing their origin and functional impact. We examined numerous whole and partial gene deletions with a genotyping approach and observed a depletion of gene disruptions amongst high frequency deletions. Furthermore, we observed differences in the size spectra of SVs originating from distinct formation mechanisms, and constructed a map of SV hotspots formed by common mechanisms. Our analytical framework and SV map serves as a resource for sequencing-based association studies. 相似文献
14.
Large-scale sequencing of human influenza reveals the dynamic nature of viral genome evolution 总被引:1,自引:0,他引:1
Ghedin E Sengamalay NA Shumway M Zaborsky J Feldblyum T Subbu V Spiro DJ Sitz J Koo H Bolotov P Dernovoy D Tatusova T Bao Y St George K Taylor J Lipman DJ Fraser CM Taubenberger JK Salzberg SL 《Nature》2005,437(7062):1162-1166
Influenza viruses are remarkably adept at surviving in the human population over a long timescale. The human influenza A virus continues to thrive even among populations with widespread access to vaccines, and continues to be a major cause of morbidity and mortality. The virus mutates from year to year, making the existing vaccines ineffective on a regular basis, and requiring that new strains be chosen for a new vaccine. Less-frequent major changes, known as antigenic shift, create new strains against which the human population has little protective immunity, thereby causing worldwide pandemics. The most recent pandemics include the 1918 'Spanish' flu, one of the most deadly outbreaks in recorded history, which killed 30-50 million people worldwide, the 1957 'Asian' flu, and the 1968 'Hong Kong' flu. Motivated by the need for a better understanding of influenza evolution, we have developed flexible protocols that make it possible to apply large-scale sequencing techniques to the highly variable influenza genome. Here we report the results of sequencing 209 complete genomes of the human influenza A virus, encompassing a total of 2,821,103 nucleotides. In addition to increasing markedly the number of publicly available, complete influenza virus genomes, we have discovered several anomalies in these first 209 genomes that demonstrate the dynamic nature of influenza transmission and evolution. This new, large-scale sequencing effort promises to provide a more comprehensive picture of the evolution of influenza viruses and of their pattern of transmission through human and animal populations. All data from this project are being deposited, without delay, in public archives. 相似文献
15.
Nègre N Brown CD Ma L Bristow CA Miller SW Wagner U Kheradpour P Eaton ML Loriaux P Sealfon R Li Z Ishii H Spokony RF Chen J Hwang L Cheng C Auburn RP Davis MB Domanus M Shah PK Morrison CA Zieba J Suchy S Senderowicz L Victorsen A Bild NA Grundstad AJ Hanley D MacAlpine DM Mannervik M Venken K Bellen H White R Gerstein M Russell S Grossman RL Ren B Posakony JW Kellis M White KP 《Nature》2011,471(7339):527-531
16.
A physical map of the mouse genome 总被引:1,自引:0,他引:1
Gregory SG Sekhon M Schein J Zhao S Osoegawa K Scott CE Evans RS Burridge PW Cox TV Fox CA Hutton RD Mullenger IR Phillips KJ Smith J Stalker J Threadgold GJ Birney E Wylie K Chinwalla A Wallis J Hillier L Carter J Gaige T Jaeger S Kremitzki C Layman D Maas J McGrane R Mead K Walker R Jones S Smith M Asano J Bosdet I Chan S Chittaranjan S Chiu R Fjell C Fuhrmann D Girn N Gray C Guin R Hsiao L Krzywinski M Kutsche R Lee SS Mathewson C McLeavy C Messervier S Ness S Pandoh P Prabhu AL Saeedi P 《Nature》2002,418(6899):743-750
A physical map of a genome is an essential guide for navigation, allowing the location of any gene or other landmark in the chromosomal DNA. We have constructed a physical map of the mouse genome that contains 296 contigs of overlapping bacterial clones and 16,992 unique markers. The mouse contigs were aligned to the human genome sequence on the basis of 51,486 homology matches, thus enabling use of the conserved synteny (correspondence between chromosome blocks) of the two genomes to accelerate construction of the mouse map. The map provides a framework for assembly of whole-genome shotgun sequence data, and a tile path of clones for generation of the reference sequence. Definition of the human-mouse alignment at this level of resolution enables identification of a mouse clone that corresponds to almost any position in the human genome. The human sequence may be used to facilitate construction of other mammalian genome maps using the same strategy. 相似文献
17.
A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms 总被引:69,自引:0,他引:69
Sachidanandam R Weissman D Schmidt SC Kakol JM Stein LD Marth G Sherry S Mullikin JC Mortimore BJ Willey DL Hunt SE Cole CG Coggill PC Rice CM Ning Z Rogers J Bentley DR Kwok PY Mardis ER Yeh RT Schultz B Cook L Davenport R Dante M Fulton L Hillier L Waterston RH McPherson JD Gilman B Schaffner S Van Etten WJ Reich D Higgins J Daly MJ Blumenstiel B Baldwin J Stange-Thomann N Zody MC Linton L Lander ES Altshuler D;International SNP Map Working Group 《Nature》2001,409(6822):928-933
We describe a map of 1.42 million single nucleotide polymorphisms (SNPs) distributed throughout the human genome, providing an average density on available sequence of one SNP every 1.9 kilobases. These SNPs were primarily discovered by two projects: The SNP Consortium and the analysis of clone overlaps by the International Human Genome Sequencing Consortium. The map integrates all publicly available SNPs with described genes and other genomic features. We estimate that 60,000 SNPs fall within exon (coding and untranslated regions), and 85% of exons are within 5 kb of the nearest SNP. Nucleotide diversity varies greatly across the genome, in a manner broadly consistent with a standard population genetic model of human history. This high-density SNP map provides a public resource for defining haplotype variation across the genome, and should help to identify biomedically important genes for diagnosis and therapy. 相似文献
18.
Wheeler DA Srinivasan M Egholm M Shen Y Chen L McGuire A He W Chen YJ Makhijani V Roth GT Gomes X Tartaro K Niazi F Turcotte CL Irzyk GP Lupski JR Chinault C Song XZ Liu Y Yuan Y Nazareth L Qin X Muzny DM Margulies M Weinstock GM Gibbs RA Rothberg JM 《Nature》2008,452(7189):872-876
The association of genetic variation with disease and drug response, and improvements in nucleic acid technologies, have given great optimism for the impact of 'genomic medicine'. However, the formidable size of the diploid human genome, approximately 6 gigabases, has prevented the routine application of sequencing methods to deciphering complete individual human genomes. To realize the full potential of genomics for human health, this limitation must be overcome. Here we report the DNA sequence of a diploid genome of a single individual, James D. Watson, sequenced to 7.4-fold redundancy in two months using massively parallel sequencing in picolitre-size reaction vessels. This sequence was completed in two months at approximately one-hundredth of the cost of traditional capillary electrophoresis methods. Comparison of the sequence to the reference genome led to the identification of 3.3 million single nucleotide polymorphisms, of which 10,654 cause amino-acid substitution within the coding sequence. In addition, we accurately identified small-scale (2-40,000 base pair (bp)) insertion and deletion polymorphism as well as copy number variation resulting in the large-scale gain and loss of chromosomal segments ranging from 26,000 to 1.5 million base pairs. Overall, these results agree well with recent results of sequencing of a single individual by traditional methods. However, in addition to being faster and significantly less expensive, this sequencing technology avoids the arbitrary loss of genomic sequences inherent in random shotgun sequencing by bacterial cloning because it amplifies DNA in a cell-free system. As a result, we further demonstrate the acquisition of novel human sequence, including novel genes not previously identified by traditional genomic sequencing. This is the first genome sequenced by next-generation technologies. Therefore it is a pilot for the future challenges of 'personalized genome sequencing'. 相似文献
19.
A map of the cis-regulatory sequences in the mouse genome 总被引:1,自引:0,他引:1
Y Shen F Yue DF McCleary Z Ye L Edsall S Kuan U Wagner J Dixon L Lee VV Lobanenkov B Ren 《Nature》2012,488(7409):116-120
20.
Initial sequencing and comparative analysis of the mouse genome 总被引:2,自引:0,他引:2
Mouse Genome Sequencing Consortium Waterston RH Lindblad-Toh K Birney E Rogers J Abril JF Agarwal P Agarwala R Ainscough R Alexandersson M An P Antonarakis SE Attwood J Baertsch R Bailey J Barlow K Beck S Berry E Birren B Bloom T Bork P Botcherby M Bray N Brent MR Brown DG Brown SD Bult C Burton J Butler J Campbell RD Carninci P Cawley S Chiaromonte F Chinwalla AT Church DM Clamp M Clee C Collins FS Cook LL Copley RR Coulson A Couronne O Cuff J Curwen V Cutts T Daly M David R Davies J 《Nature》2002,420(6915):520-562
The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism. 相似文献