共查询到20条相似文献,搜索用时 46 毫秒
1.
Bass AJ Lawrence MS Brace LE Ramos AH Drier Y Cibulskis K Sougnez C Voet D Saksena G Sivachenko A Jing R Parkin M Pugh T Verhaak RG Stransky N Boutin AT Barretina J Solit DB Vakiani E Shao W Mishina Y Warmuth M Jimenez J Chiang DY Signoretti S Kaelin WG Spardy N Hahn WC Hoshida Y Ogino S Depinho RA Chin L Garraway LA Fuchs CS Baselga J Tabernero J Gabriel S Lander ES Getz G Meyerson M 《Nature genetics》2011,43(10):964-968
Prior studies have identified recurrent oncogenic mutations in colorectal adenocarcinoma and have surveyed exons of protein-coding genes for mutations in 11 affected individuals. Here we report whole-genome sequencing from nine individuals with colorectal cancer, including primary colorectal tumors and matched adjacent non-tumor tissues, at an average of 30.7× and 31.9× coverage, respectively. We identify an average of 75 somatic rearrangements per tumor, including complex networks of translocations between pairs of chromosomes. Eleven rearrangements encode predicted in-frame fusion proteins, including a fusion of VTI1A and TCF7L2 found in 3 out of 97 colorectal cancers. Although TCF7L2 encodes TCF4, which cooperates with β-catenin in colorectal carcinogenesis, the fusion lacks the TCF4 β-catenin-binding domain. We found a colorectal carcinoma cell line harboring the fusion gene to be dependent on VTI1A-TCF7L2 for anchorage-independent growth using RNA interference-mediated knockdown. This study shows previously unidentified levels of genomic rearrangements in colorectal carcinoma that can lead to essential gene fusions and other oncogenic events. 相似文献
2.
Whole-genome sequencing and comprehensive variant analysis of a Japanese individual using massively parallel sequencing 总被引:1,自引:0,他引:1
Fujimoto A Nakagawa H Hosono N Nakano K Abe T Boroevich KA Nagasaki M Yamaguchi R Shibuya T Kubo M Miyano S Nakamura Y Tsunoda T 《Nature genetics》2010,42(11):931-936
We report the analysis of a Japanese male using high-throughput sequencing to × 40 coverage. More than 99% of the sequence reads were mapped to the reference human genome. Using a Bayesian decision method, we identified 3,132,608 single nucleotide variations (SNVs). Comparison with six previously reported genomes revealed an excess of singleton nonsense and nonsynonymous SNVs, as well as singleton SNVs in conserved non-coding regions. We also identified 5,319 deletions smaller than 10 kb with high accuracy, in addition to copy number variations and rearrangements. De novo assembly of the unmapped sequence reads generated around 3 Mb of novel sequence, which showed high similarity to non-reference human genomes and the human herpesvirus 4 genome. Our analysis suggests that considerable variation remains undiscovered in the human genome and that whole-genome sequencing is an invaluable tool for obtaining a complete understanding of human genetic variation. 相似文献
3.
Fujimoto A Totoki Y Abe T Boroevich KA Hosoda F Nguyen HH Aoki M Hosono N Kubo M Miya F Arai Y Takahashi H Shirakihara T Nagasaki M Shibuya T Nakano K Watanabe-Makino K Tanaka H Nakamura H Kusuda J Ojima H Shimada K Okusaka T Ueno M Shigekawa Y Kawakami Y Arihiro K Ohdan H Gotoh K Ishikawa O Ariizumi S Yamamoto M Yamada T Chayama K Kosuge T Yamaue H Kamatani N Miyano S Nakagama H Nakamura Y Tsunoda T Shibata T Nakagawa H 《Nature genetics》2012,44(7):760-764
Hepatocellular carcinoma (HCC) is the third leading cause of cancer-related death worldwide. We sequenced and analyzed the whole genomes of 27 HCCs, 25 of which were associated with hepatitis B or C virus infections, including two sets of multicentric tumors. Although no common somatic mutations were identified in the multicentric tumor pairs, their whole-genome substitution patterns were similar, suggesting that these tumors developed from independent mutations, although their shared etiological backgrounds may have strongly influenced their somatic mutation patterns. Statistical and functional analyses yielded a list of recurrently mutated genes. Multiple chromatin regulators, including ARID1A, ARID1B, ARID2, MLL and MLL3, were mutated in ~50% of the tumors. Hepatitis B virus genome integration in the TERT locus was frequently observed in a high clonal proportion. Our whole-genome sequencing analysis of HCCs identified the influence of etiological background on somatic mutation patterns and subsequent carcinogenesis, as well as recurrent mutations in chromatin regulators in HCCs. 相似文献
4.
Stacey SN Sulem P Jonasdottir A Masson G Gudmundsson J Gudbjartsson DF Magnusson OT Gudjonsson SA Sigurgeirsson B Thorisdottir K Ragnarsson R Benediktsdottir KR Nexø BA Tjønneland A Overvad K Rudnai P Gurzau E Koppova K Hemminki K Corredera C Fuentelsaz V Grasa P Navarrete S Fuertes F García-Prats MD Sanambrosio E Panadero A De Juan A Garcia A Rivera F Planelles D Soriano V Requena C Aben KK van Rossum MM Cremers RG van Oort IM van Spronsen DJ Schalken JA Peters WH Helfand BT Donovan JL 《Nature genetics》2011,43(11):1098-1103
To identify new risk variants for cutaneous basal cell carcinoma, we performed a genome-wide association study of 16 million SNPs identified through whole-genome sequencing of 457 Icelanders. We imputed genotypes for 41,675 Illumina SNP chip-typed Icelanders and their relatives. In the discovery phase, the strongest signal came from rs78378222[C] (odds ratio (OR) = 2.36, P = 5.2 × 10(-17)), which has a frequency of 0.0192 in the Icelandic population. We then confirmed this association in non-Icelandic samples (OR = 1.75, P = 0.0060; overall OR = 2.16, P = 2.2 × 10(-20)). rs78378222 is in the 3' untranslated region of TP53 and changes the AATAAA polyadenylation signal to AATACA, resulting in impaired 3'-end processing of TP53 mRNA. Investigation of other tumor types identified associations of this SNP with prostate cancer (OR = 1.44, P = 2.4 × 10(-6)), glioma (OR = 2.35, P = 1.0 × 10(-5)) and colorectal adenoma (OR = 1.39, P = 1.6 × 10(-4)). However, we observed no effect for breast cancer, a common Li-Fraumeni syndrome tumor (OR = 1.06, P = 0.57, 95% confidence interval 0.88-1.27). 相似文献
5.
Extremely low-coverage sequencing and imputation increases power for genome-wide association studies 总被引:1,自引:0,他引:1
Pasaniuc B Rohland N McLaren PJ Garimella K Zaitlen N Li H Gupta N Neale BM Daly MJ Sklar P Sullivan PF Bergen S Moran JL Hultman CM Lichtenstein P Magnusson P Purcell SM Haas DW Liang L Sunyaev S Patterson N de Bakker PI Reich D Price AL 《Nature genetics》2012,44(6):631-635
Genome-wide association studies (GWAS) have proven to be a powerful method to identify common genetic variants contributing to susceptibility to common diseases. Here, we show that extremely low-coverage sequencing (0.1-0.5×) captures almost as much of the common (>5%) and low-frequency (1-5%) variation across the genome as SNP arrays. As an empirical demonstration, we show that genome-wide SNP genotypes can be inferred at a mean r(2) of 0.71 using off-target data (0.24× average coverage) in a whole-exome study of 909 samples. Using both simulated and real exome-sequencing data sets, we show that association statistics obtained using extremely low-coverage sequencing data attain similar P values at known associated variants as data from genotyping arrays, without an excess of false positives. Within the context of reductions in sample preparation and sequencing costs, funds invested in extremely low-coverage sequencing can yield several times the effective sample size of GWAS based on SNP array data and a commensurate increase in statistical power. 相似文献
6.
Mitreva M Jasmer DP Zarlenga DS Wang Z Abubucker S Martin J Taylor CM Yin Y Fulton L Minx P Yang SP Warren WC Fulton RS Bhonagiri V Zhang X Hallsworth-Pepin K Clifton SW McCarter JP Appleton J Mardis ER Wilson RK 《Nature genetics》2011,43(3):228-235
Genome evolution studies for the phylum Nematoda have been limited by focusing on comparisons involving Caenorhabditis elegans. We report a draft genome sequence of Trichinella spiralis, a food-borne zoonotic parasite, which is the most common cause of human trichinellosis. This parasitic nematode is an extant member of a clade that diverged early in the evolution of the phylum, enabling identification of archetypical genes and molecular signatures exclusive to nematodes. We sequenced the 64-Mb nuclear genome, which is estimated to contain 15,808 protein-coding genes, at ~35-fold coverage using whole-genome shotgun and hierarchal map-assisted sequencing. Comparative genome analyses support intrachromosomal rearrangements across the phylum, disproportionate numbers of protein family deaths over births in parasitic compared to a non-parasitic nematode and a preponderance of gene-loss and -gain events in nematodes relative to Drosophila melanogaster. This genome sequence and the identified pan-phylum characteristics will contribute to genome evolution studies of Nematoda as well as strategies to combat global parasites of humans, food animals and crops. 相似文献
7.
Hu TT Pattyn P Bakker EG Cao J Cheng JF Clark RM Fahlgren N Fawcett JA Grimwood J Gundlach H Haberer G Hollister JD Ossowski S Ottilar RP Salamov AA Schneeberger K Spannagl M Wang X Yang L Nasrallah ME Bergelson J Carrington JC Gaut BS Schmutz J Mayer KF Van de Peer Y Grigoriev IV Nordborg M Weigel D Guo YL 《Nature genetics》2011,43(5):476-481
We report the 207-Mb genome sequence of the North American Arabidopsis lyrata strain MN47 based on 8.3× dideoxy sequence coverage. We predict 32,670 genes in this outcrossing species compared to the 27,025 genes in the selfing species Arabidopsis thaliana. The much smaller 125-Mb genome of A. thaliana, which diverged from A. lyrata 10 million years ago, likely constitutes the derived state for the family. We found evidence for DNA loss from large-scale rearrangements, but most of the difference in genome size can be attributed to hundreds of thousands of small deletions, mostly in noncoding DNA and transposons. Analysis of deletions and insertions still segregating in A. thaliana indicates that the process of DNA loss is ongoing, suggesting pervasive selection for a smaller genome. The high-quality reference genome sequence for A. lyrata will be an important resource for functional, evolutionary and ecological studies in the genus Arabidopsis. 相似文献
8.
The genome of woodland strawberry (Fragaria vesca) 总被引:3,自引:0,他引:3
Shulaev V Sargent DJ Crowhurst RN Mockler TC Folkerts O Delcher AL Jaiswal P Mockaitis K Liston A Mane SP Burns P Davis TM Slovin JP Bassil N Hellens RP Evans C Harkins T Kodira C Desany B Crasta OR Jensen RV Allan AC Michael TP Setubal JC Celton JM Rees DJ Williams KP Holt SH Ruiz Rojas JJ Chatterjee M Liu B Silva H Meisel L Adato A Filichkin SA Troggio M Viola R Ashman TL Wang H Dharmawardhana P Elser J Raja R Priest HD Bryant DW Fox SE Givan SA Wilhelm LJ Naithani S Christoffels A Salama DY 《Nature genetics》2011,43(2):109-116
9.
Sung WK Zheng H Li S Chen R Liu X Li Y Lee NP Lee WH Ariyaratne PN Tennakoon C Mulawadi FH Wong KF Liu AM Poon RT Fan ST Chan KL Gong Z Hu Y Lin Z Wang G Zhang Q Barber TD Chou WC Aggarwal A Hao K Zhou W Zhang C Hardwick J Buser C Xu J Kan Z Dai H Mao M Reinhard C Wang J Luk JM 《Nature genetics》2012,44(7):765-769
To survey hepatitis B virus (HBV) integration in liver cancer genomes, we conducted massively parallel sequencing of 81 HBV-positive and 7 HBV-negative hepatocellular carcinomas (HCCs) and adjacent normal tissues. We found that HBV integration is observed more frequently in the tumors (86.4%) than in adjacent liver tissues (30.7%). Copy-number variations (CNVs) were significantly increased at HBV breakpoint locations where chromosomal instability was likely induced. Approximately 40% of HBV breakpoints within the HBV genome were located within a 1,800-bp region where the viral enhancer, X gene and core gene are located. We also identified recurrent HBV integration events (in ≥ 4 HCCs) that were validated by RNA sequencing (RNA-seq) and Sanger sequencing at the known and putative cancer-related TERT, MLL4 and CCNE1 genes, which showed upregulated gene expression in tumor versus normal tissue. We also report evidence that suggests that the number of HBV integrations is associated with patient survival. 相似文献
10.
Barbieri CE Baca SC Lawrence MS Demichelis F Blattner M Theurillat JP White TA Stojanov P Van Allen E Stransky N Nickerson E Chae SS Boysen G Auclair D Onofrio RC Park K Kitabayashi N MacDonald TY Sheikh K Vuong T Guiducci C Cibulskis K Sivachenko A Carter SL Saksena G Voet D Hussain WM Ramos AH Winckler W Redman MC Ardlie K Tewari AK Mosquera JM Rupp N Wild PJ Moch H Morrissey C Nelson PS Kantoff PW Gabriel SB Golub TR Meyerson M Lander ES Getz G Rubin MA Garraway LA 《Nature genetics》2012,44(6):685-689
11.
Discovery and genotyping of genome structural polymorphism by sequencing on a population scale 总被引:1,自引:0,他引:1
Accurate and complete analysis of genome variation in large populations will be required to understand the role of genome variation in complex disease. We present an analytical framework for characterizing genome deletion polymorphism in populations using sequence data that are distributed across hundreds or thousands of genomes. Our approach uses population-level concepts to reinterpret the technical features of sequence data that often reflect structural variation. In the 1000 Genomes Project pilot, this approach identified deletion polymorphism across 168 genomes (sequenced at 4 × average coverage) with sensitivity and specificity unmatched by other algorithms. We also describe a way to determine the allelic state or genotype of each deletion polymorphism in each genome; the 1000 Genomes Project used this approach to type 13,826 deletion polymorphisms (48-995,664 bp) at high accuracy in populations. These methods offer a way to relate genome structural polymorphism to complex disease in populations. 相似文献
12.
Fusion genes and rearranged genes as a linear function of chromosome aberrations in cancer 总被引:13,自引:0,他引:13
Cytogenetic aberrations have been reported in 45,000 human neoplasms. Structural balanced rearrangements are associated with distinct tumor subtypes with remarkable specificity and have been essential for identifying genes involved in tumorigenesis. All balanced rearrangements that have been characterized molecularly act by deregulating a gene in one of the breakpoints or by creating a fusion gene. Because most recurrent aberrations and rearranged genes have been found in hematological disorders, whereas numerous genomic imbalances have been identified in solid tumors, it has become generally accepted that there are pathogenetic differences between these neoplasms. We here show that in every tumor type, the numbers of recurrent balanced chromosome abnormalities, fusion genes and genes rearranged as a consequence of balanced aberrations are simply a function of the number of cases with an abnormal karyotype. Hence, there may not be any fundamental tissue-specific differences in the genetic mechanisms by which neoplasia is initiated. 相似文献
13.
A rare penetrant mutation in CFH confers high risk of age-related macular degeneration 总被引:1,自引:0,他引:1
Raychaudhuri S Iartchouk O Chin K Tan PL Tai AK Ripke S Gowrisankar S Vemuri S Montgomery K Yu Y Reynolds R Zack DJ Campochiaro B Campochiaro P Katsanis N Daly MJ Seddon JM 《Nature genetics》2011,43(12):1232-1236
14.
Chiang C Jacobsen JC Ernst C Hanscom C Heilbut A Blumenthal I Mills RE Kirby A Lindgren AM Rudiger SR McLaughlan CJ Bawden CS Reid SJ Faull RL Snell RG Hall IM Shen Y Ohsumi TK Borowsky ML Daly MJ Lee C Morton CC MacDonald ME Gusella JF Talkowski ME 《Nature genetics》2012,44(4):390-7, S1
We defined the genetic landscape of balanced chromosomal rearrangements at nucleotide resolution by sequencing 141 breakpoints from cytogenetically interpreted translocations and inversions. We confirm that the recently described phenomenon of 'chromothripsis' (massive chromosomal shattering and reorganization) is not unique to cancer cells but also occurs in the germline, where it can resolve to a relatively balanced state with frequent inversions. We detected a high incidence of complex rearrangements (19.2%) and substantially less reliance on microhomology (31%) than previously observed in benign copy-number variants (CNVs). We compared these results to experimentally generated DNA breakage-repair by sequencing seven transgenic animals, revealing extensive rearrangement of the transgene and host genome with similar complexity to human germline alterations. Inversion was the most common rearrangement, suggesting that a combined mechanism involving template switching and non-homologous repair mediates the formation of balanced complex rearrangements that are viable, stably replicated and transmitted unaltered to subsequent generations. 相似文献
15.
Li Y Vinckenbosch N Tian G Huerta-Sanchez E Jiang T Jiang H Albrechtsen A Andersen G Cao H Korneliussen T Grarup N Guo Y Hellman I Jin X Li Q Liu J Liu X Sparsø T Tang M Wu H Wu R Yu C Zheng H Astrup A Bolund L Holmkvist J Jørgensen T Kristiansen K Schmitz O Schwartz TW Zhang X Li R Yang H Wang J Hansen T Pedersen O Nielsen R Wang J 《Nature genetics》2010,42(11):969-972
Targeted capture combined with massively parallel exome sequencing is a promising approach to identify genetic variants implicated in human traits. We report exome sequencing of 200 individuals from Denmark with targeted capture of 18,654 coding genes and sequence coverage of each individual exome at an average depth of 12-fold. On average, about 95% of the target regions were covered by at least one read. We identified 121,870 SNPs in the sample population, including 53,081 coding SNPs (cSNPs). Using a statistical method for SNP calling and an estimation of allelic frequencies based on our population data, we derived the allele frequency spectrum of cSNPs with a minor allele frequency greater than 0.02. We identified a 1.8-fold excess of deleterious, non-syonomyous cSNPs over synonymous cSNPs in the low-frequency range (minor allele frequencies between 2% and 5%). This excess was more pronounced for X-linked SNPs, suggesting that deleterious substitutions are primarily recessive. 相似文献
16.
17.
Ford CB Lin PL Chase MR Shah RR Iartchouk O Galagan J Mohaideen N Ioerger TR Sacchettini JC Lipsitch M Flynn JL Fortune SM 《Nature genetics》2011,43(5):482-486
Tuberculosis poses a global health emergency, which has been compounded by the emergence of drug-resistant Mycobacterium tuberculosis (Mtb) strains. We used whole-genome sequencing to compare the accumulation of mutations in Mtb isolated from cynomolgus macaques with active, latent or reactivated disease. We sequenced 33 Mtb isolates from nine macaques with an average genome coverage of 93% and an average read depth of 117×. Based on the distribution of SNPs observed, we calculated the mutation rates for these disease states. We found a similar mutation rate during latency as during active disease or in a logarithmically growing culture over the same period of time. The pattern of polymorphisms suggests that the mutational burden in vivo is because of oxidative DNA damage. We show that Mtb continues to acquire mutations during disease latency, which may explain why isoniazid monotherapy for latent tuberculosis is a risk factor for the emergence of isoniazid resistance. 相似文献
18.
Peña-Llopis S Vega-Rubín-de-Celis S Liao A Leng N Pavía-Jiménez A Wang S Yamasaki T Zhrebker L Sivanand S Spence P Kinch L Hambuch T Jain S Lotan Y Margulis V Sagalowsky AI Summerour PB Kabbani W Wong SW Grishin N Laurent M Xie XJ Haudenschild CD Ross MT Bentley DR Kapur P Brugarolas J 《Nature genetics》2012,44(7):751-759
The molecular pathogenesis of renal cell carcinoma (RCC) is poorly understood. Whole-genome and exome sequencing followed by innovative tumorgraft analyses (to accurately determine mutant allele ratios) identified several putative two-hit tumor suppressor genes, including BAP1. The BAP1 protein, a nuclear deubiquitinase, is inactivated in 15% of clear cell RCCs. BAP1 cofractionates with and binds to HCF-1 in tumorgrafts. Mutations disrupting the HCF-1 binding motif impair BAP1-mediated suppression of cell proliferation but not deubiquitination of monoubiquitinated histone 2A lysine 119 (H2AK119ub1). BAP1 loss sensitizes RCC cells in vitro to genotoxic stress. Notably, mutations in BAP1 and PBRM1 anticorrelate in tumors (P = 3 × 10(-5)), and combined loss of BAP1 and PBRM1 in a few RCCs was associated with rhabdoid features (q = 0.0007). BAP1 and PBRM1 regulate seemingly different gene expression programs, and BAP1 loss was associated with high tumor grade (q = 0.0005). Our results establish the foundation for an integrated pathological and molecular genetic classification of RCC, paving the way for subtype-specific treatments exploiting genetic vulnerabilities. 相似文献
19.
Resequencing of 31 wild and cultivated soybean genomes identifies patterns of genetic diversity and selection 总被引:9,自引:0,他引:9
Lam HM Xu X Liu X Chen W Yang G Wong FL Li MW He W Qin N Wang B Li J Jian M Wang J Shao G Wang J Sun SS Zhang G 《Nature genetics》2010,42(12):1053-1059
We report a large-scale analysis of the patterns of genome-wide genetic variation in soybeans. We re-sequenced a total of 17 wild and 14 cultivated soybean genomes to an average of approximately ×5 depth and >90% coverage using the Illumina Genome Analyzer II platform. We compared the patterns of genetic variation between wild and cultivated soybeans and identified higher allelic diversity in wild soybeans. We identified a high level of linkage disequilibrium in the soybean genome, suggesting that marker-assisted breeding of soybean will be less challenging than map-based cloning. We report linkage disequilibrium block location and distribution, and we identified a set of 205,614 tag SNPs that may be useful for QTL mapping and association studies. The data here provide a valuable resource for the analysis of wild soybeans and to facilitate future breeding and quantitative trait analysis. 相似文献
20.
Cooper GM Coe BP Girirajan S Rosenfeld JA Vu TH Baker C Williams C Stalker H Hamid R Hannig V Abdel-Hamid H Bader P McCracken E Niyazov D Leppig K Thiese H Hummel M Alexander N Gorski J Kussmann J Shashi V Johnson K Rehder C Ballif BC Shaffer LG Eichler EE 《Nature genetics》2011,43(9):838-846
To understand the genetic heterogeneity underlying developmental delay, we compared copy number variants (CNVs) in 15,767 children with intellectual disability and various congenital defects (cases) to CNVs in 8,329 unaffected adult controls. We estimate that ~14.2% of disease in these children is caused by CNVs >400 kb. We observed a greater enrichment of CNVs in individuals with craniofacial anomalies and cardiovascular defects compared to those with epilepsy or autism. We identified 59 pathogenic CNVs, including 14 new or previously weakly supported candidates, refined the critical interval for several genomic disorders, such as the 17q21.31 microdeletion syndrome, and identified 940 candidate dosage-sensitive genes. We also developed methods to opportunistically discover small, disruptive CNVs within the large and growing diagnostic array datasets. This evolving CNV morbidity map, combined with exome and genome sequencing, will be critical for deciphering the genetic basis of developmental delay, intellectual disability and autism spectrum disorders. 相似文献