Similar literature
Found 20 similar articles (search time: 15 ms)
1.
Single pass sequencing and physical and genetic mapping of human brain cDNAs.   Total citations: 16 (self-citations: 0; citations by others: 16)
We have performed single pass sequencing of 1,024 human brain cDNAs, over 900 of which seem to represent new human genes. Library prescreening with total brain cDNA significantly reduced repeated sequencing of highly represented cDNAs. A subset of sequenced cDNAs was physically mapped to chromosomal locations using gene-specific STS primers derived from 3' untranslated regions. We have also determined that human brain cDNAs represent a rich source of gene-associated polymorphic markers. Microsatellite-containing cDNAs can be physically mapped and converted to highly informative genetic markers, thus facilitating integration of the human physical, expression and genetic maps.

2.
3.
4.
Enzymatic production of RNAi libraries from cDNAs   Total citations: 30 (self-citations: 0; citations by others: 30)
RNA interference (RNAi) induced by small interfering RNA (siRNA) or short hairpin RNA (shRNA) is an important research approach in mammalian genetics. Here we describe a technology called enzymatic production of RNAi library (EPRIL) by which cDNAs are converted by a sequence of enzymatic treatments into an RNAi library consisting of a vast array of different shRNA expression constructs. We applied EPRIL to a single cDNA source and prepared an RNAi library consisting of shRNA constructs with various RNAi efficiencies. High-throughput screening allowed us to rapidly identify the best shRNA constructs from the library. We also describe a new selection scheme using the thymidine kinase gene for obtaining efficient shRNA constructs. Furthermore, we show that EPRIL can be applied to constructing an RNAi library from a cDNA library, providing a basis for future whole-genome phenotypic screening of genes.

5.
6.
A total of 116,118 basepairs (bp) derived from three cosmids spanning the ERCC1 locus of human chromosome 19q13.3 have been sequenced with automated fluorescence-based sequencers and analysed by polymerase chain reaction amplification and computer methods. The assembled sequence forms two contigs totalling 105,831 bp, which contain a human fosB proto-oncogene, a gene encoding a protein phosphatase, two genes of unknown function and the previously-characterized ERCC1 DNA repair gene. This light band region has a high average density of 1.4 Alu repeats per kilobase. Human chromosome light bands could therefore contain up to 75,000 genes and 1.5 million Alu repeats.

7.
Chen T, Hevi S, Gay F, Tsujimoto N, He T, Zhang B, Ueda Y, Li E. Nature Genetics 2007, 39(3): 391-396
Studies have shown that DNA (cytosine-5-)-methyltransferase 1 (DNMT1) is the principal enzyme responsible for maintaining CpG methylation and is required for embryonic development and survival of somatic cells in mice. The role of DNMT1 in human cancer cells, however, remains highly controversial. Using homologous recombination, here we have generated a DNMT1 conditional allele in the human colorectal carcinoma cell line HCT116 in which several exons encoding the catalytic domain are flanked by loxP sites. Cre recombinase-mediated disruption of this allele results in hemimethylation of approximately 20% of CpG-CpG dyads in the genome, coupled with activation of the G2/M checkpoint, leading to arrest in the G2 phase of the cell cycle. Although cells gradually escape from this arrest, they show severe mitotic defects and undergo cell death either during mitosis or after arresting in a tetraploid G1 state. Our results thus show that DNMT1 is required for faithfully maintaining DNA methylation patterns in human cancer cells and is essential for their proliferation and survival.

8.
9.
We report the analysis of a Japanese male using high-throughput sequencing to 40× coverage. More than 99% of the sequence reads were mapped to the reference human genome. Using a Bayesian decision method, we identified 3,132,608 single nucleotide variations (SNVs). Comparison with six previously reported genomes revealed an excess of singleton nonsense and nonsynonymous SNVs, as well as singleton SNVs in conserved non-coding regions. We also identified 5,319 deletions smaller than 10 kb with high accuracy, in addition to copy number variations and rearrangements. De novo assembly of the unmapped sequence reads generated around 3 Mb of novel sequence, which showed high similarity to non-reference human genomes and the human herpesvirus 4 genome. Our analysis suggests that considerable variation remains undiscovered in the human genome and that whole-genome sequencing is an invaluable tool for obtaining a complete understanding of human genetic variation.

10.
11.
Opisthorchis viverrini-related cholangiocarcinoma (CCA), a fatal bile duct cancer, is a major public health concern in areas endemic for this parasite. We report here whole-exome sequencing of eight O. viverrini-related tumors and matched normal tissue. We identified and validated 206 somatic mutations in 187 genes using Sanger sequencing and selected 15 genes for mutation prevalence screening in an additional 46 individuals with CCA (cases). In addition to the known cancer-related genes TP53 (mutated in 44.4% of cases), KRAS (16.7%) and SMAD4 (16.7%), we identified somatic mutations in 10 newly implicated genes in 14.8-3.7% of cases. These included inactivating mutations in MLL3 (in 14.8% of cases), ROBO2 (9.3%), RNF43 (9.3%) and PEG3 (5.6%), and activating mutations in the GNAS oncogene (9.3%). These genes have functions that can be broadly grouped into three biological classes: (i) deactivation of histone modifiers, (ii) activation of G protein signaling and (iii) loss of genome stability. This study provides insight into the mutational landscape contributing to O. viverrini-related CCA.

12.
The plant Arabidopsis thaliana occurs naturally in many different habitats throughout Eurasia. As a foundation for identifying genetic variation contributing to adaptation to diverse environments, a 1001 Genomes Project to sequence geographically diverse A. thaliana strains has been initiated. Here we present the first phase of this project, based on population-scale sequencing of 80 strains drawn from eight regions throughout the species' native range. We describe the majority of common small-scale polymorphisms as well as many larger insertions and deletions in the A. thaliana pan-genome, their effects on gene function, and the patterns of local and global linkage among these variants. The action of processes other than spontaneous mutation is identified by comparing the spectrum of mutations that have accumulated since A. thaliana diverged from its closest relative 10 million years ago with the spectrum observed in the laboratory. Recent species-wide selective sweeps are rare, and potentially deleterious mutations are more common in marginal populations.

13.
The ratio of genetic diversity on chromosome X to that on the autosomes is sensitive to both natural selection and demography. On the basis of whole-genome sequences of 69 females, we report that whereas this ratio increases with genetic distance from genes across populations, it is lower in Europeans than in West Africans independent of proximity to genes. This relative reduction is most parsimoniously explained by differences in demographic history without the need to invoke natural selection.

14.
Genome-wide association studies (GWAS) have proven to be a powerful method to identify common genetic variants contributing to susceptibility to common diseases. Here, we show that extremely low-coverage sequencing (0.1-0.5×) captures almost as much of the common (>5%) and low-frequency (1-5%) variation across the genome as SNP arrays. As an empirical demonstration, we show that genome-wide SNP genotypes can be inferred at a mean r² of 0.71 using off-target data (0.24× average coverage) in a whole-exome study of 909 samples. Using both simulated and real exome-sequencing data sets, we show that association statistics obtained using extremely low-coverage sequencing data attain similar P values at known associated variants as data from genotyping arrays, without an excess of false positives. Within the context of reductions in sample preparation and sequencing costs, funds invested in extremely low-coverage sequencing can yield several times the effective sample size of GWAS based on SNP array data and a commensurate increase in statistical power.

15.
Accurate and complete analysis of genome variation in large populations will be required to understand the role of genome variation in complex disease. We present an analytical framework for characterizing genome deletion polymorphism in populations using sequence data that are distributed across hundreds or thousands of genomes. Our approach uses population-level concepts to reinterpret the technical features of sequence data that often reflect structural variation. In the 1000 Genomes Project pilot, this approach identified deletion polymorphism across 168 genomes (sequenced at 4× average coverage) with sensitivity and specificity unmatched by other algorithms. We also describe a way to determine the allelic state or genotype of each deletion polymorphism in each genome; the 1000 Genomes Project used this approach to type 13,826 deletion polymorphisms (48-995,664 bp) at high accuracy in populations. These methods offer a way to relate genome structural polymorphism to complex disease in populations.

16.
Recent advances in sequencing technology make it possible to comprehensively catalog genetic variation in population samples, creating a foundation for understanding human disease, ancestry and evolution. The amounts of raw data produced are prodigious, and many computational steps are required to translate this output into high-quality variant calls. We present a unified analytic framework to discover and genotype variation among multiple samples simultaneously that achieves sensitive and specific results across five sequencing technologies and three distinct, canonical experimental designs. Our process includes (i) initial read mapping; (ii) local realignment around indels; (iii) base quality score recalibration; (iv) SNP discovery and genotyping to find all potential variants; and (v) machine learning to separate true segregating variation from machine artifacts common to next-generation sequencing technologies. We here discuss the application of these tools, instantiated in the Genome Analysis Toolkit, to deep whole-genome, whole-exome capture and multi-sample low-pass (~4×) 1000 Genomes Project datasets.

17.
RNA sequencing shows no dosage compensation of the active X-chromosome   Total citations: 1 (self-citations: 0; citations by others: 1)
Xiong Y, Chen X, Chen Z, Wang X, Shi S, Wang X, Zhang J, He X. Nature Genetics 2010, 42(12): 1043-1047
Mammalian cells from both sexes typically contain one active X chromosome but two sets of autosomes. It has previously been hypothesized that X-linked genes are expressed at twice the level of autosomal genes per active allele to balance the gene dose between the X chromosome and autosomes (termed 'Ohno's hypothesis'). This hypothesis was supported by the observation that microarray-based gene expression levels were indistinguishable between one X chromosome and two autosomes (the X to two autosomes ratio (X:AA) ~1). Here we show that RNA sequencing (RNA-Seq) is more sensitive than microarray and that RNA-Seq data reveal an X:AA ratio of ~0.5 in human and mouse. In Caenorhabditis elegans hermaphrodites, the X:AA ratio reduces progressively from ~1 in larvae to ~0.5 in adults. Proteomic data are consistent with the RNA-Seq results and further suggest the lack of X upregulation at the protein level. Together, our findings reject Ohno's hypothesis, necessitating a major revision of the current model of dosage compensation in the evolution of sex chromosomes.
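As a rough illustration of the X:AA ratio discussed in this abstract, the sketch below divides a summary expression level for X-linked genes by the same summary for autosomal genes. The use of the median, the function name `x_to_aa_ratio`, and the toy gene names and values are assumptions made for illustration only; the abstract does not state how the authors summarized expression.

```python
from statistics import median


def x_to_aa_ratio(expression, chromosome_of):
    """Return median X-linked expression divided by median autosomal expression.

    expression: dict mapping gene name -> expression level (hypothetical units)
    chromosome_of: dict mapping gene name -> chromosome label, e.g. "X" or "7"
    Assumption: every gene not on "X" is treated as autosomal in this toy example.
    """
    x_levels = [lvl for gene, lvl in expression.items() if chromosome_of[gene] == "X"]
    autosomal_levels = [lvl for gene, lvl in expression.items() if chromosome_of[gene] != "X"]
    if not x_levels or not autosomal_levels:
        raise ValueError("need at least one X-linked and one autosomal gene")
    return median(x_levels) / median(autosomal_levels)


# Made-up values chosen so that X-linked genes sit at half the autosomal level.
toy_expression = {
    "geneX1": 4.0, "geneX2": 5.0, "geneX3": 6.0,
    "geneA1": 8.0, "geneA2": 10.0, "geneA3": 12.0,
}
toy_chromosome = {
    "geneX1": "X", "geneX2": "X", "geneX3": "X",
    "geneA1": "1", "geneA2": "2", "geneA3": "3",
}

ratio = x_to_aa_ratio(toy_expression, toy_chromosome)
print(f"X:AA ratio = {ratio:.2f}")
```

A ratio near 1 would correspond to the doubling of X-linked expression that Ohno's hypothesis predicts, while a ratio near 0.5 corresponds to no such upregulation.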

18.
Plague is a pandemic human invasive disease caused by the bacterial agent Yersinia pestis. We here report a comparison of 17 whole genomes of Y. pestis isolates from global sources. We also screened a global collection of 286 Y. pestis isolates for 933 SNPs using Sequenom MassArray SNP typing. We conducted phylogenetic analyses on this sequence variation dataset, assigned isolates to populations based on maximum parsimony and, from these results, made inferences regarding historical transmission routes. Our phylogenetic analysis suggests that Y. pestis evolved in or near China and spread through multiple radiations to Europe, South America, Africa and Southeast Asia, leading to country-specific lineages that can be traced by lineage-specific SNPs. All 626 current isolates from the United States reflect one radiation, and 82 isolates from Madagascar represent a second radiation. Subsequent local microevolution of Y. pestis is marked by sequential, geographically specific SNPs.

19.
Isolates of Salmonella enterica serovar Typhi (Typhi), a human-restricted bacterial pathogen that causes typhoid, show limited genetic variation. We generated whole-genome sequences for 19 Typhi isolates using 454 (Roche) and Solexa (Illumina) technologies. Isolates, including the previously sequenced CT18 and Ty2 isolates, were selected to represent major nodes in the phylogenetic tree. Comparative analysis showed little evidence of purifying selection, antigenic variation or recombination between isolates. Rather, evolution in the Typhi population seems to be characterized by ongoing loss of gene function, consistent with a small effective population size. The lack of evidence for antigenic variation driven by immune selection is in contrast to strong adaptive selection for mutations conferring antibiotic resistance in Typhi. The observed patterns of genetic isolation and drift are consistent with the proposed key role of asymptomatic carriers of Typhi as the main reservoir of this pathogen, highlighting the need for identification and treatment of carriers.

20.
We used exome sequencing to identify the genetic basis of combined malonic and methylmalonic aciduria (CMAMMA). We sequenced the exome of an individual with CMAMMA and followed up with sequencing of eight additional affected individuals (cases). This included one individual who was identified and diagnosed by searching an exome database. We identify mutations in ACSF3, encoding a putative methylmalonyl-CoA and malonyl-CoA synthetase as a cause of CMAMMA. We also examined a canine model of CMAMMA, which showed pathogenic mutations in a predicted ACSF3 ortholog. ACSF3 mutant alleles occur with a minor allele frequency of 0.0058 in ~1,000 control individuals, predicting a CMAMMA population incidence of ~1:30,000. ACSF3 deficiency is the first human disorder identified as caused by mutations in a gene encoding a member of the acyl-CoA synthetase family, a diverse group of evolutionarily conserved proteins, and may emerge as one of the more common human metabolic disorders.
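The step from an allele frequency of 0.0058 to an incidence of roughly 1:30,000 can be reproduced with a short calculation. The sketch below assumes recessive inheritance and Hardy-Weinberg proportions, so that the expected fraction of affected individuals is the square of the combined mutant allele frequency; both assumptions, and the function name `recessive_incidence`, are illustrative and are not stated in the abstract itself.

```python
def recessive_incidence(allele_freq: float) -> float:
    """Expected fraction of individuals with two mutant alleles.

    Assumes Hardy-Weinberg proportions and recessive inheritance,
    so the result is simply q squared.
    """
    if not 0.0 <= allele_freq <= 1.0:
        raise ValueError("allele frequency must be between 0 and 1")
    return allele_freq ** 2


q = 0.0058  # combined mutant allele frequency reported in the abstract
incidence = recessive_incidence(q)
one_in = round(1 / incidence)
print(f"expected incidence: about 1 in {one_in:,}")
```

Under these assumptions the result lands just under 1 in 30,000, in line with the figure quoted in the abstract.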
