Found 20 similar documents (search time: 15 ms)
1.
Characterization of single-nucleotide polymorphisms in coding regions of human genes. (Total citations: 46; self-citations: 0; citations by others: 46)
M Cargill, D Altshuler, J Ireland, P Sklar, K Ardlie, N Patil, N Shaw, C R Lane, E P Lim, N Kalyanaraman, J Nemesh, L Ziaugra, L Friedland, A Rolfe, J Warrington, R Lipshutz, G Q Daley, E S Lander. Nature Genetics 1999, 22(3):231-238
A major goal in human genetics is to understand the role of common genetic variants in susceptibility to common diseases. This will require characterizing the nature of gene variation in human populations, assembling an extensive catalogue of single-nucleotide polymorphisms (SNPs) in candidate genes and performing association studies for particular diseases. At present, our knowledge of human gene variation remains rudimentary. Here we describe a systematic survey of SNPs in the coding regions of human genes. We identified SNPs in 106 genes relevant to cardiovascular disease, endocrinology and neuropsychiatry by screening an average of 114 independent alleles using 2 independent screening methods. To ensure high accuracy, all reported SNPs were confirmed by DNA sequencing. We identified 560 SNPs, including 392 coding-region SNPs (cSNPs) divided roughly equally between those causing synonymous and non-synonymous changes. We observed different rates of polymorphism among classes of sites within genes (non-coding, degenerate and non-degenerate) as well as between genes. The cSNPs most likely to influence disease, those that alter the amino acid sequence of the encoded protein, are found at a lower rate and with lower allele frequencies than silent substitutions. This likely reflects selection acting against deleterious alleles during human evolution. The lower allele frequency of missense cSNPs has implications for the compilation of a comprehensive catalogue, as well as for the subsequent application to disease association.
2.
Determination of ancestral alleles for human single-nucleotide polymorphisms using high-density oligonucleotide arrays. (Total citations: 25; self-citations: 0; citations by others: 25)
J G Hacia, J B Fan, O Ryder, L Jin, K Edgemon, G Ghandour, R A Mayer, B Sun, L Hsie, C M Robbins, L C Brody, D Wang, E S Lander, R Lipshutz, S P Fodor, F S Collins. Nature Genetics 1999, 22(2):164-167
Here we report the application of high-density oligonucleotide array (DNA chip)-based analysis to determine the distant history of single nucleotide polymorphisms (SNPs) in current human populations. We analysed orthologues for 397 human SNP sites (identified in CEPH pedigrees from Amish, Venezuelan and Utah populations) from 23 common chimpanzee, 19 pygmy chimpanzee and 11 gorilla genomic DNA samples. From this data we determined 214 proposed ancestral alleles (the sequence found in the last common ancestor of humans and chimpanzees). In a diverse human population set, we found that SNP alleles with higher frequencies were more likely to be ancestral than less frequently occurring alleles. There were, however, exceptions. We also found three shared human/pygmy chimpanzee polymorphisms, all involving CpG dinucleotides, and two shared human/gorilla polymorphisms, one involving a CpG dinucleotide. We demonstrate that microarray-based assays allow rapid comparative sequence analysis of intra- and interspecies genetic variation.
3.
Genome-wide analysis of single-nucleotide polymorphisms in human expressed sequences (Total citations: 16; self-citations: 0; citations by others: 16)
Single-nucleotide polymorphisms (SNPs) have been explored as a high-resolution marker set for accelerating the mapping of disease genes. Here we report 48,196 candidate SNPs detected by statistical analysis of human expressed sequence tags (ESTs), associated primarily with coding regions of genes. We used Bayesian inference to weigh evidence for true polymorphism versus sequencing error, misalignment or ambiguity, misclustering or chimaeric EST sequences, assessing data such as raw chromatogram height, sharpness, overlap and spacing, sequencing error rates, context-sensitivity and cDNA library origin. Three separate validations (comparison with 54 genes screened for SNPs independently, verification of HLA-A polymorphisms, and restriction fragment length polymorphism (RFLP) testing) verified 70%, 89% and 71% of our predicted SNPs, respectively. Our method detects tenfold more true HLA-A SNPs than previous analyses of the EST data. We found SNPs in a large fraction of known disease genes, including some disease-causing mutations (for example, the HbS sickle-cell mutation). Our comprehensive analysis of human coding region polymorphism provides a public resource for mapping of disease genes (available at http://www.bioinformatics.ucla.edu/snp).
4.
Large-scale discovery and genotyping of single-nucleotide polymorphisms in the mouse (Total citations: 24; self-citations: 0; citations by others: 24)
Lindblad-Toh K, Winchester E, Daly MJ, Wang DG, Hirschhorn JN, Laviolette JP, Ardlie K, Reich DE, Robinson E, Sklar P, Shah N, Thomas D, Fan JB, Gingeras T, Warrington J, Patil N, Hudson TJ, Lander ES. Nature Genetics 2000, 24(4):381-386
Single-nucleotide polymorphisms (SNPs) have been the focus of much attention in human genetics because they are extremely abundant and well-suited for automated large-scale genotyping. Human SNPs, however, are less informative than other types of genetic markers (such as simple-sequence length polymorphisms or microsatellites) and thus more loci are required for mapping traits. SNPs offer similar advantages for experimental genetic organisms such as the mouse, but they entail no loss of informativeness because bi-allelic markers are fully informative in analysing crosses between inbred strains. Here we report a large-scale analysis of SNPs in the mouse genome. We characterized the rate of nucleotide polymorphism in eight mouse strains and identified a collection of 2,848 SNPs located in 1,755 sequence-tagged sites (STSs) using high-density oligonucleotide arrays. Three-quarters of these SNPs have been mapped on the mouse genome, providing a first-generation SNP map of the mouse. We have also developed a multiplex genotyping procedure by which a genome scan can be performed with only six genotyping reactions per animal.
5.
Substantial efforts are focused on identifying single-nucleotide polymorphisms (SNPs) throughout the human genome, particularly in coding regions (cSNPs), for both linkage disequilibrium and association studies. Less attention, however, has been directed to the clarification of evolutionary processes that are responsible for the variability in nucleotide diversity among different regions of the genome. We report here the population sequence diversity of genomic segments within a 450-kb cluster of olfactory receptor (OR) genes on human chromosome 17. We found a dichotomy in the pattern of nucleotide diversity between OR pseudogenes and introns on the one hand and the closely interspersed intact genes on the other. We suggest that weak positive selection is responsible for the observed patterns of genetic variation. This is inferred from a lower ratio of polymorphism to divergence in genes compared with pseudogenes or introns, high non-synonymous substitution rates in OR genes, and a small but significant overall reduction in variability in the entire OR gene cluster compared with other genomic regions. The dichotomy among functionally different segments within a short genomic distance requires high recombination rates within this OR cluster. Our work demonstrates the impact of weak positive selection on human nucleotide diversity, and has implications for the evolution of the olfactory repertoire.
6.
Homology-based annotation yields 1,042 new candidate genes in the Drosophila melanogaster genome (Total citations: 6; self-citations: 0; citations by others: 6)
Gopal S, Schroeder M, Pieper U, Sczyrba A, Aytekin-Kurban G, Bekiranov S, Fajardo JE, Eswar N, Sanchez R, Sali A, Gaasterland T. Nature Genetics 2001, 27(3):337-340
The approach to annotating a genome critically affects the number and accuracy of genes identified in the genome sequence. Genome annotation based on stringent gene identification is prone to underestimate the complement of genes encoded in a genome. In contrast, over-prediction of putative genes followed by exhaustive computational sequence, motif and structural homology search will find rarely expressed, possibly unique, new genes at the risk of including non-functional genes. We developed a two-stage approach that combines the merits of stringent genome annotation with the benefits of over-prediction. First we identify plausible genes regardless of matches with EST, cDNA or protein sequences from the organism (stage 1). In the second stage, proteins predicted from the plausible genes are compared at the protein level with EST, cDNA and protein sequences, and protein structures from other organisms (stage 2). Remote but biologically meaningful protein sequence or structure homologies provide supporting evidence for genuine genes. The method, applied to the Drosophila melanogaster genome, validated 1,042 novel candidate genes after filtering 19,410 plausible genes, of which 12,124 matched the original 13,601 annotated genes. This annotation strategy is applicable to genomes of all organisms, including human.
7.
W R McCombie, A Martin-Gallardo, J D Gocayne, M FitzGerald, M Dubnick, J M Kelley, L Castilla, L I Liu, S Wallace, S Trapp. Nature Genetics 1992, 1(5):348-353
The sequences of three cosmids (90 kilobases) from the Huntington's disease region in chromosome 4p16.3 have been determined. A 30,837 base overlap of DNA sequenced from two individuals was found to contain 72 DNA sequence polymorphisms, an average of 2.3 polymorphisms per kilobase (kb). The assembled 58 kb contig contains 62 Alu repeats, and eleven predicted exons representing at least three expressed genes that encode previously unidentified proteins. Each of these genes is associated with a CpG island. The structure of one of the new genes, hda1-1, has been determined by characterizing cDNAs from a placental library. This gene is expressed in a variety of tissues and may encode a novel housekeeping gene.
8.
Isolation and characterization of a candidate gene for Norrie disease. (Total citations: 11; self-citations: 0; citations by others: 11)
Z Y Chen, R W Hendriks, M A Jobling, J F Powell, X O Breakefield, K B Sims, I W Craig. Nature Genetics 1992, 1(3):204-208
Previous analysis has refined the location of the gene for Norrie disease, a severe, X-linked, recessive neurodevelopmental disorder, to a yeast artificial chromosome subfragment of 160 kilobases (kb). This fragment was used to screen cDNA libraries from human fetal and adult retina. As a result, we have identified an evolutionarily conserved cDNA, which is expressed in fetal and adult brain and encodes a predicted protein of 133 amino acids. The cDNA detects genomic sequences which span a maximum of 50 kb, and which are partly deleted in several typical Norrie disease patients. An EcoRI polymorphism with a calculated heterozygosity value of 43% was observed. The locus identified is a strong candidate for the Norrie disease gene.
9.
Isolation of a candidate gene for Norrie disease by positional cloning. (Total citations: 17; self-citations: 0; citations by others: 17)
W Berger, A Meindl, T J van de Pol, F P Cremers, H H Ropers, C Döerner, A Monaco, A A Bergen, R Lebo, M Warburg. Nature Genetics 1992, 1(3):199-203
The gene for Norrie disease, an X-linked disorder characterized by progressive atrophy of the eyes, mental disturbances and deafness, has been mapped to chromosome Xp11.4 close to DXS7 and the monoamine oxidase (MAO) genes. By subcloning a YAC with a 640 kilobase (kb) insert which spans the DXS7-MAOB interval we have generated a cosmid contig which extends 250 kb beyond the MAOB gene. With one of these cosmids, microdeletions were detected in several patients with Norrie disease. Screening of cDNA libraries has enabled us to isolate and sequence a likely candidate gene for Norrie disease which is expressed in retina, choroid and fetal brain. No homologous sequences were found in DNA and protein databases indicating that this cDNA is part of a gene encoding a 'pioneer' protein.
10.
A candidate mouse model for Prader-Willi syndrome which shows an absence of Snrpn expression. (Total citations: 22; self-citations: 0; citations by others: 22)
B M Cattanach, J A Barr, E P Evans, M Burtenshaw, C V Beechey, S E Leff, C I Brannan, N G Copeland, N A Jenkins, J Jones. Nature Genetics 1992, 2(4):270-274
The best examples of imprinting in humans are provided by the Angelman and Prader-Willi syndromes (AS and PWS) which are associated with maternal and paternal 15q11-13 deletions, respectively, and also with paternal and maternal disomy 15. The region of the deletions has homology with a central part of mouse chromosome 7, incompletely tested for imprinting effects. Here, we report that maternal duplication for this region causes a murine imprinting effect which may correspond to PWS. Paternal duplication was not associated with any detectable effect that might correspond with AS. Gene expression studies established that Snrpn is not expressed in mice with the maternal duplication and suggest that the closely-linked Gabrb-3 locus is not subject to imprinting. Finally, an additional new imprinting effect is described.
11.
12.
Dwarf8 polymorphisms associate with variation in flowering time. (Total citations: 49; self-citations: 0; citations by others: 49)
J M Thornsberry, M M Goodman, J Doebley, S Kresovich, D Nielsen, E S Buckler. Nature Genetics 2001, 28(3):286-289
Historically, association tests have been used extensively in medical genetics, but have had virtually no application in plant genetics. One obstacle to their application is the structured populations often found in crop plants, which may lead to nonfunctional, spurious associations. In this study, statistical methods to account for population structure were extended for use with quantitative variation and applied to our evaluation of maize flowering time. Mutagenesis and quantitative trait locus (QTL) studies suggested that the maize gene Dwarf8 might affect the quantitative variation of maize flowering time and plant height. The wheat orthologs of this gene contributed to the increased yields seen in the 'Green Revolution' varieties. We used association approaches to evaluate Dwarf8 sequence polymorphisms from 92 maize inbred lines. Population structure was estimated using a Bayesian analysis of 141 simple sequence repeat (SSR) loci. Our results indicate that a suite of polymorphisms associate with differences in flowering time, which include a deletion that may alter a key domain in the coding region. The distribution of nonsynonymous polymorphisms suggests that Dwarf8 has been a target of selection.
13.
Prospects for whole-genome linkage disequilibrium mapping of common disease genes. (Total citations: 54; self-citations: 0; citations by others: 54)
L Kruglyak. Nature Genetics 1999, 22(2):139-144
Recently, attention has focused on the use of whole-genome linkage disequilibrium (LD) studies to map common disease genes. Such studies would employ a dense map of single nucleotide polymorphisms (SNPs) to detect association between a marker and disease. Construction of SNP maps is currently underway. An essential issue yet to be settled is the required marker density of such maps. Here, I use population simulations to estimate the extent of LD surrounding common gene variants in the general human population as well as in isolated populations. Two main conclusions emerge from these investigations. First, a useful level of LD is unlikely to extend beyond an average distance of roughly 3 kb in the general population, which implies that approximately 500,000 SNPs will be required for whole-genome studies. Second, the extent of LD is similar in isolated populations unless the founding bottleneck is very narrow or the frequency of the variant is low (<5%).
14.
15.
P I Patel, B B Roa, A A Welcher, R Schoener-Scott, B J Trask, L Pentao, G J Snipes, C A Garcia, U Francke, E M Shooter, J R Lupski, U Suter. Nature Genetics 1992, 1(3):159-165
Charcot-Marie-Tooth disease type 1A (CMT1A) is an autosomal dominant peripheral neuropathy associated with a large DNA duplication on the short arm of human chromosome 17. The trembler (Tr) mouse serves as a model for CMT1A because of phenotypic similarities and because the Tr locus maps to mouse chromosome 11 in a region of conserved synteny with human chromosome 17. Recently, the peripheral myelin gene Pmp-22 was found to carry a point mutation in Tr mice. We have isolated cDNA and genomic clones for human PMP-22. The gene maps to human chromosome 17p11.2-17p12, is expressed at high levels in peripheral nervous tissue and is duplicated, but not disrupted, in CMT1A patients. Thus, we suggest that a gene dosage effect involving PMP-22 is at least partially responsible for the demyelinating neuropathy seen in CMT1A.
16.
A survey of expressed genes in Caenorhabditis elegans. (Total citations: 29; self-citations: 0; citations by others: 29)
R Waterston, C Martin, M Craxton, C Huynh, A Coulson, L Hillier, R Durbin, P Green, R Shownkeen, N Halloran. Nature Genetics 1992, 1(2):114-123
As an adjunct to the genomic sequencing of Caenorhabditis elegans, we have investigated a representative cDNA library of 1,517 clones. A single sequence read has been obtained from the 5' end of each clone, allowing its characterization with respect to the public databases, and the clones are being localized on the genome map. The result is the identification of about 1,200 of the estimated 15,000 genes of C. elegans. More than 30% of the inferred protein sequences have significant similarity to existing sequences in the databases, providing a route towards in vivo analysis of known genes in the nematode. These clones also provide material for assessing the accuracy of predicted exons and splicing patterns and will lead to a more accurate estimate of the total number of genes in the organism than has hitherto been available.
17.
Population choice in mapping genes for complex diseases (Total citations: 24; self-citations: 0; citations by others: 24)
The difficulty of identifying susceptibility genes for common diseases has polarized geneticists' views on what disease models are appropriate and how best to proceed once high-density genome maps become available. Different disease models have different implications for using linkage or linkage-disequilibrium-based approaches for mapping complex disease genes. We argue that the choice of study population is a critical factor when designing a study, and that genetically simplified isolates are more useful than diverse continental populations under most assumptions.
18.
19.
Selection for short introns in highly expressed genes (Total citations: 1; self-citations: 0; citations by others: 1)
20.
Systematic screen for human disease genes in yeast (Total citations: 19; self-citations: 0; citations by others: 19)
Steinmetz LM, Scharfe C, Deutschbauer AM, Mokranjac D, Herman ZS, Jones T, Chu AM, Giaever G, Prokisch H, Oefner PJ, Davis RW. Nature Genetics 2002, 31(4):400-404
High similarity between yeast and human mitochondria allows functional genomic study of Saccharomyces cerevisiae to be used to identify human genes involved in disease. So far, 102 heritable disorders have been attributed to defects in a quarter of the known nuclear-encoded mitochondrial proteins in humans. Many mitochondrial diseases remain unexplained, however, in part because only 40-60% of the presumed 700-1,000 proteins involved in mitochondrial function and biogenesis have been identified. Here we apply a systematic functional screen using the pre-existing whole-genome pool of yeast deletion mutants to identify mitochondrial proteins. Three million measurements of strain fitness identified 466 genes whose deletions impaired mitochondrial respiration, of which 265 were new. Our approach gave higher selection than other systematic approaches, including fivefold greater selection than gene expression analysis. To apply these advantages to human disorders involving mitochondria, human orthologs were identified and linked to heritable diseases using genomic map positions.