首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 375 毫秒
1.
Strategies for assembling large, complex genomes have evolved to include a combination of whole-genome shotgun sequencing and hierarchal map-assisted sequencing. Whole-genome maps of all types can aid genome assemblies, generally starting with low-resolution cytogenetic maps and ending with the highest resolution of sequence. Fingerprint clone maps are based upon complete restriction enzyme digests of clones representative of the target genome, and ultimately comprise a near-contiguous path of clones across the genome. Such clone-based maps are used to validate sequence assembly order, supply long-range linking information for assembled sequences, anchor sequences to the genetic map and provide templates for closing gaps. Fingerprint maps are also a critical resource for subsequent functional genomic studies, because they provide a redundant and ordered sampling of the genome with clones. In an accompanying paper we describe the draft genome sequence of the chicken, Gallus gallus, the first species sequenced that is both a model organism and a global food source. Here we present a clone-based physical map of the chicken genome at 20-fold coverage, containing 260 contigs of overlapping clones. This map represents approximately 91% of the chicken genome and enables identification of chicken clones aligned to positions in other sequenced genomes.  相似文献   

2.
Wang J  Wang W  Li R  Li Y  Tian G  Goodman L  Fan W  Zhang J  Li J  Zhang J  Guo Y  Feng B  Li H  Lu Y  Fang X  Liang H  Du Z  Li D  Zhao Y  Hu Y  Yang Z  Zheng H  Hellmann I  Inouye M  Pool J  Yi X  Zhao J  Duan J  Zhou Y  Qin J  Ma L  Li G  Yang Z  Zhang G  Yang B  Yu C  Liang F  Li W  Li S  Li D  Ni P  Ruan J  Li Q  Zhu H  Liu D  Lu Z  Li N  Guo G  Zhang J  Ye J  Fang L  Hao Q  Chen Q  Liang Y  Su Y  San A  Ping C  Yang S  Chen F  Li L  Zhou K  Zheng H  Ren Y  Yang L  Gao Y  Yang G  Li Z  Feng X  Kristiansen K  Wong GK  Nielsen R  Durbin R  Bolund L  Zhang X 《Nature》2008,456(7218):60-65
Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics.  相似文献   

3.
Amylose content in rice endosperm is one of the key determinants of rice eating and cooking quality, and the poor quality ofindica hybrid rice is closely related to the high amylose level in rice grains. In order to improve the grain quality of theindica hybrid rice by genetic engineering, an antisense fragment of ricewaxy gene, driven by the 5′-franking sequences of the ricewaxy gene, was successfully introduced into three major parent lines ofindica hybrid rice, all contain a high amylose level in the grains, viaAgrobacterium, and more than 100 hygromycinresistant plants were regenerated. The analysis of PCR amplification and Southern blots indicated that the T-DNA containing the antisensewaxy gene had been integrated into the genome of transgenic rice plants. Most of the primary transgenic rice plants grew normally, and the mature seeds from these transgenic plants were performed for analysis of the amylose content. The results showed that the amylose content in the endosperm of some grains was reduced and the lowest reached 7.02% in one homozygous transgenic line, 72.4% lower than that of the wild type. The influence of the altered amylose content on the gelatinization temperature and gel consistency was also observed in several homozygous transgenic rice plants. The two authors contributed equally to this work.  相似文献   

4.
An efficient molecular method for the accurate and efficient identification of indica and japonica rice was created based on the polymorphisms of insertion/deletion (InDel) DNA fragments obtained from the basic local alignment search tool (BLAST) to the entire genomic sequences of indica (93-11) and japonica rice (Nipponbare). The 45 InDel loci were validated experimentally by the polymerase chain reaction (PCR) and polyacrylamide gel electrophoresis (PAGE) in 44 typical indica and japonica rice varieties, including 93-11 and Nipponbare. A neutrality test of the data matrix generated from electrophoretic banding patterns of various InDel loci indicated that 34 InDel loci were strongly associated with the differentiation of indica and japonica rice. More extensive analyses involving cultivated rice varieties from 11 Asian countries, and 12 wild Oryza species with various origins confirmed that indica and japonica characteristics could accurately be determined via calculating the average frequency of indica- or japonica-specific alleles on different InDel loci across the rice genome. This method was named as the “InDel molecular index” that combines molecular and statistical methods in determining the indica and japonica characteristics of rice varieties. Compared with the traditional methods based essentially on morphology, the InDel molecular index provides a very accurate, rapid, simple, and efficient method for identifying indica and japonica rice. In addition, the InDel index can be used to determine indica or japonica characteristics of wild Oryza species, which largely extends the utility of this method. The InDel molecular index provides a new tool for the effective selection of appropriate indica or japonica rice germplasm in rice breeding. It also offers a novel model for the study of the origin, evolution, and genetic differentiation of indica and japonica rice adapted to various environmental changes.  相似文献   

5.
The genome sequence and structure of rice chromosome 1   总被引:2,自引:0,他引:2  
The rice species Oryza sativa is considered to be a model plant because of its small genome size, extensive genetic map, relative ease of transformation and synteny with other cereal crops. Here we report the essentially complete sequence of chromosome 1, the longest chromosome in the rice genome. We summarize characteristics of the chromosome structure and the biological insight gained from the sequence. The analysis of 43.3 megabases (Mb) of non-overlapping sequence reveals 6,756 protein coding genes, of which 3,161 show homology to proteins of Arabidopsis thaliana, another model plant. About 30% (2,073) of the genes have been functionally categorized. Rice chromosome 1 is (G + C)-rich, especially in its coding regions, and is characterized by several gene families that are dispersed or arranged in tandem repeats. Comparison with a draft sequence indicates the importance of a high-quality finished sequence.  相似文献   

6.
Genome sequencing in microfabricated high-density picolitre reactors   总被引:21,自引:0,他引:21  
The proliferation of large-scale DNA-sequencing projects in recent years has driven a search for alternative methods to reduce time and cost. Here we describe a scalable, highly parallel sequencing system with raw throughput significantly greater than that of state-of-the-art capillary electrophoresis instruments. The apparatus uses a novel fibre-optic slide of individual wells and is able to sequence 25 million bases, at 99% or better accuracy, in one four-hour run. To achieve an approximately 100-fold increase in throughput over current Sanger sequencing technology, we have developed an emulsion method for DNA amplification and an instrument for sequencing by synthesis using a pyrosequencing protocol optimized for solid support and picolitre-scale volumes. Here we show the utility, throughput, accuracy and robustness of this system by shotgun sequencing and de novo assembly of the Mycoplasma genitalium genome with 96% coverage at 99.96% accuracy in one run of the machine.  相似文献   

7.
对一个水稻珍汕97A不育系核DNA来源的具有自主复制功能的高度重复顺序片段ARS2(全长4720bp)进行了亚克隆构建和测序,获得了它的全顺序。  相似文献   

8.
The systematic comparison of genomic sequences from different organisms represents a central focus of contemporary genome analysis. Comparative analyses of vertebrate sequences can identify coding and conserved non-coding regions, including regulatory elements, and provide insight into the forces that have rendered modern-day genomes. As a complement to whole-genome sequencing efforts, we are sequencing and comparing targeted genomic regions in multiple, evolutionarily diverse vertebrates. Here we report the generation and analysis of over 12 megabases (Mb) of sequence from 12 species, all derived from the genomic region orthologous to a segment of about 1.8 Mb on human chromosome 7 containing ten genes, including the gene mutated in cystic fibrosis. These sequences show conservation reflecting both functional constraints and the neutral mutational events that shaped this genomic region. In particular, we identify substantial numbers of conserved non-coding segments beyond those previously identified experimentally, most of which are not detectable by pair-wise sequence comparisons alone. Analysis of transposable element insertions highlights the variation in genome dynamics among these species and confirms the placement of rodents as a sister group to the primates.  相似文献   

9.
本研究基于下一代测序技术,对黄连基因组进行了勘测,构建了两个插入片段大小分别为200bp和500bp的文库,进行了深度约30X的测序。通过测序获得了54Gb的原始数据,过滤后得到44.8G数据。通过SOAP de nove软件组装后初步获得了contig和Scaffold序列,进一步分析结果显示其基因组大小为1,116Mb左右,大约具有1.1%的杂合度,说明要完成该物种的全基因测序可能在使用鸟枪法的同时,还应该联合BAC文库测序等多种方法.对这些数据进行了初步的组装,获得了130,381条scaffold序列.  相似文献   

10.
The medaka draft genome and insights into vertebrate genome evolution   总被引:3,自引:0,他引:3  
Teleosts comprise more than half of all vertebrate species and have adapted to a variety of marine and freshwater habitats. Their genome evolution and diversification are important subjects for the understanding of vertebrate evolution. Although draft genome sequences of two pufferfishes have been published, analysis of more fish genomes is desirable. Here we report a high-quality draft genome sequence of a small egg-laying freshwater teleost, medaka (Oryzias latipes). Medaka is native to East Asia and an excellent model system for a wide range of biology, including ecotoxicology, carcinogenesis, sex determination and developmental genetics. In the assembled medaka genome (700 megabases), which is less than half of the zebrafish genome, we predicted 20,141 genes, including approximately 2,900 new genes, using 5'-end serial analysis of gene expression tag information. We found single nucleotide polymorphisms (SNPs) at an average rate of 3.42% between the two inbred strains derived from two regional populations; this is the highest SNP rate seen in any vertebrate species. Analyses based on the dense SNP information show a strict genetic separation of 4 million years (Myr) between the two populations, and suggest that differential selective pressures acted on specific gene categories. Four-way comparisons with the human, pufferfish (Tetraodon), zebrafish and medaka genomes revealed that eight major interchromosomal rearrangements took place in a remarkably short period of approximately 50 Myr after the whole-genome duplication event in the teleost ancestor and afterwards, intriguingly, the medaka genome preserved its ancestral karyotype for more than 300 Myr.  相似文献   

11.
Using a polymerase chain reaction (PCR) based method six distinct candidate disease resistant gene (R) homologs from rice have been isolated. The rice sequences are organized into two phylogenetic groups with contrasting genomic organization patterns. The first group, represented by a single sequence, Osh359-1, is more similar to non-riceR sequences than to rice ones and has a simple genomic organization. The second group, represented by Osh359-3, contains the remaining five rice sequences. Osh359-3 consists of a multi-gene family. The members of Osh359-3 family are further found to be clustered together in the genome.  相似文献   

12.
Sequence and analysis of rice chromosome 4   总被引:1,自引:0,他引:1  
Feng Q  Zhang Y  Hao P  Wang S  Fu G  Huang Y  Li Y  Zhu J  Liu Y  Hu X  Jia P  Zhang Y  Zhao Q  Ying K  Yu S  Tang Y  Weng Q  Zhang L  Lu Y  Mu J  Lu Y  Zhang LS  Yu Z  Fan D  Liu X  Lu T  Li C  Wu Y  Sun T  Lei H  Li T  Hu H  Guan J  Wu M  Zhang R  Zhou B  Chen Z  Chen L  Jin Z  Wang R  Yin H  Cai Z  Ren S  Lv G  Gu W  Zhu G  Tu Y  Jia J  Zhang Y  Chen J  Kang H  Chen X  Shao C  Sun Y  Hu Q  Zhang X  Zhang W  Wang L  Ding C  Sheng H  Gu J  Chen S  Ni L  Zhu F  Chen W  Lan L  Lai Y  Cheng Z  Gu M  Jiang J  Li J  Hong G  Xue Y  Han B 《Nature》2002,420(6913):316-320
Rice is the principal food for over half of the population of the world. With its genome size of 430 megabase pairs (Mb), the cultivated rice species Oryza sativa is a model plant for genome research. Here we report the sequence analysis of chromosome 4 of O. sativa, one of the first two rice chromosomes to be sequenced completely. The finished sequence spans 34.6 Mb and represents 97.3% of the chromosome. In addition, we report the longest known sequence for a plant centromere, a completely sequenced contig of 1.16 Mb corresponding to the centromeric region of chromosome 4. We predict 4,658 protein coding genes and 70 transfer RNA genes. A total of 1,681 predicted genes match available unique rice expressed sequence tags. Transposable elements have a pronounced bias towards the euchromatic regions, indicating a close correlation of their distributions to genes along the chromosome. Comparative genome analysis between cultivated rice subspecies shows that there is an overall syntenic relationship between the chromosomes and divergence at the level of single-nucleotide polymorphisms and insertions and deletions. By contrast, there is little conservation in gene order between rice and Arabidopsis.  相似文献   

13.
Legumes (Fabaceae or Leguminosae) are unique among cultivated plants for their ability to carry out endosymbiotic nitrogen fixation with rhizobial bacteria, a process that takes place in a specialized structure known as the nodule. Legumes belong to one of the two main groups of eurosids, the Fabidae, which includes most species capable of endosymbiotic nitrogen fixation. Legumes comprise several evolutionary lineages derived from a common ancestor 60 million years ago (Myr ago). Papilionoids are the largest clade, dating nearly to the origin of legumes and containing most cultivated species. Medicago truncatula is a long-established model for the study of legume biology. Here we describe the draft sequence of the M. truncatula euchromatin based on a recently completed BAC assembly supplemented with Illumina shotgun sequence, together capturing ~94% of all M. truncatula genes. A whole-genome duplication (WGD) approximately 58 Myr ago had a major role in shaping the M. truncatula genome and thereby contributed to the evolution of endosymbiotic nitrogen fixation. Subsequent to the WGD, the M. truncatula genome experienced higher levels of rearrangement than two other sequenced legumes, Glycine max and Lotus japonicus. M. truncatula is a close relative of alfalfa (Medicago sativa), a widely cultivated crop with limited genomics tools and complex autotetraploid genetics. As such, the M. truncatula genome sequence provides significant opportunities to expand alfalfa's genomic toolbox.  相似文献   

14.
We have used the polymerase chain reaction to analyze variation in the size of 5S ribosomal gene spacer sequence. Eighty accessions, including 65 cultivated rice and 15 wild rice, were analyzed. Among them seven size classes of 5S DNA spacer were observed. Classification asindica orjaponica on the basis of 5S DNA spacer patterns generally agrees with classification based on morphological studies, indicating that the length polymorphism of 5S DNA spacer could be used as a molecular marker for taxonomic and phylogenetic analysis. Supported by the National Natural Science Fundation of China Yi Qingming: born in Apr. 1938, Professor  相似文献   

15.
We have used the polymerase chain reaction to analyze variation in the size of 5S ribosomal gene spacer sequence. Eighty accessions, including 65 cultivated rice and 15 wild rice, were analyzed. Among them seven size classes of 5S DNA spacer were observed. Classification asindica orjaponica on the basis of 5S DNA spacer patterns generally agrees with classification based on morphological studies, indicating that the length polymorphism of 5S DNA spacer could be used as a molecular marker for taxonomic and phylogenetic analysis. Supported by the National Natural Science Fundation of China Yi Qingming: born in Apr. 1938, Professor  相似文献   

16.
全基因组测序技术研究及其在木本植物中的应用   总被引:2,自引:0,他引:2  
基因组序列是开展遗传研究重要的信息基础,随着测序技术飞速发展至第3代长片段测序方法,测序读长历经从几十到数万个碱基的提升,对进一步提升基因组组装的完整度以及准确性提供了极大的裨益。现已完成了大量植物种全基因组测序工作,其中木本植物有40多个,还有更多树种的全基因组测序正在进行之中。针对各类测序技术的基因组组装及后续分析,研究人员也开发了大量的生物信息学工具。笔者从测序技术、基因组装技术和全基因组测序生物信息学分析等方面,罗列了目前已完成全基因组测序的木本植物,介绍了全基因组测序技术的发展与应用,以及适用于第3代数据基因组组装的生物学分析软件,为林木基因组研究者提供一定的借鉴。  相似文献   

17.
Comparison of human genetic and sequence-based physical maps   总被引:40,自引:0,他引:40  
Recombination is the exchange of information between two homologous chromosomes during meiosis. The rate of recombination per nucleotide, which profoundly affects the evolution of chromosomal segments, is calculated by comparing genetic and physical maps. Human physical maps have been constructed using cytogenetics, overlapping DNA clones and radiation hybrids; but the ultimate and by far the most accurate physical map is the actual nucleotide sequence. The completion of the draft human genomic sequence provides us with the best opportunity yet to compare the genetic and physical maps. Here we describe our estimates of female, male and sex-average recombination rates for about 60% of the genome. Recombination rates varied greatly along each chromosome, from 0 to at least 9 centiMorgans per megabase (cM Mb(-1)). Among several sequence and marker parameters tested, only relative marker position along the metacentric chromosomes in males correlated strongly with recombination rate. We identified several chromosomal regions up to 6 Mb in length with particularly low (deserts) or high (jungles) recombination rates. Linkage disequilibrium was much more common and extended for greater distances in the deserts than in the jungles.  相似文献   

18.
The copy number of 5S rDNA and centromeric sequence RCS2 was determined by extended DNA fiber based fluorescence in situ hybridization (Fiber-FISH) in rice (Oryza sativa ssp. indica cv. Guangluai No. 4) genome. In order to determine the copy number, it is necessary to know the basepair number that a given length DNA fiber contains under a microscope. Therefore, the length of two DNA fragments, in which the basepair number had been already known, was measured. The insert sequence of the tested BAC 38D17 was 136 kb and its extended DNA was 56.4 μm long, 2.41 kb/μm on average, while that of the tested BAC 44B4 was 144.5 kb in total and 55.7 μm long, 2.60 kb/μm on average under the microscope. They were very close to the theoretical value of B-DNA in the Watson-Crick DNA model, which is 2.97 kb/μm. According to the average value of basepair number per μm of the two samples mentioned above, that is, 2.51 kb/μm, it could be estimated that the copy number was about 686 for 5S rDNA and 286-1121 for the centromere sequence RCS2.  相似文献   

19.
Avian infectious bronchitis virus (AIBV) is classified as a member of the genus coronavirus in the family coronaviridae. The enveloped virus has a positive-sense, single-stranded RNA genome of approximately 28 kilo-bases,which has a 5‘ cap structure and 3‘ polyadenylation tract.The complete genome sequence of infectious bronchitis virus (IBV), Beijing isolate, was determined by cloning sequencing and primer walking. The whole genome is 27733 nucleotides in length, has ten open reading frames:5′-orfla-orflab-s-3a-3b-e-m- 6a-6b-n-3′. Alignments of the genome sequence of IBV Beijing isolate with those of two AIBV strains and one SARS coronavirus were performed respectively. The genome sequence of IBV Beijing isolate compared with that of the IBV strain LX4 (uncompleted, 19440 bp in size) was 91.2% similarity. However, the full-length genome sequence of IBV Beijing isolate was 85.2% identity to that of IBV Strain Beaudette, and was only 50.8% homology to that of SARS coronavirus. The results showed that the genome of IBV has remarkable variation. And IBV Beijing isolate is not closely related to SARS coronavirus. Phylogenetic analyses based on the whole genome sequence, S protein, M protein and N protein, also showed that AIBV Bering isolate is lone virus in group Ⅲ and is distant from SARS coronavirus. In conclusion, this study will contribute to the studies of diagnosis and diseases control on IBV in China.  相似文献   

20.
Resistance-like sequences have been amplified from first strand cDNA and genomic DNA of rice by PCR using oligonucleotide primers designed from sequence motifs conserved between resistance genes of tobacco andArabidopsis thaliana. 3 PCR clones, designatedOsr1, Osr2 andOsr3 which were 98% identical in nucleotide sequence level, have been found to be significantly homologous to known plant resistance genes and all contained the conserved motifs of NBS-LRR type resistance genes, such as P-loop, kinase2a, kinase3a and transmembrane domain.Southern hybridization revealed that rice resistance gene hornologueswere organized as a cluster in the genome. RFLP mapping using a DH population derived from anindica/japonka cross (Zhaiyeqing 8/Jingxi 17) and an RFLP linkage map assigned two copies ofOsrl and one copy ofOsr3 to the distal position of chromosome 12 where a blast resistance QTL has been mapped previously. Northern blot analysis showed thatOsrl gene was constitutively transcribed in rice leaves, shoots and roots. Further study concerning isolation of full-length cDNAs would be conducive to elucidating the functions of these genes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号