首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于BWC的XML文本数据索引技术
引用本文:仲志平,刘渝妍,翟从鸿.基于BWC的XML文本数据索引技术[J].安徽师范大学学报(自然科学版),2011,34(3).
作者姓名:仲志平  刘渝妍  翟从鸿
作者单位:1. 安徽师范大学物理与电子信息学院,中国芜湖,241000
2. 昆明学院现代教育技术中心,中国昆明,650000
基金项目:安徽省自然科学基金项目(KJ2010B280)
摘    要:在XML文档中,相当大的部分是由文本数据组成的,针对XML文本数据占用空间较大、对压缩文本数据有效搜索效率较低的难点,基于BWC提出了压缩XML文本数据索引的技术,通过构造全文本数据模型,并利用整体压缩自索引存储XML文档的文本数据,实验结果表明,该技术不仅有效支持XPath查询语言文本搜索,而且内存消耗相对较小,实现了中小规模数据的内存搜索.

关 键 词:自索引  后向搜索  文本数据  BWC  

Indexing Technique of XML Text Data Based on BWC
ZHONG Zhi-ping,LIU Yu-yan,ZHAI Cong-hong.Indexing Technique of XML Text Data Based on BWC[J].Journal of Anhui Normal University(Natural Science Edition),2011,34(3).
Authors:ZHONG Zhi-ping  LIU Yu-yan  ZHAI Cong-hong
Institution:ZHONG Zhi-ping1,LIU Yu-yan2,ZHAI Cong-hong1(1.College of Physics and Electrical Information,Anhui Normal University,Wuhu 241000,China,2.Modern Education Technology Center of Kunming University,Kunming 650000,China)
Abstract:A large number of fractions of an XML document are composed of text data.Considering the problems of the size of large XML document and less efficiency of effective searching on compressed text data,an index technology for compressed XML text data based on BWC is presented in this paper.The proposed technique is implemented by constructing a full text data model and in which the text data of XML document is stored with global compressed self-index.Experimental results shows that not only the proposed techni...
Keywords:self-index  backward searching  text data  Burrows Wheeler compression  
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号