
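An XML data compression method based on Huffman coding
Cite this article: SHI Peng, LI Min, YU Tao, ZHAO LiQiang, WANG JianLin. An XML data compression method based on Huffman coding[J]. Journal of Beijing University of Chemical Technology (Natural Science Edition), 2013, 40(4): 120-124.
Authors: SHI Peng  LI Min  YU Tao  ZHAO LiQiang  WANG JianLin
Affiliations: College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029; China Nuclear Control System Engineering Co. Ltd, Beijing 100176
Abstract: To address the low access rate of a production process report system to large data sources under a given network bandwidth, an XML data compression method based on Huffman coding is proposed. A data processing class is constructed to extract the highly repeated node units from an XML document. These node units are encoded with Huffman coding, and the encoded document is then compressed with the LZMA algorithm, which together form the Huffman-LZMA compression algorithm. The algorithm is applied to the design of a production process report system. Results from practical use show that it achieves a compression ratio of about 88% on production process report data sources, effectively saving network bandwidth and storage space and improving the access rate of the report system.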

Keywords: production process report system  compression algorithm  Huffman coding  LZMA algorithm
Received: 2012-12-19

Design and application of an XML data compression algorithm based on Huffman coding
SHI Peng, LI Min, YU Tao, ZHAO LiQiang, WANG JianLin. Design and application of an XML data compression algorithm based on Huffman coding[J]. Journal of Beijing University of Chemical Technology, 2013, 40(4): 120-124.
Authors:SHI Peng  LI Min  YU Tao  ZHAO LiQiang  WANG JianLin
Institution: 1. College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029; 2. China Nuclear Control System Engineering Co. Ltd, Beijing 100176, China
Abstract: An XML data compression method based on Huffman coding is proposed to address the low access rate of a production process report system to large data sources under a given network bandwidth. A data processing class is constructed to extract the highly repeated node units from an XML document. These node units are encoded with Huffman coding, and the encoded document is then compressed with the LZMA algorithm. This removes the dependence of traditional XML data compression algorithms on a document type definition (DTD) and an XML parser, while still achieving a good compression effect. The resulting Huffman-LZMA compression algorithm was applied to the design of a production process report system. In practical use, the compression ratio on report data reached about 88%, which effectively saved bandwidth and storage space and improved the report access rate.
Keywords:production process report system  data compression algorithm  Huffman coding  LZMA algorithm
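
The pipeline outlined in the abstract (collect repeated units, Huffman-code them, then apply LZMA) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The regex tokenizer, the decision to treat both tags and text runs as Huffman symbols, and the function names `tokenize`, `huffman_codes`, `compress`, and `decompress` are all assumptions made for illustration. Only the standard-library modules `re`, `heapq`, `lzma`, `collections`, and `itertools` are used, and no DTD or XML parser is involved.

```python
import heapq
import lzma
import re
from collections import Counter
from itertools import count

# Assumption: a token is either one tag ("<...>") or one run of text between tags.
TOKEN_RE = re.compile(r"<[^>]+>|[^<]+")


def tokenize(xml_text):
    """Split XML text into tag and text tokens without using an XML parser."""
    return TOKEN_RE.findall(xml_text)


def huffman_codes(freqs):
    """Build a prefix-free code {symbol: bitstring} from a frequency table."""
    if not freqs:
        return {}
    tiebreak = count()  # unique counter so heap entries never compare nodes
    heap = [(f, next(tiebreak), sym) for sym, f in freqs.items()]
    heapq.heapify(heap)
    if len(heap) == 1:
        return {heap[0][2]: "0"}
    # Leaves are str symbols; internal nodes are (left, right) tuples.
    while len(heap) > 1:
        f1, _, a = heapq.heappop(heap)
        f2, _, b = heapq.heappop(heap)
        heapq.heappush(heap, (f1 + f2, next(tiebreak), (a, b)))
    codes = {}

    def walk(node, prefix):
        if isinstance(node, tuple):
            walk(node[0], prefix + "0")
            walk(node[1], prefix + "1")
        else:
            codes[node] = prefix

    walk(heap[0][2], "")
    return codes


def compress(xml_text):
    """Huffman-code the token stream, pack the bits, then LZMA-compress them."""
    tokens = tokenize(xml_text)
    codes = huffman_codes(Counter(tokens))
    bits = "".join(codes[t] for t in tokens)
    padded = bits + "0" * (-len(bits) % 8)  # pad to a whole number of bytes
    packed = int(padded, 2).to_bytes(len(padded) // 8, "big") if padded else b""
    return lzma.compress(packed), codes, len(bits)


def decompress(blob, codes, n_bits):
    """Invert compress(): LZMA-decompress, unpack the bits, decode the symbols."""
    packed = lzma.decompress(blob)
    bits = format(int.from_bytes(packed, "big"), "b").zfill(len(packed) * 8)[:n_bits]
    decode = {code: sym for sym, code in codes.items()}
    out, current = [], ""
    for bit in bits:
        current += bit
        if current in decode:  # prefix-free, so the first match is the symbol
            out.append(decode[current])
            current = ""
    return "".join(out)
```

In this sketch the code table and the bit count have to be kept alongside the compressed blob, since `decompress` cannot rebuild them from the blob alone. How the original method stores or transmits its code table is not stated in the abstract.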
This article is indexed in CNKI, Wanfang Data, and other databases.