
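An English Text Compression Algorithm Based on a Dynamic Dictionary
Cite this article: JIANG Li, SUN Jian-ling, WANG Xin-yu, YANG Chang-sheng. An English Text Compression Algorithm Based on a Dynamic Dictionary [J]. Journal of Southern Yangtze University (Natural Science Edition), 2007, 6(4): 442-445.
Authors: JIANG Li  SUN Jian-ling  WANG Xin-yu  YANG Chang-sheng
Affiliation: College of Computer Science and Technology, Zhejiang University, Hangzhou, Zhejiang 310027
Abstract: The traditional compression algorithms LZ77 and LZ78, as well as the improved LZW, all process text one character at a time. This processing mode slows adaptation to the correlations among multi-character words and thereby directly lowers compression efficiency. To improve compression efficiency, this paper builds on the LZW compression algorithm, incorporates the design idea of word-level processing, and proposes a word-based LZW algorithm. Experimental results show that this word-based text compression algorithm achieves consistently higher compression efficiency than the original LZW algorithm. The algorithm can be applied directly to text compression in other languages.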

Keywords: compression  LZW algorithm  correlation  word
Article ID: 1671-7147(2007)04-0442-04
Received: 2005-10-01
Revised: 2006-03-30

A Compression Algorithm for English-Text Based on Dynamic Dictionary
JIANG Li, SUN Jian-ling, WANG Xin-yu, YANG Chang-sheng. A Compression Algorithm for English-Text Based on Dynamic Dictionary [J]. Journal of Southern Yangtze University: Natural Science Edition, 2007, 6(4): 442-445.
Authors:JIANG Li  SUN Jian-ling  WANG Xin-yu  YANG Chang-sheng
Institution: College of Computer Science, Zhejiang University, Hangzhou 310027, China
Abstract: The classical text compression algorithms LZ77 and LZ78, as well as the later improved LZW, all process input one character at a time when collecting tokens. As a result, they learn the correlations between multi-character words slowly, which directly lowers the compression ratio. Building on the LZW algorithm, this paper introduces a word-based processing mode to address the problem; the resulting algorithm is called "Word-based LZW". Experiments confirm that Word-based LZW achieves stable and better compression efficiency than the original LZW. The algorithm can also be easily extended to text compression for other languages.
Keywords:compression  LZW algorithm  correlation  word
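
The abstract does not give the details of Word-based LZW, so the following Python sketch only illustrates the general idea of running LZW over word tokens instead of single characters. It is not the authors' implementation. The names `tokenize`, `compress`, and `decompress` are hypothetical, and the block assumes that the set of distinct tokens (the initial dictionary) is collected in a first pass and shared with the decoder.

```python
import re


def tokenize(text):
    # Assumption: a token is a run of letters/digits or a run of anything else,
    # so concatenating the tokens reproduces the text exactly.
    return re.findall(r"[A-Za-z0-9]+|[^A-Za-z0-9]+", text)


def compress(tokens):
    # Initial dictionary: one entry per distinct token (assumed to be
    # transmitted to the decoder alongside the codes).
    vocab = sorted(set(tokens))
    table = {(tok,): i for i, tok in enumerate(vocab)}
    codes = []
    phrase = ()
    for tok in tokens:
        candidate = phrase + (tok,)
        if candidate in table:
            # Keep extending the current phrase while it is known.
            phrase = candidate
        else:
            # Emit the longest known phrase and learn the extended one.
            codes.append(table[phrase])
            table[candidate] = len(table)
            phrase = (tok,)
    if phrase:
        codes.append(table[phrase])
    return vocab, codes


def decompress(vocab, codes):
    if not codes:
        return []
    table = [(tok,) for tok in vocab]
    prev = table[codes[0]]
    out = list(prev)
    for code in codes[1:]:
        if code < len(table):
            entry = table[code]
        else:
            # The code refers to the phrase the encoder had just created,
            # which must be prev followed by prev's first token.
            entry = prev + (prev[0],)
        out.extend(entry)
        # Mirror the encoder: learn prev extended by the first token of entry.
        table.append(prev + (entry[0],))
        prev = entry
    return out


text = "the cat and the dog and the cat and the dog"
tokens = tokenize(text)
vocab, codes = compress(tokens)
restored = "".join(decompress(vocab, codes))
```

In this sketch, dictionary phrases are tuples of whole words and separators, so a repeated multi-word sequence can be learned after a single occurrence rather than character by character. The cost of storing or transmitting `vocab` is ignored here.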
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.