首页 | 本学科首页   官方微博 | 高级检索  
     检索      

Efficient Multi-Resolution Compression Algorithm for Disk-Based Backup and Recovery
作者姓名:WANG  Dejun  WANG  Lina  WANG  Hui
作者单位:School of Computer, Wuhan University, Wuhan 430072, Hubei, China
摘    要:0 IntroductionTraditionally,there are two kinds of compression meth-ods developedfor tape-based backup.Oneisincrementalbackup method, which can distinguish modified files , andbackup only those modifiedfilesinthe backup set . The otherone is streamcompression, which eli minate redundancy in asingle file.In a large backup system,numerous redundant data ex-ists across files .Files with the same content may be foundinother nodes due tofile reduplication and distribution;a partlymodifiedfile may …

关 键 词:数据恢复  数据压缩    文件系统  信息安全
文章编号:1007-1202(2006)06-1951-04
收稿时间:2006-03-20

Efficient multi-resolution compression algorithm for disk-based backup and recovery
WANG Dejun WANG Lina WANG Hui.Efficient Multi-Resolution Compression Algorithm for Disk-Based Backup and Recovery[J].Wuhan University Journal of Natural Sciences,2006,11(6):1951-1954.
Authors:Wang Dejun  Wang Lina  Wang Hui
Institution:(1) School of Computer, Wuhan University, 430072 Wuhan, Hubei, China
Abstract:In this paper, we deal with the problem of improving backup and recovery performance by compressing redundancies in large disk-based backup system. We analyze some general compression algorithms; evaluate their scalability and applicability. We investigate the distribution features of the redundant data in whole system range, and propose a multi-resolution distributed compression algorithm which can discern duplicated data at granularity of file level, block level or byte level to reduce the redundancy in backup environment. In order to accelerate recovery, we propose a synthetic backup solution which stores data in a recovery-oriented way and can compose the final data in back-end backup server. Experiments show that this algorithm can greatly reduce bandwidth consumption, save storage cost, and shorten the backup and recovery time. We implement these technologies in our product, called H-info backup system, which is capable of achieving over 10x compression ratio in both network utilization and data storage during backup.
Keywords:backup and recovery  data compression  remote file synchronization  entropy
本文献已被 CNKI 维普 万方数据 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号