loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
International Conference on Information Technology: Computers and Communications
Dictionary-Based Fast Transform for Text Compression
Las Vegas, Nevada
April 28-April 30
ISBN: 0-7695-1916-4
Weifeng Sun, University of Central Florida
Nan Zhang, University of Central Florida
Amar Mukherjee, University of Central Florida
In this paper we present StarNT, a dictionary-based fast lossless text transform algorithm. With a static generic dictionary, StarNT achieves a superior compression ratio than almost all the other recent efforts based on BWT and PPM. This algorithm utilizes ternary search tree to expedite transform encoding. Experimental results show that the average compression time has improved by orders of magnitude compared with our previous algorithm LIPT and the additional time overhead it introduced to the backend compressor is unnoticeable.
Based on StarNT, we propose StarZip, a domain-specific lossless text compression utility. Using domain-specific static dictionaries embedded in the system, StarZip achieves an average improvement in compression performance (in terms of BPC) of 13% over bzip2 -9,19% over gzip -9, and 10% over PPMD.
Citation:
Weifeng Sun, Nan Zhang, Amar Mukherjee, "Dictionary-Based Fast Transform for Text Compression," itcc, pp.176, International Conference on Information Technology: Computers and Communications, 2003
Usage of this product signifies your acceptance of the Terms of Use.