The Community for Technology Leaders
Data Compression Conference (2008)
Mar. 25, 2008 to Mar. 27, 2008
ISSN: 1068-0314
ISBN: 978-0-7695-3121-2
pp: 545
ABSTRACT
In this work, we describe a lossless HTML transform which, combined with generally used LZ77 and PPM compression algorithms, allows to attain high compression ratios. Its core is a fully reversible transform featuring substitution of words in an HTML document using a static dictionary or a semi-static dictionary, effective encoding of dictionary indices and numbers.The test results show the proposed transform to improve the HTML compression efficiency of general purpose compressors on average by 17% in case of Deflate and 8% in case of PPMVC.
INDEX TERMS
lossless, compression, HTML, dictionary
CITATION
Przemyslaw Skibinski, "Improving HTML Compression", Data Compression Conference, vol. 00, no. , pp. 545, 2008, doi:10.1109/DCC.2008.74
78 ms
(Ver 3.3 (11022016))