loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Data Compression Conference (DCC'06)
Snowbird, Utah
March 28-March 30
ISBN: 0-7695-2545-8
S.T. Klein, Bar Ilan University, Israel
T.C. Serebro, Bar Ilan University, Israel
D. Shapira, Ashkelon Acad. Colleges, Israel
We introduce a new model of differencing encoding, that of Compressed Differencing. Given two files for which at least one is in compressed form, the goal is to create a third file which is the delta file of the two original files, in time proportional to the size of the input, that is, without decompressing the compressed files. If both files, S and T, are compressed using the same static Huffman code, generating the differencing file can be done in the traditional way (using a sliding window) directly on the compressed files. The delta encoding is at least as efficient as the delta encoding generated on the original files S and T. Common substrings of S and T are still common substrings of the compressed versions of S and T. However the reverse is not necessarily true, since the common substrings can extend beyond codeword boudaries.
Citation:
S.T. Klein, T.C. Serebro, D. Shapira, "Modeling Delta Encoding of Compressed Files," dcc, pp.457, Data Compression Conference (DCC'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.