Sixth Mexican International Conference on Computer Science (ENC'05)
Data Preprocessing by Sequential Pattern Mining for LZW
Puebla, Mexico
September 26-September 30
ISBN: 0-7695-2454-0
Osslan O. Vergara-Villegas, Instituto Nacional de Astrofisica Optica y Electronica, Mexico
Rene A. Garcia-Hernandez, Instituto Nacional de Astrofisica Optica y Electronica, Mexico
J. Ariel Carrasco-Ochoa, Instituto Nacional de Astrofisica Optica y Electronica, Mexico
Raul Pinto, Instituto Nacional de Astrofisica Optica y Electronica, Mexico
LZW is a lossless data compression algorithm that has been incorporated into the standard of the Consultative Committee on International Telegraphy and Telephony. LZW is also used in the GIF, TIFF and PDF file formats. In this paper, we propose an improvement to LZW using ideas from Sequential Pattern Mining. The goal of Sequential Pattern Mining is to find all the Maximal Frequent Sequences (MFSs): sequences that appear at least a given threshold number of times and are not subsequences of any other frequent sequence. We preprocess the data with an algorithm that finds all the MFSs, and we then manage the MFSs as part of the LZW dictionary according to their frequency. This modification yields a new variant of the LZW algorithm. We report experiments on text files showing the compression rates achieved by the proposed algorithm.
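The abstract does not give the details of the MFS search or of how the dictionary is managed, so the following Python sketch only illustrates the general idea of seeding an LZW dictionary with frequent sequences. It is not the authors' algorithm. Everything in it is an assumption made for illustration: sequences are taken to be contiguous character substrings, the names `frequent_substrings`, `initial_entries`, `lzw_compress` and `lzw_decompress` are hypothetical, seeds are ordered by descending frequency, and every prefix of a seed is also added so that greedy LZW matching can reach it.

```python
from collections import Counter


def frequent_substrings(text, min_count, max_len=8):
    """Assumed stand-in for an MFS search: maximal contiguous substrings
    (length 2..max_len) occurring at least min_count times (overlaps count)."""
    counts = Counter(
        text[i:i + n]
        for n in range(2, max_len + 1)
        for i in range(len(text) - n + 1)
    )
    frequent = {s: c for s, c in counts.items() if c >= min_count}
    # Keep only substrings not contained in a longer frequent substring.
    maximal = [s for s in frequent
               if not any(s != t and s in t for t in frequent)]
    # Most frequent first, so frequent seeds receive the smallest codes.
    return sorted(maximal, key=lambda s: (-frequent[s], s))


def initial_entries(seeds):
    """Single characters 0..255 followed by every prefix of every seed.
    Assumes the text only contains characters with code points below 256."""
    entries = [chr(i) for i in range(256)]
    seen = set(entries)
    for seed in seeds:
        for k in range(2, len(seed) + 1):
            prefix = seed[:k]
            if prefix not in seen:
                seen.add(prefix)
                entries.append(prefix)
    return entries


def lzw_compress(text, seeds=()):
    """Textbook LZW encoding, starting from the seeded dictionary."""
    table = {s: i for i, s in enumerate(initial_entries(seeds))}
    codes, current = [], ""
    for ch in text:
        candidate = current + ch
        if candidate in table:
            current = candidate
        else:
            codes.append(table[current])
            table[candidate] = len(table)
            current = ch
    if current:
        codes.append(table[current])
    return codes


def lzw_decompress(codes, seeds=()):
    """Inverse of lzw_compress; must be given the same seeds."""
    entries = initial_entries(seeds)
    if not codes:
        return ""
    previous = entries[codes[0]]
    out = [previous]
    for code in codes[1:]:
        if code < len(entries):
            entry = entries[code]
        elif code == len(entries):
            # Code refers to the entry the encoder had just created.
            entry = previous + previous[0]
        else:
            raise ValueError("invalid LZW code")
        out.append(entry)
        entries.append(previous + entry[0])
        previous = entry
    return "".join(out)


text = "abcabcabc"
seeds = frequent_substrings(text, min_count=3)   # ['abc']
plain = lzw_compress(text)                       # 6 codes
seeded = lzw_compress(text, seeds)               # 4 codes
```

In this sketch the decoder needs the same seed list as the encoder, so in practice the seeds would have to be stored or transmitted alongside the codes. That overhead is ignored here, and a shorter code list on one toy input says nothing about the compression rates reported in the paper.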
Citation:
Osslan O. Vergara-Villegas, Rene A. Garcia-Hernandez, J. Ariel Carrasco-Ochoa, Raul Pinto, "Data Preprocessing by Sequential Pattern Mining for LZW," Sixth Mexican International Conference on Computer Science (ENC'05), pp. 82-87, 2005