loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Data Compression Conference (DCC'06)
Snowbird, Utah
March 28-March 30
ISBN: 0-7695-2545-8
Joaqu?n Adiego, Universidad de Valladolid, Spain
Pablo de la Fuente, Universidad de Valladolid, Spain
Most text compression algorithms perform compression at character level. If the algorithm is adaptive, it slowly learns correlations between adjacent pairs of characters, then triples and so on. The algorithm rarely has a chance to take advantage of longer range correlations. If text compression algorithms were to use larger units (words) than single characters as the basic storage element, they would be able to make the most of the longer range correlations and, perhaps, achieve better compression performance. Faster compression may also be possible by working with words. On the other hand, PPM is one of the most promising lossless discrete-data and character-based compression algorithms, which uses Markov models of order k.
Citation:
Joaqu?n Adiego, Pablo de la Fuente, "On the Use of Words as Source Alphabet Symbols in PPM," dcc, pp.435, Data Compression Conference (DCC'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.