2007 Data Compression Conference (DCC'07)
Normalized maximum likelihood model of order-1 for the compression of DNA sequences
Snowbird, Utah
March 27-March 29
ISBN: 0-7695-2791-4
We present the normalized maximum likelihood (NML) model for classes of models with memory described by first-order dependencies. The model is used to efficiently locate and encode the best regressor present in a dictionary. Combining the order-1 NML model with the order-0 NML model yields an algorithm that improves consistently on the earlier order-0 NML algorithm and demonstrates superior performance in the practical compression of the human genome.
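To make the idea of "locating the best regressor" concrete, the following is a minimal sketch of the simpler order-0 case only, not the order-1 model of this paper. It assumes a match/mismatch model class in which each base equals the corresponding regressor base with probability 1 - theta and is one of the other three bases with probability theta/3; the abstract does not spell out the model class, so this is an assumption. The function names (`nml_normalizer`, `nml_code_length`, `best_regressor`) and the exhaustive search over offsets are hypothetical and chosen for illustration.

```python
from math import comb, log2


def nml_normalizer(n):
    """Sum of maximized likelihoods over all blocks of length n (n >= 1).

    Under the assumed match/mismatch class the alphabet size cancels,
    leaving sum_m C(n, m) (m/n)^m ((n-m)/n)^(n-m).
    """
    total = 0.0
    for m in range(n + 1):
        p = m / n
        total += comb(n, m) * (p ** m) * ((1 - p) ** (n - m))
    return total


def nml_code_length(block, regressor, alphabet_size=4):
    """NML code length in bits of `block` given an equally long `regressor`."""
    n = len(block)
    m = sum(1 for a, b in zip(block, regressor) if a == b)  # matching positions
    bits = 0.0
    if m > 0:
        bits -= m * log2(m / n)  # cost of the matches at the ML parameter
    if n - m > 0:
        # each mismatch is spread uniformly over the other symbols
        bits -= (n - m) * log2((n - m) / (n * (alphabet_size - 1)))
    return bits + log2(nml_normalizer(n))


def best_regressor(history, block):
    """Exhaustively pick the offset in `history` with the shortest code length."""
    n = len(block)
    candidates = range(len(history) - n + 1)
    return min(candidates,
               key=lambda i: nml_code_length(block, history[i:i + n]))
```

For example, `best_regressor("ACGTACGT", "GTAC")` selects offset 2, where the block is repeated exactly. A practical coder would also have to encode the chosen offset and would avoid the exhaustive scan; both are omitted here.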