We introduce a new approach to the problem of optimal compression when a source code produces multiple codewords for a given symbol. It may seem that the most sensible codeword to use in this case is the shortest one. However, in the proposed free energy approach, random codeword selection yields an effective codeword length that can be less than the shortest codeword length. If the random choices are Boltzmann distributed, the effective length is optimal for the given source code. The expectation-maximization parameter estimation algorithms minimize this effective codeword length. We illustrate the performance of free energy coding on a simple problem where a compression factor of two is gained by using the new method.
Index Terms:
encoding; data compression; parameter estimation; optimisation; free energy coding; optimal compression; source code; multiple codewords; random codeword selection; codeword length; Boltzmann distribution; expectation-maximization parameter estimation; compression factor
Citation:
B.J. Frey, G.E. Hinton, "Free energy coding," dcc, pp.73, Data Compression Conference (DCC '96), 1996