Asymmetric Encoding-Decoding Schemes for Lossless Data Compression
By: Hirosuke Yamamoto, Ken-ichi Iwata
Potential Business Impact:
Makes computer files smaller without losing anything.
This paper proposes a new lossless data compression coding scheme named an asymmetric encoding-decoding scheme (AEDS), which can be considered as a generalization of tANS (tabled variant of asymmetric numeral systems). In the AEDS, a data sequence $\bm{s}=s_1s_2\cdots s_n$ is encoded in backward order $s_t, t=n, \cdots, 2,1$, while $\bm{s}$ is decoded in forward order $s_t, t=1, 2, \cdots, n$ in the same way as the tANS. But, the code class of the AEDS is much broader than that of the tANS. We show for i.i.d.~sources that an AEDS with 2 states (resp.~5 states) can attain a shorter average code length than the Huffman code if a child of the root in the Huffman code tree has a probability weight larger than 0.61803 (resp.~0.56984). Furthermore, we derive several upper bounds on the average code length of the AEDS, which also hold for the tANS, and we show that the average code length of the optimal AEDS and tANS with $N$ states converges to the source entropy with speed $O(1/N)$ as $N$ increases.
Similar Papers
Range-Coder with fast Adaptation and Table-Based Decoding
Information Theory
Makes data compression much faster for computers.
The Sample Complexity of Lossless Data Compression
Information Theory
Makes computer files smaller, faster, and more reliable.
Successive Refinement for Lossy Compression of Individual Sequences
Information Theory
Makes files smaller with two steps.