āLempel-Ziv: a ā1-Bit Catastropheā but Not a Tragedyā, 2017-07-13 ()ā :
The so-called āone-bit catastropheā for the compression algorithm LZā78 asks whether the compression ratio of an infinite word can change when a single bit is added in front of it.
We answer positively this open question raised by Lutz and others: we show that there exists an infinite word w such that Ļsup(w) = 0 but Ļinf(0w) > 0, where Ļsup and Ļinf are respectively the limāsup and the lim inf of the compression ratios Ļ of the prefixes.
To that purpose we explore the behavior of LZā78 on finite words and show the following results:
\n
- There is a constant C > 0 such that, for any finite word w and any letter a, Ļ(aw) ⤠Cā(Ļ(w) log |w|). Thus, sufficiently compressible words (Ļ(w) = o(1ālog|w|)) remain compressible with a letter in front;
\n
The previous result is tight up to a multiplicative constant for any compression ratio Ļ(w) = \U0001D4AA(1ālog|w|). In particular, there are infinitely many words w satisfying Ļ(w) = \U0001D4AA(1ālog|w|) but Ļ(0w) = Ī©(1).