Loading…
The Smallest Grammar Problem Revisited
In a seminal paper, Charikar et al. derive upper and lower bounds on the approximation ratios for several grammar-based compressors, but in all cases there is a gap between the lower and upper bound. Here the gaps for LZ78 and BISECTION are closed by showing that the approximation ratio of LZ78 is...
Saved in:
Published in: | IEEE transactions on information theory 2021-01, Vol.67 (1), p.317-328 |
---|---|
Main Authors: | , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | In a seminal paper, Charikar et al. derive upper and lower bounds on the approximation ratios for several grammar-based compressors, but in all cases there is a gap between the lower and upper bound. Here the gaps for LZ78 and BISECTION are closed by showing that the approximation ratio of LZ78 is \Theta ((\text {n}/\log \text {n})^{2/3}) , whereas the approximation ratio of BISECTION is \Theta (\sqrt {\text {n}/\log \text {n}}) . In addition, the lower bound for RePair is improved from \Omega (\sqrt {\log \text {n}}) to \Omega (\log \text {n}/\log \log \text {n}) . Finally, results of Arpe and Reischuk relating grammar-based compression for arbitrary alphabets and binary alphabets are improved. |
---|---|
ISSN: | 0018-9448 1557-9654 |
DOI: | 10.1109/TIT.2020.3038147 |