EPIIC: a novel encoding pluggable lossless data compression algorithm

2018
Doğan, Taylan İsmail
Encoding pluggable inverted index compression (EPIIC) is a novel lossless data compression algorithm that applies a pipeline of conventional compression techniques to files that have been transformed into a structure similar to an inverted index. The novelty of this study lies in compressing the positions (indexes) of the hexadecimal values that make up a file, rather than compressing the original file directly. By leveraging the underlying inverted index structure, we avoid storing the positional data of the most frequent hexadecimal value in a file. Moreover, a slightly modified variant of run-length encoding is used to make the data even more compressible. As a result, this approach is observed to perform on a par with widely known algorithms such as LZMA and bzip2, especially on text and XML files.
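The core idea of the abstract, indexing symbol positions and omitting the most frequent symbol, can be illustrated with a minimal Python sketch. This is not the thesis implementation: it assumes that a "hexadecimal" corresponds to one byte value, the function names `build_index`, `drop_most_frequent`, and `rebuild` are hypothetical, and the run-length encoding variant and the pluggable encoding stages are left out because the abstract does not specify them.

```python
from collections import defaultdict


def build_index(data: bytes) -> dict[int, list[int]]:
    """Map each byte value to the ascending list of positions where it occurs."""
    index = defaultdict(list)
    for pos, value in enumerate(data):
        index[value].append(pos)
    return dict(index)


def drop_most_frequent(index: dict[int, list[int]]):
    """Return (most frequent symbol, index without that symbol's positions).

    Ties are broken in favour of the smallest byte value (an assumption
    made here only so the result is deterministic).
    """
    if not index:
        return None, {}
    top = max(index, key=lambda s: (len(index[s]), -s))
    rest = {s: positions for s, positions in index.items() if s != top}
    return top, rest


def rebuild(length: int, top, rest: dict[int, list[int]]) -> bytes:
    """Reconstruct the data: fill with the omitted symbol, then overwrite."""
    out = bytearray([top] * length)
    for symbol, positions in rest.items():
        for pos in positions:
            out[pos] = symbol
    return bytes(out)


data = b"aaabacaad"
top, rest = drop_most_frequent(build_index(data))
restored = rebuild(len(data), top, rest)
```

Because every position not listed under another symbol must hold the most frequent one, only the file length and that symbol's identity are needed to restore it, which is why its position list can be dropped without losing information.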
Citation Formats
T. İ. Doğan, “EPIIC: a novel encoding pluggable lossless data compression algorithm,” M.S. - Master of Science, Middle East Technical University, 2018.