Parallelization of variable rate decompression through metadata

Noordsij, Lennart; van der Vlugt, Steven; Bamakhrama, Mohamed A.; Al-Ars, Zaid; Lindstrom, Peter

doi:10.1109/PDP50117.2020.00045

Conference paper (2020)

Authors

Lennart Noordsij (ASML)

Steven van der Vlugt (ASML)

Mohamed A. Bamakhrama (Synopsys Corporation, Universiteit Leiden)

Zaid Al-Ars (Computer Engineering)

Peter Lindstrom (Lawrence Livermore National Laboratory)

Research Group

Computer Engineering

DOI: https://doi.org/10.1109/PDP50117.2020.00045

To reference this document use:

http://resolver.tudelft.nl/uuid:e84c6f43-8bc0-408e-80c1-dbea4a633bd6

Published Date

2020

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Data movement has long been identified as the biggest challenge facing modern computer systems' designers. To tackle this challenge, many novel data compression algorithms have been developed. Often variable rate compression algorithms are favored over fixed rate. However, variable rate decompression is difficult to parallelize. Most existing algorithms adopt a single parallelization strategy suited for a particular HW platform. Such an approach fails to harness the parallelism found in diverse modern HW architectures. We propose a parallelization method for tiled variable rate compression algorithms that consists of multiple strategies that can be applied interchangeably. This allows an algorithm to apply the strategy most suitable for a specific HW platform. Our strategies are based on generating metadata during encoding, which is used to parallelize the decoding process. To demonstrate the effectiveness of our strategies, we implement them in a state-of-the-art compression algorithm called ZFP. We show that the strategies suited for multicore CPUs are different from the ones suited for GPUs. On a CPU, we achieve a near optimal decoding speedup and an overhead size which is consistently less than 0.04% of the compressed data size. On a GPU, we achieve average decoding rates of up to 100 GiB/s. Our strategies allow the user to make a trade-off between decoding throughput and metadata size overhead.
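The core idea in the abstract, recording metadata during encoding so that decoding can start at several points in a variable rate stream, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the toy varint codec, the tile size `BLOCK`, and the names `encode`, `decode_chunk`, and `decode_parallel` are hypothetical and are neither ZFP's format or API nor the specific strategies of the paper. The `blocks_per_chunk` parameter stands in for the trade-off the abstract mentions: fewer blocks per chunk means more recorded offsets (more metadata) and more independent decoding tasks.

```python
from concurrent.futures import ThreadPoolExecutor

BLOCK = 4  # values per tile (hypothetical tile size, not ZFP's)


def encode_varint(value, out):
    """Append one unsigned integer as a LEB128 varint (one or more bytes)."""
    while True:
        byte = value & 0x7F
        value >>= 7
        if value:
            out.append(byte | 0x80)  # continuation bit: more bytes follow
        else:
            out.append(byte)
            return


def encode(values, blocks_per_chunk):
    """Encode tiles back to back and record where each chunk starts."""
    assert len(values) % BLOCK == 0
    stream, offsets = bytearray(), []
    for b in range(len(values) // BLOCK):
        if b % blocks_per_chunk == 0:
            offsets.append(len(stream))  # metadata: byte offset of this chunk
        for v in values[b * BLOCK:(b + 1) * BLOCK]:
            encode_varint(v, stream)
    return bytes(stream), offsets


def decode_chunk(stream, start, n_values):
    """Sequentially decode n_values varints starting at byte offset start."""
    out, pos = [], start
    for _ in range(n_values):
        value, shift = 0, 0
        while True:
            byte = stream[pos]
            pos += 1
            value |= (byte & 0x7F) << shift
            shift += 7
            if not byte & 0x80:
                break
        out.append(value)
    return out


def decode_parallel(stream, offsets, n_values, blocks_per_chunk, workers=4):
    """Decode every chunk independently, using the offsets as entry points."""
    per_chunk = blocks_per_chunk * BLOCK
    counts = [min(per_chunk, n_values - i * per_chunk)
              for i in range(len(offsets))]
    # Threads only show the task structure here; CPython's GIL limits the
    # actual speedup for pure-Python decoding.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        parts = pool.map(decode_chunk, [stream] * len(offsets), offsets, counts)
    return [v for part in parts for v in part]


values = [1, 300, 70000, 5, 0, 127, 128, 16384, 9, 8, 7, 6, 2**20, 3, 2, 1]
stream, offsets = encode(values, blocks_per_chunk=2)
decoded = decode_parallel(stream, offsets, len(values), blocks_per_chunk=2)
```

Without the offsets, a decoder would have to scan the stream from the beginning to find where any tile starts, because each value occupies a data-dependent number of bytes. With them, each chunk is an independent task.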

Files

Paralellization_of_variable_ra... (pdf)

(pdf | 0.536 MB)

Unknown license

09092414.pdf

(pdf | 0.383 MB)

Unknown license
