An Empirical Performance Comparison between Matrix Multiplication Join and Hash Join on GPUs
Abstract
Recent advances in Graphics Processing Units (GPUs) have enabled significant performance gains for database operators, in particular joins. How conventional join implementations, such as hash joins, benefit from the massive parallelism of GPUs has been studied intensively. With the proliferation of machine learning, more databases have begun to provide native support for the basic building blocks of ML algorithms, i.e., linear algebra operators such as matrix multiplication (MM). Despite growing interest in processing relational joins using matrix multiplication (MM-join), two crucial questions remain open: i) how efficient are current MM-join implementations compared to GPU-based join algorithms; and ii) how should practitioners choose between MM-join and conventional GPU-based joins for different data characteristics? In this paper, we compare the execution time and memory I/O of MM-join against multiple GPU hash joins. An empirical analysis of our experimental results reveals that the state-of-the-art hash join implementation scales well across a wide range of data characteristics. In contrast, MM-join outperforms the state-of-the-art hash join at low join selectivity and low table cardinality, but scales poorly due to synchronous data movement and computation.
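To illustrate the MM-join idea referenced above, the following minimal NumPy sketch shows how an equi-join can be expressed as a multiplication of 0/1 indicator matrices. This is not the implementation evaluated in the paper; the relation contents, the key-domain size `D`, and all variable names are illustrative assumptions.

```python
import numpy as np

# Hypothetical toy relations joining on a small integer key domain [0, D).
D = 8
r_keys = np.array([1, 3, 3, 5])   # join-key column of relation R
s_keys = np.array([3, 5, 5, 7])   # join-key column of relation S

# Encode each relation as a 0/1 indicator matrix: row i has a 1 in
# column k iff tuple i carries join key k.
R = np.zeros((len(r_keys), D), dtype=np.int32)
S = np.zeros((len(s_keys), D), dtype=np.int32)
R[np.arange(len(r_keys)), r_keys] = 1
S[np.arange(len(s_keys)), s_keys] = 1

# R @ S.T is a |R| x |S| matrix whose (i, j) entry is nonzero exactly
# when tuple i of R joins with tuple j of S on the key column.
M = R @ S.T
matches = np.argwhere(M > 0)      # index pairs of joining tuples
print(matches)                    # [[1 0] [2 0] [3 1] [3 2]]
```

In a GPU setting, the same multiplication would typically be dispatched to a dense or sparse GEMM kernel (e.g., via cuBLAS or cuSPARSE), which is what makes the approach attractive despite the indicator matrices growing with the key domain.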