Parallelizing filter-and-verification based exact set similarity joins on multicores

作者:

Highlights:

• Multi-threading has not yet been considered to speed up set similarity joins.

• We propose a novel data-parallel set similarity join algorithm.

• Multi-threading speeds up the set similarity join 2 to 10 times.

• Implementation optimizations are not benefitial for the runtime.

摘要

•Multi-threading has not yet been considered to speed up set similarity joins.•We propose a novel data-parallel set similarity join algorithm.•Multi-threading speeds up the set similarity join 2 to 10 times.•Implementation optimizations are not benefitial for the runtime.

论文关键词:Set similarity join,Parallelization,Multi-threading,Multi-core,Filter-and-verification

论文评审过程:Received 11 February 2021, Revised 6 August 2021, Accepted 6 October 2021, Available online 20 October 2021, Version of Record 12 May 2022.

论文官网地址:https://doi.org/10.1016/j.is.2021.101912