A top-k spatial join querying processing algorithm based on spark

作者:

Highlights:

• To the best of our knowledge, STKSJ algorithm is the first implementation in Spark.

• Grid partitioning and Z-order methods are used to partition and encode spatial data.

• Two efficient operations are presented to project and replicate spatial objects.

• An improved plane sweeping algorithm is proposed to improve the performance of STKSJ.

• Experimental results show that STKSJ performs better than the other algorithms.

摘要

•To the best of our knowledge, STKSJ algorithm is the first implementation in Spark.•Grid partitioning and Z-order methods are used to partition and encode spatial data.•Two efficient operations are presented to project and replicate spatial objects.•An improved plane sweeping algorithm is proposed to improve the performance of STKSJ.•Experimental results show that STKSJ performs better than the other algorithms.

论文关键词:Cloud computing,Spark platform,Top-k spatial join query,Plane sweeping algorithm

论文评审过程:Received 24 June 2017, Revised 11 November 2018, Accepted 14 July 2019, Available online 22 July 2019, Version of Record 29 July 2019.

论文官网地址:https://doi.org/10.1016/j.is.2019.101419