A Cryptographic Ensemble for secure third party data analysis: Collaborative data clustering without data owner participation

作者:

Highlights:

摘要

This paper introduces the twin concepts Cryptographic Ensembles and Global Encrypted Distance Matrices (GEDMs), designed to provide a solution to outsourced secure collaborative data clustering. The cryptographic ensemble comprises: Homomorphic Encryption (HE) to preserve raw data privacy, while supporting data analytics; and Multi-User Order Preserving Encryption (MUOPE) to preserve the privacy of the GEDM. Clustering can therefore be conducted over encrypted datasets without requiring decryption or the involvement of data owners once encryption has taken place, all with no loss of accuracy. The GEDM concept is applicable to large scale collaborative data mining applications that feature horizontal data partitioning. In the paper DBSCAN clustering is adopted for illustrative and evaluation purposes. The results demonstrate that the proposed solution is both efficient and accurate while maintaining data privacy.

论文关键词:Data mining as a service,Privacy preserving data mining,Security,Data outsourcing

论文评审过程:Available online 30 August 2019, Version of Record 9 April 2020.

论文官网地址:https://doi.org/10.1016/j.datak.2019.101734