Equally contributory privacy-preserving k-means clustering over vertically partitioned data

作者:

Highlights:

摘要

In recent years, there have been numerous attempts to extend the k-means clustering protocol for single database to a distributed multiple database setting and meanwhile keep privacy of each data site. Current solutions for (whether two or more) multiparty k-means clustering, built on one or more secure two-party computation algorithms, are not equally contributory, in other words, each party does not equally contribute to k-means clustering. This may lead a perfidious attack where a party who learns the outcome prior to other parties tells a lie of the outcome to other parties. In this paper, we present an equally contributory multiparty k-means clustering protocol for vertically partitioned data, in which each party equally contributes to k-means clustering. Our protocol is built on ElGamal's encryption scheme, Jakobsson and Juels's plaintext equivalence test protocol, and mix networks, and protects privacy in terms that each iteration of k-means clustering can be performed without revealing the intermediate values.

论文关键词:Privacy-preserving distributed data mining,k-means clustering,Data security

论文评审过程:Received 26 July 2010, Revised 2 February 2012, Accepted 1 June 2012, Available online 12 June 2012.

论文官网地址:https://doi.org/10.1016/j.is.2012.06.001