A local asynchronous distributed privacy preserving feature selection algorithm for large peer-to-peer networks

作者:Kamalika Das, Kanishka Bhaduri, Hillol Kargupta

摘要

In this paper we develop a local distributed privacy preserving algorithm for feature selection in a large peer-to-peer environment. Feature selection is often used in machine learning for data compaction and efficient learning by eliminating the curse of dimensionality. There exist many solutions for feature selection when the data are located at a central location. However, it becomes extremely challenging to perform the same when the data are distributed across a large number of peers or machines. Centralizing the entire dataset or portions of it can be very costly and impractical because of the large number of data sources, the asynchronous nature of the peer-to-peer networks, dynamic nature of the data/network, and privacy concerns. The solution proposed in this paper allows us to perform feature selection in an asynchronous fashion with a low communication overhead where each peer can specify its own privacy constraints. The algorithm works based on local interactions among participating nodes. We present results on real-world dataset in order to test the performance of the proposed algorithm.

论文关键词:Privacy preserving, Data mining, Feature selection, Distributed computation

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-009-0274-3