User k-anonymity for privacy preserving data mining of query logs

Abstract

Query logs are sensitive data and must be anonymized before they are published, so that the users who appear in them cannot be identified. Failures of anonymity have already been found in logs released by well-known companies. This paper presents a method for anonymizing query logs using microaggregation. Our proposal ensures the k-anonymity of the users in the query log while preserving the utility of the log. We evaluate the proposal on real query logs, reporting the privacy and utility achieved, and we provide estimates of how well the anonymized data can be used in clustering-based data mining processes.

Keywords: Privacy, Query log, Microaggregation, k-Anonymity, Clustering, Web search

Review history: Received 26 February 2010, Revised 27 December 2010, Accepted 5 January 2011, Available online 8 February 2011.

Paper URL: https://doi.org/10.1016/j.ipm.2011.01.004