Privacy-preserving publishing microdata with full functional dependencies

作者：

Highlights：

•

摘要

Data publishing has generated much concern on individual privacy. Recent work has shown that different background knowledge can bring various threats to the privacy of published data. In this paper, we study the privacy threat from the full functional dependency (FFD) that is used as part of adversary knowledge. We show that the cross-attribute correlations by FFDs (e.g., Phone → Zipcode) can bring potential vulnerability. Unfortunately, none of the existing anonymization principles (e.g., k-anonymity, ℓ-diversity, etc.) can effectively prevent against an FFD-based privacy attack. We formalize the FFD-based privacy attack and define the privacy model, (d,ℓ)-inference, to combat the FD-based attack. We distinguish the safe FFDs that will not jeopardize privacy from the unsafe ones. We design robust algorithms that can efficiently anonymize the microdata with low information loss when the unsafe FFDs are present. The efficiency and effectiveness of our approach are demonstrated by the empirical study.

论文关键词：Privacy-preserving data publishing,Functional dependency,Utility

论文评审过程：Received 23 January 2010, Revised 30 October 2010, Accepted 2 November 2010, Available online 10 November 2010.

论文官网地址：https://doi.org/10.1016/j.datak.2010.11.002