Nov. 27, 2005 to Nov. 30, 2005
Ke Wang, Simon Fraser University
Benjamin C. M. Fung, Simon Fraser University
Philip S. Yu, IBM T. J. Watson Research Center
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDM.2005.142
In this paper, we present a template-based approach to privacy preservation that protects against the threats posed by data mining capabilities. The problem has dual goals: preserve the information needed for a desired classification analysis, and limit the usefulness of unwanted sensitive inferences that may be derived from the data. Sensitive inferences are specified by a set of "privacy templates". Each template specifies the sensitive information to be protected, a set of identifying attributes, and the maximum association allowed between the two. We show that suppressing domain values is an effective way to eliminate sensitive inferences. For a large data set, finding an optimal suppression is hard, since it requires optimization over all possible suppressions. We present an approximate but scalable solution. We demonstrate the effectiveness of this approach on real-life data sets.
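The idea of a privacy template can be illustrated with a minimal Python sketch. This is not the authors' algorithm. It assumes that "association" is measured as confidence, that is, the largest fraction of records in any group sharing the same identifying values that carry the sensitive value, and that suppression replaces a domain value with a placeholder symbol. The function names, the threshold of 0.7, and the toy records are all hypothetical.

```python
from collections import Counter

def max_confidence(records, id_attrs, sensitive_attr, sensitive_value):
    """Largest fraction of sensitive records within any group of records
    that share the same values on the identifying attributes.
    (Assumed association measure; the abstract does not define one.)"""
    group_sizes = Counter()
    sensitive_counts = Counter()
    for r in records:
        key = tuple(r[a] for a in id_attrs)
        group_sizes[key] += 1
        if r[sensitive_attr] == sensitive_value:
            sensitive_counts[key] += 1
    return max((sensitive_counts[k] / n for k, n in group_sizes.items()),
               default=0.0)

def suppress(records, attr, values, symbol="*"):
    """Return a copy of the records in which the given domain values of
    `attr` are replaced by a placeholder symbol (hypothetical convention)."""
    return [{**r, attr: symbol} if r[attr] in values else dict(r)
            for r in records]

# Toy data, invented for illustration only.
records = [
    {"job": "engineer", "city": "Burnaby", "disease": "flu"},
    {"job": "engineer", "city": "Burnaby", "disease": "hiv"},
    {"job": "engineer", "city": "Surrey",  "disease": "hiv"},
    {"job": "lawyer",   "city": "Surrey",  "disease": "flu"},
    {"job": "lawyer",   "city": "Burnaby", "disease": "flu"},
    {"job": "lawyer",   "city": "Surrey",  "disease": "flu"},
]

# Hypothetical template: {job, city} must not imply disease = hiv
# with confidence above 0.7.
id_attrs, threshold = ("job", "city"), 0.7

before = max_confidence(records, id_attrs, "disease", "hiv")
masked = suppress(records, "city", {"Burnaby", "Surrey"})
after = max_confidence(masked, id_attrs, "disease", "hiv")

print(before > threshold)  # True: (engineer, Surrey) implies hiv with confidence 1.0
print(after <= threshold)  # True: (engineer, *) implies hiv with confidence 2/3
```

In this toy case, suppressing the city values merges small groups into larger ones, which lowers the confidence of the sensitive inference below the threshold. The sketch says nothing about which values to suppress while preserving classification quality, which is the optimization problem the paper addresses.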
Ke Wang, Benjamin C. M. Fung, Philip S. Yu, "Template-Based Privacy Preservation in Classification Problems", Proceedings of the Fifth IEEE International Conference on Data Mining (ICDM 2005), pp. 466-473, doi:10.1109/ICDM.2005.142