|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2008 IEEE 24th International Conference on Data Engineering
On the Anonymization of Sparse High-Dimensional Data
Cancun, Mexico
April 07-April 12
ISBN: 978-1-4244-1836-7
| ASCII Text | x | ||
| Gabriel Ghinita, Yufei Tao, Panos Kalnis, "On the Anonymization of Sparse High-Dimensional Data," Data Engineering, International Conference on, pp. 715-724, 2008 IEEE 24th International Conference on Data Engineering, 2008. | |||
| BibTex | x | ||
| @article{ 10.1109/ICDE.2008.4497480, author = {Gabriel Ghinita and Yufei Tao and Panos Kalnis}, title = {On the Anonymization of Sparse High-Dimensional Data}, journal ={Data Engineering, International Conference on}, volume = {0}, year = {2008}, isbn = {978-1-4244-1836-7}, pages = {715-724}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICDE.2008.4497480}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Data Engineering, International Conference on TI - On the Anonymization of Sparse High-Dimensional Data SN - 978-1-4244-1836-7 SP715 EP724 A1 - Gabriel Ghinita, A1 - Yufei Tao, A1 - Panos Kalnis, PY - 2008 VL - 0 JA - Data Engineering, International Conference on ER - | |||
Existing research on privacy-preserving data publishing focuses on relational data: in this context, the objective is to enforce privacy-preserving paradigms, such as k-anonymity and l-diversity, while minimizing the information loss incurred in the anonymizing process (i.e. maximize data utility). However, existing techniques adopt an indexing-or clustering-based approach, and work well for fixed-schema data, with low dimensionality. Nevertheless, certain applications require privacy-preserving publishing of transaction data (or basket data), which involves hundreds or even thousands of dimensions, rendering existing methods unusable. We propose a novel anonymization method for sparse high-dimensional data. We employ a particular representation that captures the correlation in the underlying data, and facilitates the formation of anonymized groups with low information loss. We propose an efficient anonymization algorithm based on this representation. We show experimentally, using real-life datasets, that our method clearly outperforms existing state-of-the-art in terms of both data utility and computational overhead.
Citation:
Gabriel Ghinita, Yufei Tao, Panos Kalnis, "On the Anonymization of Sparse High-Dimensional Data," icde, pp.715-724, 2008 IEEE 24th International Conference on Data Engineering, 2008
Usage of this product signifies your acceptance of the Terms of Use.
