2008 IEEE 24th International Conference on Data Engineering A Hybrid Approach to Private Record Linkage Cancun, Mexico April 07-April 12 ISBN: 978-1-4244-1836-7
Real-world entities are not always represented by the same set of features in different data sets. Therefore matching and linking records corresponding to the same real-world entity distributed across these data sets is a challenging task. If the data sets contain private information, the problem becomes even harder due to privacy concerns. Existing solutions of this problem mostly follow two approaches: sanitization techniques and cryptographic techniques. The former achieves privacy by perturbing sensitive data at the expense of degrading matching accuracy. The later, on the other hand, attains both privacy and high accuracy under heavy communication and computation costs. In this paper, we propose a method that combines these two approaches and enables users to trade off between privacy, accuracy and cost. Experiments conducted on real data sets show that our method has significantly lower costs than cryptographic techniques and yields much more accurate matching results compared to sanitization techniques, even when the data sets are perturbed extensively.
Citation:
Ali Inan, Murat Kantarcioglu, Elisa Bertino, Monica Scannapieco, "A Hybrid Approach to Private Record Linkage," icde, pp.496-505, 2008 IEEE 24th International Conference on Data Engineering, 2008 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||