2008 IEEE 24th International Conference on Data Engineering
A Hybrid Approach to Private Record Linkage
Cancun, Mexico
April 07-April 12
ISBN: 978-1-4244-1836-7
Ali Inan, Department of Computer Science, The University of Texas at Dallas, Richardson, TX 75083, USA. inan@student.utdallas.edu
Murat Kantarcioglu, Department of Computer Science, The University of Texas at Dallas, Richardson, TX 75083, USA. muratk@utdallas.edu
Elisa Bertino, Department of Computer Sciences, Purdue University, West Lafayette, IN 47907, USA. bertino@cs.purdue.edu
Monica Scannapieco, Dipartimento di Informatica e Sistemistica, Universita di Roma "La Sapienza", Roma 00198, Italy. monscan@dis.uniroma1.it
Real-world entities are not always represented by the same set of features in different data sets. Therefore, matching and linking records corresponding to the same real-world entity distributed across these data sets is a challenging task. If the data sets contain private information, the problem becomes even harder due to privacy concerns. Existing solutions to this problem mostly follow two approaches: sanitization techniques and cryptographic techniques. The former achieves privacy by perturbing sensitive data at the expense of degrading matching accuracy. The latter, on the other hand, attains both privacy and high accuracy under heavy communication and computation costs. In this paper, we propose a method that combines these two approaches and enables users to trade off between privacy, accuracy and cost. Experiments conducted on real data sets show that our method has significantly lower costs than cryptographic techniques and yields much more accurate matching results compared to sanitization techniques, even when the data sets are perturbed extensively.
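The trade-off described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm. It assumes a single integer attribute, sanitization by generalizing each value into a fixed-width bucket, and a match rule based on an absolute-difference threshold. The names `generalize`, `classify` and `link` are hypothetical, and the plaintext comparison in `link` only stands in for a cryptographic protocol, which would not reveal the raw values.

```python
from typing import List, Tuple

Interval = Tuple[int, int]


def generalize(value: int, width: int) -> Interval:
    """Sanitize a value by replacing it with the [lo, hi] bucket that contains it."""
    lo = (value // width) * width
    return (lo, lo + width - 1)


def classify(a: Interval, b: Interval, threshold: int) -> str:
    """Label a pair using only the generalized values, when that is possible."""
    # Smallest and largest distance any two values in the buckets could have.
    min_dist = max(0, a[0] - b[1], b[0] - a[1])
    max_dist = max(a[1] - b[0], b[1] - a[0])
    if max_dist <= threshold:
        return "match"
    if min_dist > threshold:
        return "non-match"
    return "undecided"


def link(left: List[int], right: List[int], width: int,
         threshold: int) -> Tuple[List[Tuple[int, int]], int]:
    """Return matching index pairs and the number of costly exact comparisons."""
    matches = []
    costly = 0
    for i, x in enumerate(left):
        for j, y in enumerate(right):
            label = classify(generalize(x, width), generalize(y, width), threshold)
            if label == "undecided":
                costly += 1
                # ASSUMPTION: placeholder for a secure two-party comparison;
                # a real protocol would not expose x and y in plaintext.
                label = "match" if abs(x - y) <= threshold else "non-match"
            if label == "match":
                matches.append((i, j))
    return matches, costly


if __name__ == "__main__":
    left, right = [23, 47, 61], [25, 80]
    for width in (1, 10, 100):
        print(width, link(left, right, width, threshold=2))
```

In this toy setting, wider buckets hide more about each value but leave more pairs undecided, so more of them fall through to the expensive step. Accuracy is unchanged because every undecided pair is resolved exactly.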
Citation:
Ali Inan, Murat Kantarcioglu, Elisa Bertino, Monica Scannapieco, "A Hybrid Approach to Private Record Linkage," 2008 IEEE 24th International Conference on Data Engineering (ICDE), pp. 496-505, 2008