The Community for Technology Leaders
2017 IEEE Symposium on Computers and Communications (ISCC) (2017)
Heraklion, Greece
July 3, 2017 to July 6, 2017
ISBN: 978-1-5386-1630-7
pp: 890-897
Gianpiero Costantino , Istituto di Informatica e Telematica, Consiglio Nazionale delle Ricerche, Pisa, Italy
Antonio La Marra , Istituto di Informatica e Telematica, Consiglio Nazionale delle Ricerche, Pisa, Italy
Fabio Martinelli , Istituto di Informatica e Telematica, Consiglio Nazionale delle Ricerche, Pisa, Italy
Andrea Saracino , Istituto di Informatica e Telematica, Consiglio Nazionale delle Ricerche, Pisa, Italy
Mina Sheikhalishahi , Istituto di Informatica e Telematica, Consiglio Nazionale delle Ricerche, Pisa, Italy
ABSTRACT
Text mining is the process to automatically infer relevant information from semantically related text documents. This technique, which has applications from business intelligence to homeland security, terrorism and crime fight, might bring noticeable privacy issues when analyzed documents contain privacy sensitive information. In this paper, we propose a framework for privacy-preserving text analysis, which exploits Homomorphic Encryption, to analyze text documents in a privacy preserving manner. The proposed framework is designed to ensure that there is no disclosure of privacy sensitive information contained in the document to any party, including the analysis engine itself. Furthermore, we present two use cases of analysis based on bag-of-words classification, where the proposed framework manages to obtain good classification results without information disclosure. In particular the two different settings that are considered are: tweet analysis for detection of terrorist Twitter accounts, and out-box email analysis for detection of bot infected devices. Accuracy results with different classifiers, performances and a security analysis of our approach are presented and discussed.
INDEX TERMS
Encryption, Data privacy, Engines, Privacy, Text analysis, Data analysis
CITATION

G. Costantino, A. La Marra, F. Martinelli, A. Saracino and M. Sheikhalishahi, "Privacy-preserving text mining as a service," 2017 IEEE Symposium on Computers and Communications (ISCC), Heraklion, Greece, 2017, pp. 890-897.
doi:10.1109/ISCC.2017.8024639
92 ms
(Ver 3.3 (11022016))