Third International Conference on Availability, Reliability and Security (ARES 2008)
Mar. 4, 2008 to Mar. 7, 2008
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ARES.2008.61
In this paper, we present a novel statistical algorithm for linguistic steganography detection that exploits the distribution of words in the text segment under examination. Linguistic steganography is the art of using written natural language to conceal the very presence of secret messages. Because its carrier is text, the foundational medium of internet communication, linguistic steganography plays an important part in the Information Hiding (IH) area. Previous work has focused mainly on linguistic steganography itself, with little research on linguistic steganalysis; we attempt to help close this gap. In our experiments on detecting three linguistic steganography methods (NICETEXT, TEXTO and Markov-Chain-Based), the total accuracies in distinguishing stego-text segments from normal text segments are 87.39%, 95.51%, 98.50%, 99.15% and 99.57% for segment sizes of 5 kB, 10 kB, 20 kB, 30 kB and 40 kB, respectively. Our research shows that linguistic steganalysis based on the distribution of words is promising.
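The abstract does not spell out which word-distribution statistics the algorithm uses, so the following is only a minimal sketch of one plausible kind of feature, not the authors' method. It assumes that "distribution of words" can be approximated by how widely each repeated word's occurrences are spread across a segment; the function names `word_spread_features` and `segment_summary`, the tokenization rule, and the `min_count` threshold are all hypothetical choices made for illustration.

```python
import re
from collections import defaultdict
from statistics import mean, pvariance


def word_spread_features(segment, min_count=2):
    """Map each repeated word to (mean position, position variance).

    Positions are normalized to [0, 1] over the segment. This is an
    illustrative assumption, not the feature set defined in the paper.
    """
    words = re.findall(r"[a-z']+", segment.lower())
    n = len(words)
    positions = defaultdict(list)
    for i, word in enumerate(words):
        # Normalize the token index so segments of different sizes compare.
        positions[word].append(i / max(n - 1, 1))
    return {
        word: (mean(pos), pvariance(pos))
        for word, pos in positions.items()
        if len(pos) >= min_count
    }


def segment_summary(segment):
    """Summarize a segment as (mean, variance) of its per-word variances.

    A classifier could be trained on such summaries to separate stego-text
    from normal text; the choice of classifier is left open here.
    """
    features = word_spread_features(segment)
    if not features:
        return (0.0, 0.0)
    variances = [var for _, var in features.values()]
    return (mean(variances), pvariance(variances))
```

Under these assumptions, a detector would compute `segment_summary` for labelled normal and stego-text segments of a fixed size (for example 5 kB) and fit a threshold or classifier on the resulting pairs. Larger segments give each word more occurrences, which is consistent with the reported accuracy rising with segment size.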
linguistic steganography, linguistic steganalysis, detection, statistical, distribution of words
Huang Liu-sheng, Li Ling-jun, Yang Wei, Yu Zhen-shan, Chen Zhi-li, "A Statistical Algorithm for Linguistic Steganography Detection Based on Distribution of Words", Third International Conference on Availability, Reliability and Security (ARES 2008), pp. 558-563, 2008, doi:10.1109/ARES.2008.61