TY - JOUR
T1 - UniSpaCh
T2 - a text-based data hiding method using unicode space characters
AU - Por, Lip Yee
AU - Wong, Koksheik
AU - Chee, Kok Onn
N1 - Copyright:
Copyright 2017 Elsevier B.V., All rights reserved.
PY - 2012/5
Y1 - 2012/5
N2 - This paper proposes a text-based data hiding method to insert external information into Microsoft Word document. First, the drawback of low embedding efficiency in the existing text-based data hiding methods is addressed, and a simple attack, DASH, is proposed to reveal the information inserted by the existing text-based data hiding methods. Then, a new data hiding method, UniSpaCh, is proposed to counter DASH. The characteristics of Unicode space characters with respect to embedding efficiency and DASH are analyzed, and the selected Unicode space characters are inserted into inter-sentence, inter-word, end-of-line and inter-paragraph spacings to encode external information while improving embedding efficiency and imperceptivity of the embedded information. UniSpaCh is also reversible where the embedded information can be removed to completely reconstruct the original Microsoft Word document. Experiments were carried out to verify the performance of UniSpaCh as well as comparing it to the existing space-manipulating data hiding methods. Results suggest that UniSpaCh offers higher embedding efficiency while exhibiting higher imperceptivity of white space manipulation when compared to the existing methods considered. In the best case scenario, UniSpaCh produces output document of size almost 9 times smaller than that of the existing method.
AB - This paper proposes a text-based data hiding method to insert external information into Microsoft Word document. First, the drawback of low embedding efficiency in the existing text-based data hiding methods is addressed, and a simple attack, DASH, is proposed to reveal the information inserted by the existing text-based data hiding methods. Then, a new data hiding method, UniSpaCh, is proposed to counter DASH. The characteristics of Unicode space characters with respect to embedding efficiency and DASH are analyzed, and the selected Unicode space characters are inserted into inter-sentence, inter-word, end-of-line and inter-paragraph spacings to encode external information while improving embedding efficiency and imperceptivity of the embedded information. UniSpaCh is also reversible where the embedded information can be removed to completely reconstruct the original Microsoft Word document. Experiments were carried out to verify the performance of UniSpaCh as well as comparing it to the existing space-manipulating data hiding methods. Results suggest that UniSpaCh offers higher embedding efficiency while exhibiting higher imperceptivity of white space manipulation when compared to the existing methods considered. In the best case scenario, UniSpaCh produces output document of size almost 9 times smaller than that of the existing method.
KW - DASH
KW - Data hiding
KW - Space manipulation
KW - Unicode character
KW - UniSpaCh
UR - http://www.scopus.com/inward/record.url?scp=84865743341&partnerID=8YFLogxK
U2 - 10.1016/j.jss.2011.12.023
DO - 10.1016/j.jss.2011.12.023
M3 - Article
AN - SCOPUS:84865743341
SN - 0164-1212
VL - 85
SP - 1075
EP - 1082
JO - Journal of Systems and Software
JF - Journal of Systems and Software
IS - 5
ER -