TY - JOUR
T1 - A new Histogram Oriented Moments descriptor for multi-oriented moving text detection in video
AU - Khare, Vijeta
AU - Shivakumara, Palaiahnakote
AU - Raveendran, Paramesran
N1 - Funding Information:
We acknowledge the University of Malaya for funding this work. The research has been carried out under HIR Grant (UM.C/625/1/HIR/MOHE/ENG/42).
Publisher Copyright:
© 2015 Elsevier Ltd.
PY - 2015/6/25
Y1 - 2015/6/25
AB - Developing an expert text detection system for video indexing and retrieval is challenging because of the low resolution, complex backgrounds, non-uniform illumination, and movement of text in video. Text detection is also vital for several real-time applications, such as license plate recognition, assisting a blind person, and other surveillance applications. In this paper, we introduce a new descriptor called Histogram Oriented Moments (HOM) for text detection in video, which is invariant to rotation, scaling, font, and font size variations. HOM computes orientations from the second-order geometric moments of each sliding window (overlapped block) of the input frame. The proposed method then performs histogram operations on the orientations of each window to identify the dominant orientation, which serves as the window's representative. Next, a new hypothesis is defined on the dominant orientations of a connected component: for text, the number of dominant orientations that point towards the centroid of the connected component is larger than the number that point away from it. Components that satisfy this hypothesis are considered text candidates; the rest are considered non-text candidates. Further, to detect moving text, we explore optical flow properties, such as the velocity of text candidates, to estimate motion between temporal frames. Components that move with constant velocity and uniform direction are retained as text candidates; the others are treated as non-text candidates. We show that the proposed method outperforms state-of-the-art methods on the ICDAR 2013 benchmark database and on our own video datasets in terms of recall, precision, and F-measure.
KW - Central moments
KW - Histogram Oriented Gradients
KW - Histogram Oriented Moments
KW - Moving caption text detection
KW - Optical flow
KW - Video text detection
UR - http://www.scopus.com/inward/record.url?scp=84934963997&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2015.06.002
DO - 10.1016/j.eswa.2015.06.002
M3 - Article
AN - SCOPUS:84934963997
SN - 0957-4174
VL - 42
SP - 7627
EP - 7640
JO - Expert Systems with Applications
JF - Expert Systems with Applications
IS - 21
M1 - 10079
ER -
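
The abstract describes estimating an orientation from the second-order moments of each window and then taking the dominant orientation from a histogram. The following is a minimal sketch of that idea, not the authors' implementation: it assumes the textbook principal-axis formula theta = 0.5 * atan2(2*mu11, mu20 - mu02) over central moments, and the function names, the bin count, and the use of pixel intensities as weights are all hypothetical choices made for illustration.

```python
import numpy as np


def window_orientation(patch):
    """Principal-axis angle (radians) from second-order central moments.

    Assumption: pixel intensities act as weights; the paper may weight
    differently. Returns None for an all-zero window.
    """
    patch = np.asarray(patch, dtype=float)
    m00 = patch.sum()
    if m00 == 0:
        return None
    ys, xs = np.mgrid[: patch.shape[0], : patch.shape[1]]
    xc = (xs * patch).sum() / m00  # intensity-weighted centroid, x
    yc = (ys * patch).sum() / m00  # intensity-weighted centroid, y
    mu20 = ((xs - xc) ** 2 * patch).sum()
    mu02 = ((ys - yc) ** 2 * patch).sum()
    mu11 = ((xs - xc) * (ys - yc) * patch).sum()
    return 0.5 * np.arctan2(2.0 * mu11, mu20 - mu02)


def dominant_orientation(angles, bins=8):
    """Centre of the most populated histogram bin over [-pi/2, pi/2].

    The bin count is an illustrative default, not a value from the paper.
    """
    hist, edges = np.histogram(angles, bins=bins, range=(-np.pi / 2, np.pi / 2))
    k = int(np.argmax(hist))
    return 0.5 * (edges[k] + edges[k + 1])


# Usage: a horizontal stroke and a diagonal stroke in 9x9 windows.
horizontal = np.zeros((9, 9))
horizontal[4, :] = 1.0
diagonal = np.eye(9)
angle_h = window_orientation(horizontal)
angle_d = window_orientation(diagonal)
dominant = dominant_orientation([0.10, 0.12, 0.11, -1.0])
```

Angles are measured in image coordinates (y increasing downwards), so the diagonal of an identity matrix is reported as a positive angle. The centroid-direction hypothesis and the optical-flow stage mentioned in the abstract are not covered by this sketch.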