TY - JOUR
T1 - Spatiotemporal feature extraction for facial expression recognition
AU - Kamarol, Siti Khairuni Amalina
AU - Jaward, Mohamed Hisham
AU - Parkkinen, Jussi
AU - Parthiban, Rajendran
N1 - Publisher Copyright:
© The Institution of Engineering and Technology 2016.
Copyright:
Copyright 2018 Elsevier B.V., All rights reserved.
PY - 2016/7
Y1 - 2016/7
N2 - A key issue regarding feature extraction is the capability of a technique to extract distinctive features to represent facial expressions while requiring a low computational complexity. In this study, the authors propose a novel approach for appearance-based facial feature extraction to perform the task of facial expression recognition on video sequences. The proposed spatiotemporal texture map (STTM) is capable of capturing subtle spatial and temporal variations of facial expressions with low computational complexity. First, face is detected using Viola-Jones face detector and frames are cropped to remove unnecessary background. Facial features are then modelled with the proposed STTM, which uses the spatiotemporal information extracted from three-dimensional Harris corner function. A block-based method is adopted to extract the dynamic features and represent the features in the form of histograms. The features are then classified into classes of emotion by the support vector machine classifier. The experimental results demonstrate that the proposed approach shows superior performance compared with the state-of-the-art approaches with an average recognition rate of 95.37, 98.56, and 84.52% on datasets containing posed expressions, spontaneous microexpressions, and close-to-real-world expressions, respectively. They also show that the proposed algorithm requires low computational cost.
AB - A key issue regarding feature extraction is the capability of a technique to extract distinctive features to represent facial expressions while requiring a low computational complexity. In this study, the authors propose a novel approach for appearance-based facial feature extraction to perform the task of facial expression recognition on video sequences. The proposed spatiotemporal texture map (STTM) is capable of capturing subtle spatial and temporal variations of facial expressions with low computational complexity. First, face is detected using Viola-Jones face detector and frames are cropped to remove unnecessary background. Facial features are then modelled with the proposed STTM, which uses the spatiotemporal information extracted from three-dimensional Harris corner function. A block-based method is adopted to extract the dynamic features and represent the features in the form of histograms. The features are then classified into classes of emotion by the support vector machine classifier. The experimental results demonstrate that the proposed approach shows superior performance compared with the state-of-the-art approaches with an average recognition rate of 95.37, 98.56, and 84.52% on datasets containing posed expressions, spontaneous microexpressions, and close-to-real-world expressions, respectively. They also show that the proposed algorithm requires low computational cost.
UR - http://www.scopus.com/inward/record.url?scp=84975086205&partnerID=8YFLogxK
U2 - 10.1049/iet-ipr.2015.0519
DO - 10.1049/iet-ipr.2015.0519
M3 - Article
AN - SCOPUS:84975086205
SN - 1751-9659
VL - 10
SP - 534
EP - 541
JO - IET Image Processing
JF - IET Image Processing
IS - 7
ER -