TY - JOUR
T1 - Classifier aided training for semantic segmentation
AU - Ahmed, Ifham Abdul Latheef
AU - Jaward, Mohamed Hisham
N1 - Funding Information:
This work was supported by Malaysian Ministry of Higher Education Exploratory Research Grant Scheme , ERGS/1/2013/TK02/MUSM/03/1 . This work was also supported by the MASSIVE HPC facility ( www.massive.org.au ).
Publisher Copyright:
© 2021 Elsevier Inc.
PY - 2021/7
Y1 - 2021/7
N2 - Semantic segmentation is a prominent problem in scene understanding expressed as a dense labeling task with deep learning models being one of the main methods to solve it. Traditional training algorithms for semantic segmentation models produce less than satisfactory results when not combined with post-processing techniques such as CRFs. In this paper, we propose a method to train segmentation models using an approach which utilizes classification information in the training process of the segmentation network. Our method employs the use of classification network that detects the presence of classes in the segmented output. These class scores are then used to train the segmentation model. This method is motivated by the fact that by conditioning the training of the segmentation model with these scores, higher order features can be captured. Our experiments show significantly improved performance of the segmentation model on the CamVid and CityScapes datasets with no additional post processing.
AB - Semantic segmentation is a prominent problem in scene understanding expressed as a dense labeling task with deep learning models being one of the main methods to solve it. Traditional training algorithms for semantic segmentation models produce less than satisfactory results when not combined with post-processing techniques such as CRFs. In this paper, we propose a method to train segmentation models using an approach which utilizes classification information in the training process of the segmentation network. Our method employs the use of classification network that detects the presence of classes in the segmented output. These class scores are then used to train the segmentation model. This method is motivated by the fact that by conditioning the training of the segmentation model with these scores, higher order features can be captured. Our experiments show significantly improved performance of the segmentation model on the CamVid and CityScapes datasets with no additional post processing.
KW - Computer vision
KW - Deep learning
KW - Scene understanding
KW - Semantic segmentation
UR - http://www.scopus.com/inward/record.url?scp=85108686594&partnerID=8YFLogxK
U2 - 10.1016/j.jvcir.2021.103177
DO - 10.1016/j.jvcir.2021.103177
M3 - Article
AN - SCOPUS:85108686594
SN - 1047-3203
VL - 78
JO - Journal of Visual Communication and Image Representation
JF - Journal of Visual Communication and Image Representation
M1 - 103177
ER -