Effect of training class label noise on classification performances for land cover mapping with satellite image time series

Charlotte Pelletier, Silvia Valero, Jordi Inglada, Nicolas Champion, Claire Marais Sicre, Gérard Dedieu

Research output: Contribution to journal, Article, Research, peer-review

73 Citations (Scopus)

Abstract

Supervised classification systems used for land cover mapping require accurate reference databases. These reference data generally come from different sources such as field measurements, thematic maps, or aerial photographs. Due to misregistration, update delay, or land cover complexity, they may contain class label noise, i.e., wrong label assignments. This study evaluates the impact of mislabeled training data on classification performance for land cover mapping. In particular, it addresses random and systematic label noise in the classification of high-resolution satellite image time series. Experiments are carried out on synthetic and real datasets with two traditional classifiers: Support Vector Machines (SVM) and Random Forests (RF). A synthetic dataset simulating vegetation profiles over one year was designed for this study. The real dataset is composed of Landsat-8 and SPOT-4 images acquired during one year in the south of France. The results show that both classifiers are little affected by random noise levels up to 25%-30%, but their performance drops at higher noise levels. Different classification configurations are tested by increasing the number of classes, using different input feature vectors, and changing the number of training instances. Algorithm complexities are also analyzed. The RF classifier achieves high robustness to random and systematic label noise for all the tested configurations, whereas the SVM classifier is more sensitive to the kernel choice and to the input feature vectors. Finally, this work reveals that the cross-validation procedure is impacted by the presence of class label noise.
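The two kinds of label noise named in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the function names, the class names ("wheat", "maize", "forest"), and the choice to flip an exact fraction of labels are all assumptions made for illustration. Random noise is modelled here as reassigning a fraction of all training labels to any other class chosen uniformly, and systematic noise as relabeling a fraction of one class as one specific other class.

```python
import random


def add_random_label_noise(labels, noise_level, classes, seed=0):
    """Return a copy of `labels` in which a fraction `noise_level` of the
    samples is reassigned to a different class chosen uniformly at random.
    (Hypothetical helper, not taken from the paper.)"""
    rng = random.Random(seed)
    noisy = list(labels)
    n_flip = round(noise_level * len(noisy))
    for i in rng.sample(range(len(noisy)), n_flip):
        # Exclude the current label so every selected sample really changes.
        noisy[i] = rng.choice([c for c in classes if c != noisy[i]])
    return noisy


def add_systematic_label_noise(labels, noise_level, source, target, seed=0):
    """Return a copy of `labels` in which a fraction `noise_level` of the
    samples labeled `source` is relabeled as `target`.
    (Hypothetical helper, not taken from the paper.)"""
    rng = random.Random(seed)
    noisy = list(labels)
    source_idx = [i for i, y in enumerate(noisy) if y == source]
    n_flip = round(noise_level * len(source_idx))
    for i in rng.sample(source_idx, n_flip):
        noisy[i] = target
    return noisy


# Toy training labels; the class names are illustrative assumptions.
classes = ["wheat", "maize", "forest"]
labels = ["wheat"] * 50 + ["maize"] * 30 + ["forest"] * 20

random_noisy = add_random_label_noise(labels, 0.2, classes)
systematic_noisy = add_systematic_label_noise(labels, 0.5, "wheat", "maize")

n_changed_random = sum(a != b for a, b in zip(labels, random_noisy))
n_changed_systematic = sum(a != b for a, b in zip(labels, systematic_noisy))
```

In an experiment of the kind described, a classifier would be trained on the noisy labels and evaluated against clean reference labels, repeating the process over a range of noise levels.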

Original language: English
Article number: 173
Number of pages: 24
Journal: Remote Sensing
Volume: 9
Issue number: 2
Publication status: Published - 18 Feb 2017
Externally published: Yes

Keywords

  • Class label noise
  • Classification
  • Land cover mapping
  • Mislabeled training data
  • Random Forests
  • Satellite image time series
  • Support Vector Machines
