LETRIST: Locally Encoded TRansform feature hISTtogram for rotation-invariant texture classification

Tiecheng Song, Hongliang Li, Fanman Meng, Qingbo Wu, Jianfei Cai

Research output: Contribution to journalArticleResearchpeer-review

34 Citations (Scopus)

Abstract

Classifying texture images, especially those with significant rotation, illumination, scale, and viewpoint changes, is a fundamental and challenging problem in computer vision. This paper proposes a simple yet effective image descriptor, called Locally Encoded TRansform feature hISTogram (LETRIST), for texture classification. LETRIST is a histogram representation that explicitly encodes the joint information within an image across feature and scale spaces. The proposed representation is training-free, low-dimensional, yet discriminative and robust for texture description. It consists of the following major steps. First, a set of transform features is constructed to characterize local texture structures and their correlation by applying linear and non-linear operators on the extremum responses of directional Gaussian derivative filters in scale space. Established on the basis of steerable filters, the constructed transform features are exactly rotationally invariant as well as computationally efficient. Second, the scalar quantization via binary or multi-level thresholding is adopted to quantize these transform features into texture codes. Two quantization schemes are designed, both of which are robust to image rotation and illumination changes. Third, the cross-scale joint coding is explored to aggregate the discrete texture codes into a compact histogram representation, i.e., LETRIST. Experimental results on the Outex, CUReT, KTH-TIPS, and UIUC texture data sets show that LETRIST consistently produces better or comparable classification results than the state-of-the-art approaches. Impressively, recognition rates of 100.00% and 99.00% have been achieved on the Outex and KTH-TIPS data sets, respectively. In addition, the noise robustness is evaluated on the Outex and CUReT data sets. The source code is publicly available at https://github.com/stc-cqupt/letrist.

Original languageEnglish
Pages (from-to)1565-1579
Number of pages15
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume28
Issue number7
DOIs
Publication statusPublished - Jul 2018
Externally publishedYes

Keywords

  • local binary pattern (LBP)
  • rotation invariance
  • steerable filter
  • texton
  • texture analysis
  • Texture classification

Cite this