Multimodal compatibility modeling via exploring the consistent and complementary correlations

Weili Guan, Haokun Wen, Xuemeng Song, Chunghsing Yeh, Xiaojun Chang, Liqiang Nie

Research output: Chapter in Book/Report/Conference proceeding › Conference Paper › Research › peer-review

30 Citations (Scopus)

Abstract

Existing methods for outfit compatibility modeling seldom explicitly consider multimodal correlations. In this work, we explore the consistent and complementary correlations for better compatibility modeling. This is, however, non-trivial due to the following challenges: 1) how to separate and model these two kinds of correlations; 2) how to leverage the derived complementary cues to strengthen the text-oriented and vision-oriented representations of the given item; and 3) how to reinforce the compatibility modeling with text-oriented and vision-oriented representations. To address these challenges, we present a comprehensive multimodal outfit compatibility modeling scheme. It first nonlinearly projects each modality into separable consistent and complementary spaces via a multi-layer perceptron, and then models the consistent and complementary correlations between the two modalities by parallel and orthogonal regularization. Thereafter, we strengthen the visual and textual representations of items with complementary information, and further conduct both text-oriented and vision-oriented outfit compatibility modeling. We ultimately employ a mutual learning strategy to reinforce the final performance of compatibility modeling. Extensive experiments demonstrate the superiority of our scheme.
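The projection and regularization steps described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the cosine-based forms of the parallel and orthogonal terms, the Bernoulli symmetric KL used for the mutual learning term, the layer sizes, and every function name here are assumptions chosen only to make the idea concrete.

```python
import numpy as np


def mlp(x, w1, b1, w2, b2):
    """Two-layer perceptron with ReLU: a stand-in for the nonlinear projection."""
    hidden = np.maximum(x @ w1 + b1, 0.0)
    return hidden @ w2 + b2


def init_mlp(rng, d_in, d_hidden, d_out):
    """Random weights for one projection head (hypothetical sizes)."""
    return (
        rng.normal(size=(d_in, d_hidden)) * 0.1,
        np.zeros(d_hidden),
        rng.normal(size=(d_hidden, d_out)) * 0.1,
        np.zeros(d_out),
    )


def cosine(a, b, eps=1e-8):
    """Row-wise cosine similarity between two (batch, dim) arrays."""
    num = np.sum(a * b, axis=1)
    den = np.linalg.norm(a, axis=1) * np.linalg.norm(b, axis=1) + eps
    return num / den


def parallel_loss(v_con, t_con):
    """Assumed form: pull the consistent parts of both modalities parallel."""
    return float(np.mean(1.0 - cosine(v_con, t_con)))


def orthogonal_loss(v_com, t_com):
    """Assumed form: push the complementary parts toward orthogonality."""
    return float(np.mean(cosine(v_com, t_com) ** 2))


def symmetric_kl(p, q, eps=1e-8):
    """Assumed mutual-learning term: symmetric KL between two Bernoulli
    compatibility probabilities (text-oriented vs. vision-oriented)."""
    p = np.clip(p, eps, 1.0 - eps)
    q = np.clip(q, eps, 1.0 - eps)
    kl_pq = p * np.log(p / q) + (1.0 - p) * np.log((1.0 - p) / (1.0 - q))
    kl_qp = q * np.log(q / p) + (1.0 - q) * np.log((1.0 - q) / (1.0 - p))
    return float(np.mean(kl_pq + kl_qp))


# Toy batch: 4 items with 16-d visual and 12-d textual features (made-up sizes).
rng = np.random.default_rng(0)
visual = rng.normal(size=(4, 16))
textual = rng.normal(size=(4, 12))

# Each modality gets one head per space: consistent and complementary.
v_con = mlp(visual, *init_mlp(rng, 16, 32, 8))
v_com = mlp(visual, *init_mlp(rng, 16, 32, 8))
t_con = mlp(textual, *init_mlp(rng, 12, 32, 8))
t_com = mlp(textual, *init_mlp(rng, 12, 32, 8))

# Combined regularizer over the two kinds of cross-modal correlation.
reg = parallel_loss(v_con, t_con) + orthogonal_loss(v_com, t_com)
```

In a training loop, `reg` would be added to the compatibility loss, and `symmetric_kl` would be applied to the compatibility scores predicted by the text-oriented and vision-oriented branches so that each branch learns from the other. How these terms are weighted is not stated in the abstract.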

Original language: English
Title of host publication: Proceedings of the 29th ACM International Conference on Multimedia
Editors: Liqiang Nie, Qianru Sun, Peng Cui
Place of Publication: New York NY USA
Publisher: Association for Computing Machinery (ACM)
Pages: 2299-2307
Number of pages: 9
ISBN (Electronic): 9781450386517
Publication status: Published - 2021
Event: ACM International Conference on Multimedia 2021 - Chengdu, China
Duration: 20 Oct 2021 → 24 Oct 2021
Conference number: 29th
https://dl.acm.org/doi/proceedings/10.1145/3474085 (Proceedings)
https://2021.acmmm.org/ (Website)

Conference

Conference: ACM International Conference on Multimedia 2021
Abbreviated title: MM 2021
Country/Territory: China
City: Chengdu
Period: 20/10/21 → 24/10/21

Keywords

  • compatibility modeling
  • consistency and complementarity
  • multimodal correlations
