Diffusion-based Image translation with label guidance for domain adaptive semantic segmentation

Duo Peng, Ping Hu, Qiuhong Ke, Jun Liu

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

9 Citations (Scopus)

Abstract

Translating images from a source domain to a target domain for learning target models is one of the most common strategies in domain adaptive semantic segmentation (DASS). However, existing methods still struggle to preserve semantically-consistent local details between the original and translated images. In this work, we present an innovative approach that addresses this challenge by using source domain labels as explicit guidance during image translation. Concretely, we formulate cross-domain image translation as a denoising diffusion process and utilize a novel Semantic Gradient Guidance (SGG) method to constrain the translation process, conditioning it on the pixel-wise source labels. Additionally, a Progressive Translation Learning (PTL) strategy is devised to enable the SGG method to work reliably across domains with large gaps. Extensive experiments demonstrate the superiority of our approach over state-of-the-art methods.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023
EditorsFrédéric Jurie, Gaurav Sharma
Place of PublicationPiscataway NJ USA
PublisherIEEE, Institute of Electrical and Electronics Engineers
Pages808-820
Number of pages13
ISBN (Electronic)9798350307184
ISBN (Print)9798350307191
DOIs
Publication statusPublished - 2023
EventIEEE International Conference on Computer Vision 2023 - Paris, France
Duration: 2 Oct 20236 Oct 2023
https://ieeexplore.ieee.org/xpl/conhome/10376473/proceeding (Proceedings)
https://iccv2023.thecvf.com/ (Website)

Conference

ConferenceIEEE International Conference on Computer Vision 2023
Abbreviated titleICCV 2023
Country/TerritoryFrance
CityParis
Period2/10/236/10/23
Internet address

Cite this