Abstract
Translating images from a source domain to a target domain for learning target models is one of the most common strategies in domain adaptive semantic segmentation (DASS). However, existing methods still struggle to preserve semantically-consistent local details between the original and translated images. In this work, we present an innovative approach that addresses this challenge by using source domain labels as explicit guidance during image translation. Concretely, we formulate cross-domain image translation as a denoising diffusion process and utilize a novel Semantic Gradient Guidance (SGG) method to constrain the translation process, conditioning it on the pixel-wise source labels. Additionally, a Progressive Translation Learning (PTL) strategy is devised to enable the SGG method to work reliably across domains with large gaps. Extensive experiments demonstrate the superiority of our approach over state-of-the-art methods.
Original language | English |
---|---|
Title of host publication | Proceedings - 2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023 |
Editors | Frédéric Jurie, Gaurav Sharma |
Place of Publication | Piscataway NJ USA |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 808-820 |
Number of pages | 13 |
ISBN (Electronic) | 9798350307184 |
ISBN (Print) | 9798350307191 |
DOIs | |
Publication status | Published - 2023 |
Event | IEEE International Conference on Computer Vision 2023 - Paris, France Duration: 2 Oct 2023 → 6 Oct 2023 https://ieeexplore.ieee.org/xpl/conhome/10376473/proceeding (Proceedings) https://iccv2023.thecvf.com/ (Website) |
Conference
Conference | IEEE International Conference on Computer Vision 2023 |
---|---|
Abbreviated title | ICCV 2023 |
Country/Territory | France |
City | Paris |
Period | 2/10/23 → 6/10/23 |
Internet address |
|