Abstract
Although much progress has been made in visual emotion recognition, researchers have realized that modern deep networks tend to exploit dataset characteristics to learn spurious statistical associations between the input and the target. Such dataset characteristics are usually treated as dataset bias, which damages the robustness and generalization performance of these recognition systems. In this work, we scrutinize this problem from the perspective of causal inference, where such dataset characteristic is termed as a confounder which misleads the system to learn the spurious correlation. To alleviate the negative effects brought by the dataset bias, we propose a novel Interventional Emotion Recognition Network (IERN) to achieve the backdoor adjustment, which is one fundamental deconfounding technique in causal inference. Specifically, IERN starts by disentangling the dataset-related context feature from the actual emotion feature, where the former forms the confounder. The emotion feature will then be forced to see each confounder stratum equally before being fed into the classifier. A series of designed tests validate the efficacy of IERN, and experiments on three emotion benchmarks demonstrate that IERN outperforms state-of-the-art approaches for unbiased visual emotion recognition.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 30th ACM International Conference on Multimedia |
| Editors | Marco Bertini, Klaus Schoeffmann |
| Place of Publication | New York NY USA |
| Publisher | Association for Computing Machinery (ACM) |
| Pages | 60-69 |
| Number of pages | 10 |
| ISBN (Electronic) | 9781450392037 |
| DOIs | |
| Publication status | Published - 2022 |
| Event | ACM International Conference on Multimedia 2022 - Lisbon, Portugal Duration: 10 Oct 2022 → 14 Oct 2022 Conference number: 30th https://dl.acm.org/doi/proceedings/10.1145/3503161 (Proceedings) https://2022.acmmm.org/ (Website) |
Conference
| Conference | ACM International Conference on Multimedia 2022 |
|---|---|
| Abbreviated title | MM 2022 |
| Country/Territory | Portugal |
| City | Lisbon |
| Period | 10/10/22 → 14/10/22 |
| Internet address |
|
Keywords
- backdoor adjustment
- causal intervention
- dataset bias
- facial expression recognition
- image emotion recognition
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver