Abstract
In this paper, we propose a practical privacy-preserving generative model for data sanitization and sharing, called Sanitizer-Variational Autoencoder (SVAE). We assume that the data consists of identification-relevant and irrelevant components. A variational autoencoder (VAE) based sanitization model is proposed to strip the identification-relevant features and only retain identification-irrelevant components in a privacy-preserving manner. The sanitization allows for task-relevant discrimination (utility) but minimizes the personal identification information leakage (privacy). We conduct extensive empirical evaluations on the real-world face, biometric signal and speech datasets, and validate the effectiveness of our proposed SVAE, as well as the robustness against the membership inference attack.
Original language | English |
---|---|
Title of host publication | Web Information Systems Engineering – WISE 2020 |
Subtitle of host publication | 21st International Conference Amsterdam, The Netherlands, October 20–24, 2020 Proceedings, Part II |
Editors | Zhisheng Huang, Wouter Beek, Hua Wang, Rui Zhou, Yanchun Zhang |
Place of Publication | Cham Switzerland |
Publisher | Springer |
Pages | 185-200 |
Number of pages | 16 |
ISBN (Electronic) | 9783030620080 |
ISBN (Print) | 9783030620073 |
DOIs | |
Publication status | Published - 2020 |
Event | International Conference on Web Information Systems Engineering 2020 - Amsterdam, Netherlands Duration: 20 Oct 2020 → 24 Oct 2020 Conference number: 21st https://link.springer.com/book/10.1007/978-3-030-62008-0 (Proceedings) http://wasp.cs.vu.nl/WISE2020/ (Website) |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer |
Number | Part II |
Volume | 12343 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | International Conference on Web Information Systems Engineering 2020 |
---|---|
Abbreviated title | WISE 2020 |
Country | Netherlands |
City | Amsterdam |
Period | 20/10/20 → 24/10/20 |
Internet address |
|
Keywords
- Data sharing
- Deep learning
- Generative model
- Privacy-preserving
- Variational autoencoder