Abstract
In this paper, we propose an iterative approach to estimate the frequent itemsets with high accuracy while satisfying the local differential privacy (LDP). The key component behind the improved accuracy of the estimated frequent itemsets by our approach is our novel two-level randomization technique for guaranteeing the LDP. Our randomization technique exploits the correlation of the presence of items in a user's itemset, which has not been considered before. We present a mathematical proof that shows that our approach satisfies the LDP constraint. Extensive experiments are performed to validate the effectiveness and efficiency of our proposed algorithms using real datasets.
Original language | English |
---|---|
Title of host publication | Scientific and Statistical Database Management, 33rd International Conference, SSDBM 2021 Tampa, Florida, USA, July 6 – 7, 2021 Proceedings |
Editors | Qiang Zhu, Xingquan (Hill) Zhu, Yicheng Tu, Zichen (Frank) Xu, Anand Kumar |
Place of Publication | NY New York USA |
Publisher | Association for Computing Machinery (ACM) |
Pages | 232-236 |
Number of pages | 5 |
ISBN (Electronic) | 9781450384131 |
DOIs | |
Publication status | Published - 2021 |
Externally published | Yes |
Event | International Conference on Scientific and Statistical Database Management 2021 - Online, Tampa, United States of America Duration: 6 Jul 2021 → 7 Jul 2021 Conference number: 33rd https://dl.acm.org/doi/proceedings/10.1145/3468791 (Proceedings) https://ssdbm.org/2021/ (Website) |
Conference
Conference | International Conference on Scientific and Statistical Database Management 2021 |
---|---|
Abbreviated title | SSDBM 2021 |
Country/Territory | United States of America |
City | Tampa |
Period | 6/07/21 → 7/07/21 |
Internet address |
|
Keywords
- frequent itemsets mining
- local differential privacy
- small datasets