Frequent Itemsets Mining with a Guaranteed Local Differential Privacy in Small Datasets

Sharmin Afrose, Tanzima Hashem, Mohammed Eunus Ali

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

8 Citations (Scopus)

Abstract

In this paper, we propose an iterative approach to estimate the frequent itemsets with high accuracy while satisfying the local differential privacy (LDP). The key component behind the improved accuracy of the estimated frequent itemsets by our approach is our novel two-level randomization technique for guaranteeing the LDP. Our randomization technique exploits the correlation of the presence of items in a user's itemset, which has not been considered before. We present a mathematical proof that shows that our approach satisfies the LDP constraint. Extensive experiments are performed to validate the effectiveness and efficiency of our proposed algorithms using real datasets.

Original languageEnglish
Title of host publicationScientific and Statistical Database Management, 33rd International Conference, SSDBM 2021 Tampa, Florida, USA, July 6 – 7, 2021 Proceedings
EditorsQiang Zhu, Xingquan (Hill) Zhu, Yicheng Tu, Zichen (Frank) Xu, Anand Kumar
Place of PublicationNY New York USA
PublisherAssociation for Computing Machinery (ACM)
Pages232-236
Number of pages5
ISBN (Electronic)9781450384131
DOIs
Publication statusPublished - 2021
Externally publishedYes
EventInternational Conference on Scientific and Statistical Database Management 2021 - Online, Tampa, United States of America
Duration: 6 Jul 20217 Jul 2021
Conference number: 33rd
https://dl.acm.org/doi/proceedings/10.1145/3468791 (Proceedings)
https://ssdbm.org/2021/ (Website)

Conference

ConferenceInternational Conference on Scientific and Statistical Database Management 2021
Abbreviated titleSSDBM 2021
Country/TerritoryUnited States of America
CityTampa
Period6/07/217/07/21
Internet address

Keywords

  • frequent itemsets mining
  • local differential privacy
  • small datasets

Cite this