Abstract
Unprecedented volumes of location-based information have been produced as a result of the widespread adoption of social network applications and GPS-enabled devices and sensors. Publication of such location data can provide valuable resources for researchers and government agencies in applications ranging from near real-time population-wide health monitoring to planning for future cities. However, such data hold personally identifying information, which gives rise to many privacy issues. There is thus a pressing need for ways to restrict this inherently identifying location-related information, however ideally we would like to preserve the utility of the data. Importantly, any such solution has to be scalable to large population-wide data scenarios. To tackle this, we introduce a novel differentially private hierarchical location sanitization (DPHLS) approach based on the concept '(α, r)-dataset' implemented through a Variable Order Mobility Markov Model (VO3M). We show how this system allows individual locations in personal trajectories to be protected using selection and frequency perturbation mechanisms using the '(α, r)-dataset', leveraging past (published) location histories to obfuscate the user location in a flexible and controllable manner. The effectiveness and efficiency of the proposed solution is evaluated through the big data experiments that have been carried out using an OpenStack-based Cloud and Apache Sparkbased platform utilising large-scale social media trajectories. The experimental results suggest that the privacy publication algorithm can successfully scale to big data scenarios whilst retaining the utility of the datasets (trajectories) and preserving individual user privacy.
Original language | English |
---|---|
Title of host publication | Proceedings - 2016 IEEE International Conference on Big Data |
Editors | James Joshi, George Karypis, Ling Liu, Xiaohua Hu, Ronay Ak, Yinglong Xia, Weijia Xu, Aki-Hiro Sato, Sudarsan Rachuri, Lyle Ungar, Philip S. Yu, Rama Govindaraju, Toyotaro Suzumura |
Place of Publication | Piscataway NJ USA |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 1143-1150 |
Number of pages | 8 |
ISBN (Electronic) | 9781467390040, 9781467390057 |
ISBN (Print) | 9781467390064 |
DOIs | |
Publication status | Published - 2016 |
Externally published | Yes |
Event | IEEE International Congress on Big Data 2016 - San Francisco, United States of America Duration: 27 Jun 2016 → 2 Jul 2016 Conference number: 5th http://www.ieeebigdata.org/2016/ https://ieeexplore.ieee.org/xpl/conhome/7584314/proceeding (Proceedings) |
Conference
Conference | IEEE International Congress on Big Data 2016 |
---|---|
Abbreviated title | BigData Congress 2016 |
Country/Territory | United States of America |
City | San Francisco |
Period | 27/06/16 → 2/07/16 |
Internet address |
Keywords
- big data
- differential privacy
- location privacy
- social media