PRESSGenDB: PRivacy-prEserving substring search on encrypted genomic database

Sara Jafarbeiki, Amin Sakzad, Shabnam Kasra Kermanshahi, Ron Steinfeld, Raj Gaire

Research output: Chapter in Book/Report/Conference proceeding › Conference Paper › Research › peer-review

1 Citation (Scopus)

Abstract

Efficient sequencing methods produce large amounts of genetic data and make it accessible to researchers, which has led genomics to be regarded as a legitimate big data field. Because genomic datasets are large, outsourcing them to the cloud is necessary, and it relieves data owners of local storage management. To maintain confidentiality, data owners encrypt sensitive data before outsourcing it. The sheer size of genomic data, however, makes it challenging to answer researchers' queries both securely and efficiently. In this paper, we propose PRESSGenDB, a method for securely performing string and substring searches on an encrypted dataset of genomic sequences. We leverage searchable symmetric encryption (SSE) and design a new method to handle these queries. In comparison to state-of-the-art methods, PRESSGenDB supports several types of queries over genomic sequences: string search, and substring search with or without a requested start position. Moreover, it supports sequences over a general alphabet rather than only binary sequences of 0s and 1s, and it can search for substrings (patterns) over a whole dataset of genomic sequences rather than a single sequence. Finally, by analytically comparing PRESSGenDB's search complexity with the state of the art, we show that it outperforms recent efficient works.
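The three query types named in the abstract can be illustrated on plaintext data. The following is only a sketch of the query semantics, not of the PRESSGenDB construction: it involves no encryption or SSE index, and the function names, the 0-based position convention, and the toy dataset are assumptions made for illustration.

```python
from typing import Dict, List, Optional, Tuple


def string_search(dataset: Dict[str, str], query: str) -> List[str]:
    """Return the ids of sequences that equal the query exactly."""
    return [seq_id for seq_id, seq in dataset.items() if seq == query]


def substring_search(
    dataset: Dict[str, str],
    pattern: str,
    start: Optional[int] = None,
) -> List[Tuple[str, int]]:
    """Return (sequence id, position) pairs where the pattern occurs.

    Positions are 0-based (an assumption of this sketch). If `start` is
    given, only an occurrence beginning exactly at that position counts;
    otherwise every occurrence in every sequence is reported.
    """
    if not pattern:
        raise ValueError("pattern must be non-empty")
    hits: List[Tuple[str, int]] = []
    for seq_id, seq in dataset.items():
        if start is not None:
            # Substring search with a requested start position.
            if seq.startswith(pattern, start):
                hits.append((seq_id, start))
            continue
        # Substring search without a start position: scan all occurrences,
        # including overlapping ones.
        pos = seq.find(pattern)
        while pos != -1:
            hits.append((seq_id, pos))
            pos = seq.find(pattern, pos + 1)
    return hits


# Hypothetical toy dataset over the alphabet {A, C, G, T}.
dataset = {"s1": "ACGTACGT", "s2": "TTACG", "s3": "ACGT"}

exact = string_search(dataset, "ACGT")
anywhere = substring_search(dataset, "ACG")
at_four = substring_search(dataset, "ACG", start=4)
```

Here `exact` holds the ids of sequences equal to the query, `anywhere` holds every occurrence across the whole dataset, and `at_four` holds only the occurrences that begin at position 4. In the setting the abstract describes, the same answers would have to be produced by the server over encrypted data.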

Original language: English
Title of host publication: INFOCOM WKSHPS 2022 - IEEE Conference on Computer Communications Workshops
Editors: Keshav Sood
Place of Publication: Piscataway NJ USA
Publisher: IEEE, Institute of Electrical and Electronics Engineers
Number of pages: 6
ISBN (Electronic): 9781665409261
ISBN (Print): 9781665409278
Publication status: Published - 2022
Event: International Workshop on Security and Privacy in Big Data (BigSecurity) 2022 - Online, United States of America
Duration: 2 May 2022 to 2 May 2022
Conference number: 10th
https://ieeexplore.ieee.org/xpl/conhome/9797887/proceeding (Proceedings)
http://www.cloud-conf.net/datasec/2022/cfp.html (Website)

Conference

Conference: International Workshop on Security and Privacy in Big Data (BigSecurity) 2022
Country/Territory: United States of America
Period: 2/05/22 to 2/05/22

Keywords

  • big data
  • genomic data privacy
  • privacy and security
  • searchable encryption
  • substring search
