Big data in genomics

Huaming Chen, Jiangning Song, Jun Shen, Lei Wang

Research output: Chapter in Book/Report/Conference proceeding; Chapter (Book); Other; peer-review


High-throughput technologies in biology have given academia and industry an enormous amount of “omics” data, including genomics and proteomics data. This chapter focuses mainly on genomics data. Drawing on developments in “Big Data” and on the domain knowledge driven by genomics data, we discuss two downstream areas: precision medicine and cancer genomics. We also view genomics data from the “Big Data” perspective and present a comprehensive “life cycle” for these data. Two significant, state-of-the-art case studies in genomics data, ENCODE and CGHub, are presented; both show inspiring and interesting results from integrating big data analytics technology with genomics data. The life sciences, biomedicine, and health care sectors are at a turning point toward data-intensive science. Given the wealth of genomics data now available, big data analytics shows promising potential to deliver a better understanding and improvement of our lives.

Original language: English
Title of host publication: Big Data Management and Processing
Editors: Kuan-Ching Li, Hai Jiang, Albert Y. Zomaya
Publisher: CRC Press
Number of pages: 22
ISBN (Electronic): 9781498768085
ISBN (Print): 9781498768078
Publication status: Published - 2017
