Leveraging Stacked Denoising Autoencoder in Prediction of Pathogen-Host Protein-Protein Interactions

Huaming Chen, Jun Shen, Lei Wang, Jiangning Song

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

12 Citations (Scopus)

Abstract

In big data research related to bioinformatics, one of the most critical areas is proteomics. In this paper, we focus on the protein-protein interactions, especially on pathogen-host protein-protein interactions (PHPPIs), which reveals the critical molecular process in biology. Conventionally, biologists apply in-lab methods, including small-scale biochemical, biophysical, genetic experiments and large-scale experiment methods (e.g. yeast-two-hybrid analysis), to identify the interactions. These in-lab methods are time consuming and labor intensive. Since the interactions between proteins from different species play very critical roles for both the infectious diseases and drug design, the motivation behind this study is to provide a basic framework for biologists, which is based on big data analytics and deep learning models. Our work contributes in leveraging unsupervised learning model, in which we focus on stacked denoising autoencoders, to achieve a more efficient prediction performance on PHPPI. In this paper, we further detail the framework based on unsupervised learning model for PHPPI researches, while curating a large imbalanced PHPPI dataset. Our model demonstrates a better result with the unsupervised learning model on PHPPI dataset.

Original languageEnglish
Title of host publication2017 IEEE International Congress on Big Data
Subtitle of host publicationBigData Congress
EditorsGeorge Karypis, Jia Zhang
PublisherIEEE, Institute of Electrical and Electronics Engineers
Pages368-375
Number of pages8
Edition1st
ISBN (Electronic)9781538619964
DOIs
Publication statusPublished - 7 Sept 2017
EventIEEE International Congress on Big Data 2017 - Honolulu, United States of America
Duration: 25 Jun 201730 Jun 2017
Conference number: 6th
https://ieeexplore.ieee.org/xpl/conhome/8027154/proceeding (Proceedings)

Conference

ConferenceIEEE International Congress on Big Data 2017
Abbreviated titleBigData Congress 2017
Country/TerritoryUnited States of America
CityHonolulu
Period25/06/1730/06/17
Internet address

Keywords

  • big data
  • denoising autoencoder
  • machine learning
  • PHPPI
  • prediction

Cite this