Abstract
In big data research related to bioinformatics, one of the most critical areas is proteomics. In this paper, we focus on the protein-protein interactions, especially on pathogen-host protein-protein interactions (PHPPIs), which reveals the critical molecular process in biology. Conventionally, biologists apply in-lab methods, including small-scale biochemical, biophysical, genetic experiments and large-scale experiment methods (e.g. yeast-two-hybrid analysis), to identify the interactions. These in-lab methods are time consuming and labor intensive. Since the interactions between proteins from different species play very critical roles for both the infectious diseases and drug design, the motivation behind this study is to provide a basic framework for biologists, which is based on big data analytics and deep learning models. Our work contributes in leveraging unsupervised learning model, in which we focus on stacked denoising autoencoders, to achieve a more efficient prediction performance on PHPPI. In this paper, we further detail the framework based on unsupervised learning model for PHPPI researches, while curating a large imbalanced PHPPI dataset. Our model demonstrates a better result with the unsupervised learning model on PHPPI dataset.
Original language | English |
---|---|
Title of host publication | 2017 IEEE International Congress on Big Data |
Subtitle of host publication | BigData Congress |
Editors | George Karypis, Jia Zhang |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 368-375 |
Number of pages | 8 |
Edition | 1st |
ISBN (Electronic) | 9781538619964 |
DOIs | |
Publication status | Published - 7 Sept 2017 |
Event | IEEE International Congress on Big Data 2017 - Honolulu, United States of America Duration: 25 Jun 2017 → 30 Jun 2017 Conference number: 6th https://ieeexplore.ieee.org/xpl/conhome/8027154/proceeding (Proceedings) |
Conference
Conference | IEEE International Congress on Big Data 2017 |
---|---|
Abbreviated title | BigData Congress 2017 |
Country/Territory | United States of America |
City | Honolulu |
Period | 25/06/17 → 30/06/17 |
Internet address |
Keywords
- big data
- denoising autoencoder
- machine learning
- PHPPI
- prediction