Abstract
We consider a semi-supervised learning scenario for regression, where only few labelled examples, many unlabelled instances and different data representations (multiple views) are available. For this setting, we extend support vector regression with a co-regularisation term and obtain co-regularised support vector regression (CoSVR). In addition to labelled data, co-regularisation includes information from unlabelled examples by ensuring that models trained on different views make similar predictions. Ligand affinity prediction is an important real-world problem that fits into this scenario. The characterisation of the strength of protein-ligand bonds is a crucial step in the process of drug discovery and design. We introduce variants of the base CoSVR algorithm and discuss their theoretical and computational properties. For the CoSVR function class we provide a theoretical bound on the Rademacher complexity. Finally, we demonstrate the usefulness of CoSVR for the affinity prediction task and evaluate its performance empirically on different protein-ligand datasets. We show that CoSVR outperforms co-regularised least squares regression as well as existing state-of-the-art approaches for affinity prediction. Code and data related to this chapter are available at: https://doi.org/10.6084/m9.figshare.5427241.
Original language | English |
---|---|
Title of host publication | Machine Learning and Knowledge Discovery in Databases |
Subtitle of host publication | European Conference, ECML PKDD 2017 Skopje, Macedonia, September 18–22, 2017 Proceedings, Part II |
Editors | Michelangelo Ceci, Jaakko Hollmen, Ljupco Todorovski, Celine Vens, Saso Dzeroski |
Place of Publication | Cham Switzerland |
Publisher | Springer |
Chapter | 10535 |
Pages | 338-354 |
Number of pages | 17 |
ISBN (Electronic) | 9783319712468 |
ISBN (Print) | 9783319712451 |
DOIs | |
Publication status | Published - 2017 |
Externally published | Yes |
Event | European Conference on Machine Learning European Conference on Principles and Practice of Knowledge Discovery in Databases 2017 - Skopje, North Macedonia Duration: 18 Sept 2017 → 22 Sept 2017 Conference number: 15th http://ecmlpkdd2017.ijs.si/ https://link.springer.com/book/10.1007/978-3-319-71249-9 (Proceedings) |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer |
Volume | 10535 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | European Conference on Machine Learning European Conference on Principles and Practice of Knowledge Discovery in Databases 2017 |
---|---|
Abbreviated title | ECML PKDD 2017 |
Country/Territory | North Macedonia |
City | Skopje |
Period | 18/09/17 → 22/09/17 |
Internet address |
Keywords
- Co-regularisation
- Kernel methods
- Ligand affinity prediction
- Multiple views
- Rademacher complexity
- Regression
- Semi-supervised learning