Machine Learning Approaches for Predicting Protein Complex Similarity

Farhoodi, Roshanak; Akbal-Delibas, Bahar; Haspel, Nurit

Machine Learning Approaches for Predicting Protein Complex Similarity

Date

2017

Authors

Farhoodi, Roshanak

Akbal-Delibas, Bahar

Haspel, Nurit

Publisher

Mary Ann Liebert Inc Publ

Green Open Access

No

Publicly Funded

No

Impulse

Average

Influence

Average

Popularity

Average

Abstract

Discriminating native-like structures from false positives with high accuracy is one of the biggest challenges in protein-protein docking. While there is an agreement on the existence of a relationship between various favorable intermolecular interactions (e.g. Van der Waals electrostatic and desolvation forces) and the similarity of a conformation to its native structure the precise nature of this relationship is not known. Existing protein-protein docking methods typically formulate this relationship as a weighted sum of selected terms and calibrate their weights by using a training set to evaluate and rank candidate complexes. Despite improvements in the predictive power of recent docking methods producing a large number of false positives by even state-of-the-art methods often leads to failure in predicting the correct binding of many complexes. With the aid of machine learning methods we tested several approaches that not only rank candidate structures relative to each other but also predict how similar each candidate is to the native conformation. We trained a two-layer neural network a multilayer neural network and a network of Restricted Boltzmann Machines against extensive data sets of unbound complexes generated by RosettaDock and PyDock. We validated these methods with a set of refinement candidate structures. We were able to predict the root mean squared deviations (RMSDs) of protein complexes with a very small often less than 1.5 angstrom error margin when trained with structures that have RMSD values of up to 7 angstrom. In our most recent experiments with the protein samples having RMSD values up to 27 angstrom the average prediction error was still relatively small attesting to the potential of our approach in predicting the correct binding of protein-protein complexes.

Keywords

Machine learning, Neural networks, Protein docking and refinement, RMSD prediction, Scoring functions, Protein Conformation, Proteins, Protein docking and refinement, Machine Learning, Molecular Docking Simulation, Scoring functions, Machine learning, Animals, Humans, Neural Networks, Computer, RMSD prediction, Neural networks, Protein Binding

Fields of Science

0301 basic medicine, 0303 health sciences, 03 medical and health sciences

WoS Q

Q2

Scopus Q

Q3

OpenCitations Citation Count

1

Source

Journal of Computational Biology

Volume

24

Issue

1

Start Page

40

End Page

51

URI

https://hdl.handle.net/20.500.12469/412
https://doi.org/10.1089/cmb.2016.0137

Collections

WoS İndeksli Yayınlar Koleksiyonu
Bilgisayar Mühendisliği Bölümü Koleksiyonu
PubMed İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

PlumX Metrics

Citations

CrossRef : 1

Scopus : 0

PubMed : 1

Captures

Mendeley Readers : 13

Full item page

Page Views

5

checked on Mar 05, 2026

Google Scholar™

Check

Machine Learning Approaches for Predicting Protein Complex Similarity

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Open Access Color

Green Open Access

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

BIP! Indicators

Research Projects

Journal Issue

Abstract

Description

Keywords

Fields of Science

Citation

WoS Q

Scopus Q

OpenCitations Citation Count

Source

Volume

Issue

Start Page

End Page

URI

Collections

PlumX Metrics

Citations

Captures

Page Views

5

Google Scholar™

OpenAlex FWCI

0.1667

Sustainable Development Goals

SDG data is not available