Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Struct2Graph: A graph attention network for structure based predictions of protein-protein interactions

View ORCID ProfileMayank Baranwal, View ORCID ProfileAbram Magner, Jacob Saldinger, Emine S. Turali-Emre, View ORCID ProfilePaolo Elvati, View ORCID ProfileShivani Kozarekar, View ORCID ProfileJ. Scott VanEpps, Nicholas A. Kotov, View ORCID ProfileAngela Violi, Alfred O. Hero
doi: https://doi.org/10.1101/2020.09.17.301200
Mayank Baranwal
1Division of Data and Decision Sciences, Tata Consultancy Services Research, Mumbai, India
2Systems & Control Engineering Group, Indian Institute of Technology, Bombay, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Mayank Baranwal
  • For correspondence: baranwal.mayank{at}tcs.com
Abram Magner
3Department of Computer Science, University of Albany, SUNY, Albany, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Abram Magner
Jacob Saldinger
4Department of Chemical Engineering, University of Michigan, Ann Arbor, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Emine S. Turali-Emre
5Department of Biomedical Engineering, University of Michigan, Ann Arbor, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Paolo Elvati
6Department of Mechanical Engineering, University of Michigan, Ann Arbor, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Paolo Elvati
Shivani Kozarekar
4Department of Chemical Engineering, University of Michigan, Ann Arbor, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Shivani Kozarekar
J. Scott VanEpps
5Department of Biomedical Engineering, University of Michigan, Ann Arbor, USA
7Department of Emergency Medicine, University of Michigan, Ann Arbor, USA
8Biointerfaces Institute, University of Michigan, Ann Arbor, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for J. Scott VanEpps
Nicholas A. Kotov
4Department of Chemical Engineering, University of Michigan, Ann Arbor, USA
5Department of Biomedical Engineering, University of Michigan, Ann Arbor, USA
8Biointerfaces Institute, University of Michigan, Ann Arbor, USA
9Department of Materials Science & Engineering, University of Michigan, Ann Arbor, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Angela Violi
4Department of Chemical Engineering, University of Michigan, Ann Arbor, USA
6Department of Mechanical Engineering, University of Michigan, Ann Arbor, USA
10Biophysics Program, University of Michigan, Ann Arbor, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Angela Violi
Alfred O. Hero
5Department of Biomedical Engineering, University of Michigan, Ann Arbor, USA
11Department of Electrical Engineering & Computer Science, University of Michigan, Ann Arbor, USA
12Department of Statistics, University of Michigan, Ann Arbor, USA
13Program in Applied Interdisciplinary Mathematics, University of Michigan, Ann Arbor, USA
14Program in Bioinformatics, University of Michigan, Ann Arbor, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Background Development of new methods for analysis of protein-protein interactions (PPIs) at molecular and nanometer scales gives insights into intracellular signaling pathways and will improve understanding of protein functions, as well as other nanoscale structures of biological and abiological origins. Recent advances in computational tools, particularly the ones involving modern deep learning algorithms, have been shown to complement experimental approaches for describing and rationalizing PPIs. However, most of the existing works on PPI predictions use protein-sequence information, and thus have difficulties in accounting for the three-dimensional organization of the protein chains.

Results In this study, we address this problem and describe a PPI analysis based on a graph attention network, named Struct2Graph, for identifying PPIs directly from the structural data of folded protein globules. Our method is capable of predicting the PPI with an accuracy of 98.89% on the balanced set consisting of an equal number of positive and negative pairs. On the unbalanced set with the ratio of 1:10 between positive and negative pairs, Struct2Graph achieves a five-fold cross validation average accuracy of 99.42%. Moreover, Struct2Graph can potentially identify residues that likely contribute to the formation of the protein-protein complex. The identification of important residues is tested for two different interaction types: (a) Proteins with multiple ligands competing for the same binding area, (b) Dynamic protein-protein adhesion interaction. Struct2Graph identifies interacting residues with 30% sensitivity, 89% specificity, and 87% accuracy.

Conclusions In this manuscript, we address the problem of prediction of PPIs using a first of its kind, 3D-structure-based graph attention network (code available at https://github.com/baranwa2/Struct2Graph). Furthermore, the novel mutual attention mechanism provides insights into likely interaction sites through its unsupervised knowledge selection process. This study demonstrates that a relatively low-dimensional feature embedding learned from graph structures of individual proteins outperforms other modern machine learning classifiers based on global protein features. In addition, through the analysis of single amino acid variations, the attention mechanism shows preference for disease-causing residue variations over benign polymorphisms, demonstrating that it is not limited to interface residues.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • New results on human SAV dataset, as well as detailed discussion on PPI database

  • https://github.com/baranwa2/Struct2Graph

  • [1] Based on the pairwise homology analysis comprising of 3677 unique proteins in our database, only 0.3% of the proteins were found to have BLAST e-value < 0.05 and 0.26% has < 0.001, indicating statistically insignificant homologous relationships.

  • Abbreviations

    PPI
    Protein-Protein Interaction
    GAT
    Graph Attention Network
    GCN
    Graph Convolutional Network
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
    Back to top
    PreviousNext
    Posted June 08, 2022.
    Download PDF

    Supplementary Material

    Data/Code
    Email

    Thank you for your interest in spreading the word about bioRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    Struct2Graph: A graph attention network for structure based predictions of protein-protein interactions
    (Your Name) has forwarded a page to you from bioRxiv
    (Your Name) thought you would like to see this page from the bioRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    Struct2Graph: A graph attention network for structure based predictions of protein-protein interactions
    Mayank Baranwal, Abram Magner, Jacob Saldinger, Emine S. Turali-Emre, Paolo Elvati, Shivani Kozarekar, J. Scott VanEpps, Nicholas A. Kotov, Angela Violi, Alfred O. Hero
    bioRxiv 2020.09.17.301200; doi: https://doi.org/10.1101/2020.09.17.301200
    Twitter logo Facebook logo LinkedIn logo Mendeley logo
    Citation Tools
    Struct2Graph: A graph attention network for structure based predictions of protein-protein interactions
    Mayank Baranwal, Abram Magner, Jacob Saldinger, Emine S. Turali-Emre, Paolo Elvati, Shivani Kozarekar, J. Scott VanEpps, Nicholas A. Kotov, Angela Violi, Alfred O. Hero
    bioRxiv 2020.09.17.301200; doi: https://doi.org/10.1101/2020.09.17.301200

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Bioinformatics
    Subject Areas
    All Articles
    • Animal Behavior and Cognition (7561)
    • Biochemistry (17409)
    • Bioengineering (13663)
    • Bioinformatics (41400)
    • Biophysics (21215)
    • Cancer Biology (18335)
    • Cell Biology (25190)
    • Clinical Trials (138)
    • Developmental Biology (13246)
    • Ecology (19695)
    • Epidemiology (2067)
    • Evolutionary Biology (24120)
    • Genetics (15491)
    • Genomics (22289)
    • Immunology (17541)
    • Microbiology (39939)
    • Molecular Biology (16969)
    • Neuroscience (87586)
    • Paleontology (664)
    • Pathology (2805)
    • Pharmacology and Toxicology (4750)
    • Physiology (7558)
    • Plant Biology (14944)
    • Scientific Communication and Education (2033)
    • Synthetic Biology (4229)
    • Systems Biology (9719)
    • Zoology (2251)