Joint representation of molecular networks from multiple species improves gene classification

Author

Listed:
  • Christopher A Mancuso
  • Kayla A Johnson
  • Renming Liu
  • Arjun Krishnan

Abstract

Network-based machine learning (ML) has the potential to predict novel genes associated with nearly any health and disease context. However, this approach often uses network information from only the single species under consideration, even though networks for most species are noisy and incomplete. While some recent methods have begun addressing this shortcoming by using networks from more than one species, they lack one or more key desirable properties: handling networks from more than two species simultaneously, incorporating many-to-many orthology information, or generating a network representation that is reusable across different types of prediction tasks, including newly defined ones. Here, we present GenePlexusZoo, a framework that casts molecular networks from multiple species into a single reusable feature space for network-based ML. We demonstrate that this multi-species network representation improves both gene classification within a single species and knowledge transfer across species, even in cases where the inter-species correspondence is undetectable based on shared orthologous genes. Thus, GenePlexusZoo makes it possible to effectively leverage the high evolutionary conservation of molecular, functional, and phenotypic features across species to discover novel genes associated with diverse biological contexts.

Author summary: Our work addresses two major challenges: (1) computationally predicting the role a gene plays in various diseases, processes, and phenotypes, and (2) accurately transferring genetic information discovered in one species to another. To tackle both challenges simultaneously, we developed the GenePlexusZoo method, which builds a gene classification model from molecular interaction information from multiple species while seamlessly handling the complicated mapping of how genes across species are functionally related. We show that machine learning classifiers that use information from multiple species outperform those that consider information from only a single species. Additionally, we show that the GenePlexusZoo method can accurately transfer knowledge from one species to another, even in cases where no connection would have been detected based solely on shared orthologous genes. Finally, we present an illustrative example of how GenePlexusZoo can provide novel insights into a complicated genetic disease.
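
The idea of casting several species' networks into one shared feature space can be illustrated with a toy sketch. This is not the authors' implementation: the function names (`build_joint_network`, `neighbor_features`), the gene identifiers, and the use of binary adjacency rows as features are all assumptions made for illustration, since the abstract does not specify how the representation is computed. The sketch only shows how many-to-many orthology links can join per-species networks so that every gene, from any species, is described in the same coordinate system.

```python
from collections import defaultdict

def build_joint_network(species_edges, ortholog_pairs):
    """Merge per-species networks into one graph (hypothetical helper).

    species_edges: {species: [(gene_a, gene_b), ...]}
    ortholog_pairs: [((species1, gene1), (species2, gene2)), ...]
        May be many-to-many: a gene can appear in several pairs.
    Nodes are (species, gene) tuples, so gene names never collide.
    """
    adj = defaultdict(set)
    # Within-species molecular interactions.
    for species, edges in species_edges.items():
        for a, b in edges:
            u, v = (species, a), (species, b)
            adj[u].add(v)
            adj[v].add(u)
    # Cross-species links taken from orthology.
    for u, v in ortholog_pairs:
        adj[u].add(v)
        adj[v].add(u)
    return adj

def neighbor_features(adj):
    """Return a sorted node list and one binary adjacency row per node.

    Assumption: adjacency rows stand in for whatever embedding a real
    pipeline would compute on the joint network.
    """
    nodes = sorted(adj)
    index = {n: i for i, n in enumerate(nodes)}
    feats = {}
    for n in nodes:
        row = [0] * len(nodes)
        for m in adj[n]:
            row[index[m]] = 1
        feats[n] = row
    return nodes, feats

# Toy data with made-up gene names; h1 has two mouse orthologs.
species_edges = {
    "human": [("h1", "h2")],
    "mouse": [("m1", "m3"), ("m2", "m3")],
}
ortholog_pairs = [
    (("human", "h1"), ("mouse", "m1")),
    (("human", "h1"), ("mouse", "m2")),
]

adj = build_joint_network(species_edges, ortholog_pairs)
nodes, feats = neighbor_features(adj)
for node in nodes:
    print(node, feats[node])
```

Because human and mouse genes share one feature space here, a classifier trained on labeled genes from one species could in principle be applied to genes from the other, which is the kind of cross-species transfer the abstract describes.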

Suggested Citation

  • Christopher A Mancuso & Kayla A Johnson & Renming Liu & Arjun Krishnan, 2024. "Joint representation of molecular networks from multiple species improves gene classification," PLOS Computational Biology, Public Library of Science, vol. 20(1), pages 1-20, January.
  • Handle: RePEc:plo:pcbi00:1011773
    DOI: 10.1371/journal.pcbi.1011773

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1011773
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011773&type=printable
    Download Restriction: no
