IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1010732.html
   My bibliography  Save this article

ToMExO: A probabilistic tree-structured model for cancer progression

Author

Listed:
  • Mohammadreza Mohaghegh Neyshabouri
  • Jens Lagergren

Abstract

Identifying the interrelations among cancer driver genes and the patterns in which the driver genes get mutated is critical for understanding cancer. In this paper, we study cross-sectional data from cohorts of tumors to identify the cancer-type (or subtype) specific process in which the cancer driver genes accumulate critical mutations. We model this mutation accumulation process using a tree, where each node includes a driver gene or a set of driver genes. A mutation in each node enables its children to have a chance of mutating. This model simultaneously explains the mutual exclusivity patterns observed in mutations in specific cancer genes (by its nodes) and the temporal order of events (by its edges). We introduce a computationally efficient dynamic programming procedure for calculating the likelihood of our noisy datasets and use it to build our Markov Chain Monte Carlo (MCMC) inference algorithm, ToMExO. Together with a set of engineered MCMC moves, our fast likelihood calculations enable us to work with datasets with hundreds of genes and thousands of tumors, which cannot be dealt with using available cancer progression analysis methods. We demonstrate our method’s performance on several synthetic datasets covering various scenarios for cancer progression dynamics. Then, a comparison against two state-of-the-art methods on a moderate-size biological dataset shows the merits of our algorithm in identifying significant and valid patterns. Finally, we present our analyses of several large biological datasets, including colorectal cancer, glioblastoma, and pancreatic cancer. In all the analyses, we validate the results using a set of method-independent metrics testing the causality and significance of the relations identified by ToMExO or competing methods.Author summary: Cancer progression is an evolutionary process where somatic mutations in so-called driver genes provide the harboring cells with certain selective advantages. Identifying the interplay among the driver genes is critical for understanding how cancer evolves. In this paper, we introduce a method for analyzing cohorts of tumors. Our approach is based on a novel probabilistic model, which can identify the temporal order of mutations in driver genes, and how they may exhaust each other’s selective advantages. We introduce an efficient likelihood calculation procedure and build an MCMC algorithm for making inferences based on our model. Our computationally efficient inference algorithm enables us to work with hundreds of genes and thousands of tumors. Using a broad set of synthetic data experiments, we demonstrate the performance of our inference algorithm in various scenarios. We also present our analyses of several biological datasets. Our results agree with a set of well-known relations among the driver genes and suggest new interesting such relationships.

Suggested Citation

  • Mohammadreza Mohaghegh Neyshabouri & Jens Lagergren, 2022. "ToMExO: A probabilistic tree-structured model for cancer progression," PLOS Computational Biology, Public Library of Science, vol. 18(12), pages 1-22, December.
  • Handle: RePEc:plo:pcbi00:1010732
    DOI: 10.1371/journal.pcbi.1010732
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1010732
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1010732&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1010732?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Loes Olde Loohuis & Giulio Caravagna & Alex Graudenzi & Daniele Ramazzotti & Giancarlo Mauri & Marco Antoniotti & Bud Mishra, 2014. "Inferring Tree Causal Models of Cancer Progression with Probability Raising," PLOS ONE, Public Library of Science, vol. 9(10), pages 1-14, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Olav N L Aga & Morten Brun & Kazeem A Dauda & Ramon Diaz-Uriarte & Konstantinos Giannakis & Iain G Johnston, 2024. "HyperTraPS-CT: Inference and prediction for accumulation pathways with flexible data and model structures," PLOS Computational Biology, Public Library of Science, vol. 20(9), pages 1-22, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1010732. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.