IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v13y2022i1d10.1038_s41467-022-29292-7.html
   My bibliography  Save this article

Knowledge graph-based recommendation framework identifies drivers of resistance in EGFR mutant non-small cell lung cancer

Author

Listed:
  • Anna Gogleva

    (AI Engineering, R&D IT, AstraZeneca)

  • Dimitris Polychronopoulos

    (Oncology R&D, AstraZeneca)

  • Matthias Pfeifer

    (Oncology R&D, AstraZeneca)

  • Vladimir Poroshin

    (IGNITE, AstraZeneca)

  • Michaël Ughetto

    (AI Engineering, R&D IT, AstraZeneca)

  • Matthew J. Martin

    (Oncology R&D, AstraZeneca)

  • Hannah Thorpe

    (Oncology R&D, AstraZeneca)

  • Aurelie Bornot

    (Discovery Sciences, R&D, AstraZeneca)

  • Paul D. Smith

    (Oncology R&D, AstraZeneca)

  • Ben Sidders

    (Oncology R&D, AstraZeneca)

  • Jonathan R. Dry

    (Oncology R&D, AstraZeneca)

  • Miika Ahdesmäki

    (Oncology R&D, AstraZeneca)

  • Ultan McDermott

    (Oncology R&D, AstraZeneca)

  • Eliseo Papa

    (AI Engineering, R&D IT, AstraZeneca)

  • Krishna C. Bulusu

    (Oncology R&D, AstraZeneca)

Abstract

Resistance to EGFR inhibitors (EGFRi) presents a major obstacle in treating non-small cell lung cancer (NSCLC). One of the most exciting new ways to find potential resistance markers involves running functional genetic screens, such as CRISPR, followed by manual triage of significantly enriched genes. This triage process to identify ‘high value’ hits resulting from the CRISPR screen involves manual curation that requires specialized knowledge and can take even experts several months to comprehensively complete. To find key drivers of resistance faster we build a recommendation system on top of a heterogeneous biomedical knowledge graph integrating pre-clinical, clinical, and literature evidence. The recommender system ranks genes based on trade-offs between diverse types of evidence linking them to potential mechanisms of EGFRi resistance. This unbiased approach identifies 57 resistance markers from >3,000 genes, reducing hit identification time from months to minutes. In addition to reproducing known resistance markers, our method identifies previously unexplored resistance mechanisms that we prospectively validate.

Suggested Citation

  • Anna Gogleva & Dimitris Polychronopoulos & Matthias Pfeifer & Vladimir Poroshin & Michaël Ughetto & Matthew J. Martin & Hannah Thorpe & Aurelie Bornot & Paul D. Smith & Ben Sidders & Jonathan R. Dry &, 2022. "Knowledge graph-based recommendation framework identifies drivers of resistance in EGFR mutant non-small cell lung cancer," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
  • Handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-29292-7
    DOI: 10.1038/s41467-022-29292-7
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-022-29292-7
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-022-29292-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Silvana Konermann & Mark D. Brigham & Alexandro E. Trevino & Julia Joung & Omar O. Abudayyeh & Clea Barcena & Patrick D. Hsu & Naomi Habib & Jonathan S. Gootenberg & Hiroshi Nishimasu & Osamu Nureki &, 2015. "Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex," Nature, Nature, vol. 517(7536), pages 583-588, January.
    2. Neil Vasan & José Baselga & David M. Hyman, 2019. "A view on drug resistance in cancer," Nature, Nature, vol. 575(7782), pages 299-309, November.
    3. Wright, Marvin N. & Ziegler, Andreas, 2017. "ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 77(i01).
    4. Marinka Zitnik & Rok Sosič & Jure Leskovec, 2018. "Prioritizing network communities," Nature Communications, Nature, vol. 9(1), pages 1-9, December.
    5. Gai Li & Qiang Chen, 2016. "Exploiting Explicit and Implicit Feedback for Personalized Ranking," Mathematical Problems in Engineering, Hindawi, vol. 2016, pages 1-11, January.
    6. Markus Brockmann & Vincent A. Blomen & Joppe Nieuwenhuis & Elmer Stickel & Matthijs Raaben & Onno B. Bleijerveld & A. F. Maarten Altelaar & Lucas T. Jae & Thijn R. Brummelkamp, 2017. "Genetic wiring maps of single-cell protein states reveal an off-switch for GPCR signalling," Nature, Nature, vol. 546(7657), pages 307-311, June.
    7. Tijana Radivojević & Zak Costello & Kenneth Workman & Hector Garcia Martin, 2020. "A machine learning Automated Recommendation Tool for synthetic biology," Nature Communications, Nature, vol. 11(1), pages 1-14, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Heathcliff Dorado García & Fabian Pusch & Yi Bei & Jennifer Stebut & Glorymar Ibáñez & Kristina Guillan & Koshi Imami & Dennis Gürgen & Jana Rolff & Konstantin Helmsauer & Stephanie Meyer-Liesener & N, 2022. "Therapeutic targeting of ATR in alveolar rhabdomyosarcoma," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    2. Mariana Oliveira & Luís Torgo & Vítor Santos Costa, 2021. "Evaluation Procedures for Forecasting with Spatiotemporal Data," Mathematics, MDPI, vol. 9(6), pages 1-27, March.
    3. Tian Zhou & Xinyi Zhu & Zhizhong Ye & Yong-Fei Wang & Chao Yao & Ning Xu & Mi Zhou & Jianyang Ma & Yuting Qin & Yiwei Shen & Yuanjia Tang & Zhihua Yin & Hong Xu & Yutong Zhang & Xiaoli Zang & Huihua D, 2022. "Lupus enhancer risk variant causes dysregulation of IRF8 through cooperative lncRNA and DNA methylation machinery," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    4. Amir Pandi & Christoph Diehl & Ali Yazdizadeh Kharrazi & Scott A. Scholz & Elizaveta Bobkova & Léon Faure & Maren Nattermann & David Adam & Nils Chapin & Yeganeh Foroughijabbari & Charles Moritz & Nic, 2022. "A versatile active learning workflow for optimization of genetic and metabolic networks," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    5. Shaela Wright & Xujie Zhao & Wojciech Rosikiewicz & Shelby Mryncza & Judith Hyle & Wenjie Qi & Zhenling Liu & Siqi Yi & Yong Cheng & Beisi Xu & Chunliang Li, 2023. "Systematic characterization of the HOXA9 downstream targets in MLL-r leukemia by noncoding CRISPR screens," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    6. Wei Xu & Sau Har Lee & Fengjun Qiu & Li Zhou & Xiaoling Wang & Tingjie Ye & Xudong Hu, 2021. "Association of SMAD4 loss with drug resistance in clinical cancer patients: A systematic meta-analysis," PLOS ONE, Public Library of Science, vol. 16(5), pages 1-14, May.
    7. Arjan S. Gosal & Janine A. McMahon & Katharine M. Bowgen & Catherine H. Hoppe & Guy Ziv, 2021. "Identifying and Mapping Groups of Protected Area Visitors by Environmental Awareness," Land, MDPI, vol. 10(6), pages 1-14, May.
    8. Albert Stuart Reece & Gary Kenneth Hulse, 2022. "European Epidemiological Patterns of Cannabis- and Substance-Related Congenital Neurological Anomalies: Geospatiotemporal and Causal Inferential Study," IJERPH, MDPI, vol. 20(1), pages 1-35, December.
    9. Haipeng Fu & Tingyu Wang & Xiaohui Kong & Kun Yan & Yang Yang & Jingyi Cao & Yafei Yuan & Nan Wang & Kehkooi Kee & Zhi John Lu & Qiaoran Xi, 2022. "A Nodal enhanced micropeptide NEMEP regulates glucose uptake during mesendoderm differentiation of embryonic stem cells," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    10. Michael Parzinger & Lucia Hanfstaengl & Ferdinand Sigg & Uli Spindler & Ulrich Wellisch & Markus Wirnsberger, 2020. "Residual Analysis of Predictive Modelling Data for Automated Fault Detection in Building’s Heating, Ventilation and Air Conditioning Systems," Sustainability, MDPI, vol. 12(17), pages 1-18, August.
    11. Van Belle, Jente & Guns, Tias & Verbeke, Wouter, 2021. "Using shared sell-through data to forecast wholesaler demand in multi-echelon supply chains," European Journal of Operational Research, Elsevier, vol. 288(2), pages 466-479.
    12. Albert Stuart Reece & Gary Kenneth Hulse, 2022. "European Epidemiological Patterns of Cannabis- and Substance-Related Body Wall Congenital Anomalies: Geospatiotemporal and Causal Inferential Study," IJERPH, MDPI, vol. 19(15), pages 1-38, July.
    13. Philipp Bach & Victor Chernozhukov & Malte S. Kurz & Martin Spindler & Sven Klaassen, 2021. "DoubleML -- An Object-Oriented Implementation of Double Machine Learning in R," Papers 2103.09603, arXiv.org, revised Feb 2024.
    14. Marchetto, Elisa & Da Re, Daniele & Tordoni, Enrico & Bazzichetto, Manuele & Zannini, Piero & Celebrin, Simone & Chieffallo, Ludovico & Malavasi, Marco & Rocchini, Duccio, 2023. "Testing the effect of sample prevalence and sampling methods on probability- and favourability-based SDMs," Ecological Modelling, Elsevier, vol. 477(C).
    15. Yafeng Wang & Guiquan Zhang & Qingzhou Meng & Shisheng Huang & Panpan Guo & Qibin Leng & Lingyun Sun & Geng Liu & Xingxu Huang & Jianghuai Liu, 2022. "Precise tumor immune rewiring via synthetic CRISPRa circuits gated by concurrent gain/loss of transcription factors," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    16. Jorge Luis Andrade & José Luis Valencia, 2022. "A Fuzzy Random Survival Forest for Predicting Lapses in Insurance Portfolios Containing Imprecise Data," Mathematics, MDPI, vol. 11(1), pages 1-16, December.
    17. Eeva-Katri Kumpula & Pauline Norris & Adam C Pomerleau, 2020. "Stocks of paracetamol products stored in urban New Zealand households: A cross-sectional study," PLOS ONE, Public Library of Science, vol. 15(6), pages 1-11, June.
    18. Michael Bucker & Gero Szepannek & Alicja Gosiewska & Przemyslaw Biecek, 2020. "Transparency, Auditability and eXplainability of Machine Learning Models in Credit Scoring," Papers 2009.13384, arXiv.org.
    19. Jian Lu & Raheel Ahmad & Thomas Nguyen & Jeffrey Cifello & Humza Hemani & Jiangyuan Li & Jinguo Chen & Siyi Li & Jing Wang & Achouak Achour & Joseph Chen & Meagan Colie & Ana Lustig & Christopher Dunn, 2022. "Heterogeneity and transcriptome changes of human CD8+ T cells across nine decades of life," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    20. Timo Schulte & Tillmann Wurz & Oliver Groene & Sabine Bohnet-Joschko, 2023. "Big Data Analytics to Reduce Preventable Hospitalizations—Using Real-World Data to Predict Ambulatory Care-Sensitive Conditions," IJERPH, MDPI, vol. 20(6), pages 1-16, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-29292-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.