IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-025-61823-w.html
   My bibliography  Save this article

Machine-learning driven strategies for adapting immunotherapy in metastatic NSCLC

Author

Listed:
  • Maliazurina B. Saad

    (The University of Texas MD Anderson Cancer Center)

  • Qasem Al-Tashi

    (The University of Texas MD Anderson Cancer Center)

  • Lingzhi Hong

    (The University of Texas MD Anderson Cancer Center
    MD Anderson Cancer Center)

  • Vivek Verma

    (The University of Texas MD Anderson Cancer Center)

  • Wentao Li

    (The University of Texas MD Anderson Cancer Center)

  • Daniel Boiarsky

    (Yale University School of Medicine)

  • Shenduo Li

    (Mayo Clinic)

  • Milena Petranovic

    (Massachusetts General Hospital)

  • Carol C. Wu

    (The University of Texas MD Anderson Cancer Center)

  • Brett W. Carter

    (The University of Texas MD Anderson Cancer Center)

  • Girish S. Shroff

    (The University of Texas MD Anderson Cancer Center)

  • Tina Cascone

    (MD Anderson Cancer Center)

  • Xiuning Le

    (MD Anderson Cancer Center)

  • Yasir Y. Elamin

    (MD Anderson Cancer Center)

  • Mehmet Altan

    (MD Anderson Cancer Center)

  • Simon Heeke

    (MD Anderson Cancer Center)

  • Ajay Sheshadri

    (The University of Texas MD Anderson Cancer Center)

  • Joe Y. Chang

    (The University of Texas MD Anderson Cancer Center)

  • Percy P. Lee

    (City of Hope Orange County)

  • Zhongxing Liao

    (The University of Texas MD Anderson Cancer Center)

  • Don L. Gibbons

    (MD Anderson Cancer Center)

  • Ara A. Vaporciyan

    (The University of Texas MD Anderson Cancer Center)

  • J. Jack Lee

    (The University of Texas MD Anderson Cancer Center)

  • Ignacio I. Wistuba

    (The University of Texas MD Anderson Cancer Center)

  • Cara Haymaker

    (The University of Texas MD Anderson Cancer Center)

  • Seyedali Mirjalili

    (Fortitude Valley
    VŠB-TU)

  • David Jaffray

    (The University of Texas MD Anderson Cancer Center)

  • Justin F. Gainor

    (Massachusetts General Hospital)

  • Yanyan Lou

    (Mayo Clinic)

  • Alessandro Federico

    (Harvard Medical School
    Dana-Farber Cancer Institute)

  • Federica Pecci

    (Harvard Medical School
    Dana-Farber Cancer Institute)

  • Mark Awad

    (Harvard Medical School
    Dana-Farber Cancer Institute)

  • Biagio Ricciuti

    (Harvard Medical School
    Dana-Farber Cancer Institute)

  • John V. Heymach

    (MD Anderson Cancer Center)

  • Natalie I. Vokes

    (MD Anderson Cancer Center
    The University of Texas MD Anderson Cancer Center)

  • Jianjun Zhang

    (MD Anderson Cancer Center
    The University of Texas MD Anderson Cancer Center)

  • Jia Wu

    (The University of Texas MD Anderson Cancer Center
    MD Anderson Cancer Center)

Abstract

Immune checkpoint inhibitors (ICIs), either as monotherapy (ICI-Mono) or combined with chemotherapy (ICI-Chemo), improves survival in advanced non-small cell lung cancer (NSCLC). However, prospective guidance for choosing between these options remains limited, and single-feature biomarkers like PD-L1 prove inadequate. We develop a machine learning model using clinicogenomic data from four cohorts (MD Anderson n = 750; Mayo Clinic n = 80; Dana-Farber n = 1077; Stand Up To Cancer n = 393) to predict individual benefit from adding chemotherapy. Benefit scores are calculated using five distinct functions derived from 28 genomic and 6 clinical features. Our integrated model, A-STEP (Attention-based Scoring for Treatment Effect Prediction), estimates heterogeneous treatment effects and achieves the largest reduction in 3-month progression risk, improving weighted risk reduction by 13–23% over stand-alone models. A-STEP recommends treatment changes for over 50% of patients, most often favoring ICI-Chemo. In simulation on external cohort, patients treated in accordance with A-STEP recommendations show improved 2-year progression-free survival (HR = 0.60 for ICI-Mono treatment arm; HR = 0.58 for ICI-Chemo treatment arm). Predictive features include FBXW7, APC, and PD-L1. In this study, we demonstrate how machine learning can fill critical gaps in immunotherapy selection for NSCLC, by modeling treatment heterogeneity with real-world clinicogenomic data, driving precision medicine beyond conventional biomarker boundaries.

Suggested Citation

  • Maliazurina B. Saad & Qasem Al-Tashi & Lingzhi Hong & Vivek Verma & Wentao Li & Daniel Boiarsky & Shenduo Li & Milena Petranovic & Carol C. Wu & Brett W. Carter & Girish S. Shroff & Tina Cascone & Xiu, 2025. "Machine-learning driven strategies for adapting immunotherapy in metastatic NSCLC," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-61823-w
    DOI: 10.1038/s41467-025-61823-w
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-61823-w
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-025-61823-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jacqulyne P. Robichaux & Xiuning Le & R. S. K. Vijayan & J. Kevin Hicks & Simon Heeke & Yasir Y. Elamin & Heather Y. Lin & Hibiki Udagawa & Ferdinandos Skoulidis & Hai Tran & Susan Varghese & Junqin H, 2021. "Structure-based classification predicts drug response in EGFR-mutant NSCLC," Nature, Nature, vol. 597(7878), pages 732-737, September.
    2. Shuai Chen & Lu Tian & Tianxi Cai & Menggang Yu, 2017. "A general statistical framework for subgroup identification and comparative treatment scoring," Biometrics, The International Biometric Society, vol. 73(4), pages 1199-1209, December.
    3. Donald B. Rubin, 2005. "Causal Inference Using Potential Outcomes: Design, Modeling, Decisions," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 322-331, March.
    4. Su Xiaogang & Zhou Tianni & Yan Xin & Fan Juanjuan & Yang Song, 2008. "Interaction Trees with Censored Survival Data," The International Journal of Biostatistics, De Gruyter, vol. 4(1), pages 1-28, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shonosuke Sugasawa & Hisashi Noma, 2021. "Efficient screening of predictive biomarkers for individual treatment selection," Biometrics, The International Biometric Society, vol. 77(1), pages 249-257, March.
    2. Shuai Chen & Lu Tian & Tianxi Cai & Menggang Yu, 2017. "A general statistical framework for subgroup identification and comparative treatment scoring," Biometrics, The International Biometric Society, vol. 73(4), pages 1199-1209, December.
    3. Muxuan Liang & Menggang Yu, 2023. "Relative contrast estimation and inference for treatment recommendation," Biometrics, The International Biometric Society, vol. 79(4), pages 2920-2932, December.
    4. Noémi Kreif & Richard Grieve & Iván Díaz & David Harrison, 2015. "Evaluation of the Effect of a Continuous Treatment: A Machine Learning Approach with an Application to Treatment for Traumatic Brain Injury," Health Economics, John Wiley & Sons, Ltd., vol. 24(9), pages 1213-1228, September.
    5. Lechner, Michael, 2018. "Modified Causal Forests for Estimating Heterogeneous Causal Effects," IZA Discussion Papers 12040, Institute of Labor Economics (IZA).
    6. Martin Ravallion, 2022. "On the Gains from Tradable Benefits‐in‐kind: Evidence for Workfare in India," Economica, London School of Economics and Political Science, vol. 89(355), pages 770-787, July.
    7. Peter Abell & Ofer Engel, 2021. "Subjective Causality and Counterfactuals in the Social Sciences: Toward an Ethnographic Causality?," Sociological Methods & Research, , vol. 50(4), pages 1842-1862, November.
    8. Alberto Galasso & El Hadi Caoui, 2025. "Fractional Ownership and Copyright Licensing: Evidence from the Music Industry," NBER Working Papers 34336, National Bureau of Economic Research, Inc.
    9. Goel, Deepti, 2025. "Estimator of What? A Note on Teaching Regressions in Introductory Econometrics," IZA Discussion Papers 18007, Institute of Labor Economics (IZA).
    10. Salvatore Bimonte & Antonella D’Agostino, 2021. "Tourism development and residents’ well-being: Comparing two seaside destinations in Italy," Tourism Economics, , vol. 27(7), pages 1508-1525, November.
    11. Mealli Fabrizia & Mattei Alessandra, 2012. "A Refreshing Account of Principal Stratification," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-19, April.
    12. Michael C Knaus & Michael Lechner & Anthony Strittmatter, 2021. "Machine learning estimation of heterogeneous causal effects: Empirical Monte Carlo evidence," The Econometrics Journal, Royal Economic Society, vol. 24(1), pages 134-161.
    13. Antonio R. Linero, 2022. "Simulation‐based estimators of analytically intractable causal effects," Biometrics, The International Biometric Society, vol. 78(3), pages 1001-1017, September.
    14. Wu, Guojun & Song, Ge & Lv, Xiaoxiang & Luo, Shikai & Shi, Chengchun & Zhu, Hongtu, 2023. "DNet: distributional network for distributional individualized treatment effects," LSE Research Online Documents on Economics 122895, London School of Economics and Political Science, LSE Library.
    15. Berger, Marius & Hottenrott, Hanna, 2021. "Start-up subsidies and the sources of venture capital," Journal of Business Venturing Insights, Elsevier, vol. 16(C).
    16. Sahar Saeed & Erica E. M. Moodie & Erin C. Strumpf & Marina B. Klein, 2018. "Segmented generalized mixed effect models to evaluate health outcomes," International Journal of Public Health, Springer;Swiss School of Public Health (SSPH+), vol. 63(4), pages 547-551, May.
    17. Afzal, Hania & Hawkins, Jason, 2024. "Electric vehicle and supply equipment adoption dynamics in the United States," Energy Policy, Elsevier, vol. 193(C).
    18. Ruoqing Zhu & Ying-Qi Zhao & Guanhua Chen & Shuangge Ma & Hongyu Zhao, 2017. "Greedy outcome weighted tree learning of optimal personalized treatment rules," Biometrics, The International Biometric Society, vol. 73(2), pages 391-400, June.
    19. Jinglong Zhao, 2024. "Experimental Design For Causal Inference Through An Optimization Lens," Papers 2408.09607, arXiv.org, revised Aug 2024.
    20. Hodula, Martin & Melecký, Martin & Pfeifer, Lukáš & Szabo, Milan, 2023. "Cooling the mortgage loan market: The effect of borrower-based limits on new mortgage lending," Journal of International Money and Finance, Elsevier, vol. 132(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-61823-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.