IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-025-58344-x.html
   My bibliography  Save this article

A clinically accessible small multimodal radiology model and evaluation metric for chest X-ray findings

Author

Listed:
  • Juan Manuel Zambrano Chaves

    (Microsoft Research
    Stanford University)

  • Shih-Cheng Huang

    (Stanford University)

  • Yanbo Xu

    (Microsoft Research)

  • Hanwen Xu

    (University of Washington)

  • Naoto Usuyama

    (Microsoft Research)

  • Sheng Zhang

    (Microsoft Research)

  • Fei Wang

    (University of Southern California)

  • Yujia Xie

    (Microsoft Research)

  • Mahmoud Khademi

    (Microsoft Research)

  • Ziyi Yang

    (Microsoft Research)

  • Hany Awadalla

    (Microsoft Research)

  • Julia Gong

    (Microsoft Research)

  • Houdong Hu

    (Microsoft Research)

  • Jianwei Yang

    (Microsoft Research)

  • Chunyuan Li

    (Microsoft Research)

  • Jianfeng Gao

    (Microsoft Research)

  • Yu Gu

    (Microsoft Research)

  • Cliff Wong

    (Microsoft Research)

  • Mu Wei

    (Microsoft Research)

  • Tristan Naumann

    (Microsoft Research)

  • Muhao Chen

    (University of California)

  • Matthew P. Lungren

    (Microsoft Research
    Stanford University
    University of California)

  • Akshay Chaudhari

    (Stanford University)

  • Serena Yeung-Levy

    (Stanford University)

  • Curtis P. Langlotz

    (Stanford University)

  • Sheng Wang

    (University of Washington)

  • Hoifung Poon

    (Microsoft Research)

Abstract

Large foundation models show promise in biomedicine but face challenges in clinical use due to performance gaps, accessibility, cost, and lack of scalable evaluation. Here we show that open-source small multimodal models can bridge these gaps in radiology by generating free-text findings from chest X-ray images. Our data-centric approach leverages 697K curated radiology image-text pairs to train a specialized, domain-adapted chest X-ray encoder. We integrate this encoder with pre-trained language models via a lightweight adapter that aligns image and text modalities. To enable robust, clinically relevant evaluation, we develop and validate CheXprompt, a GPT-4-based metric for assessing factual accuracy aligned with radiologists’ evaluations. Benchmarked with CheXprompt and other standard factuality metrics, LLaVA-Rad (7B) achieves state-of-the-art performance, outperforming much larger models like GPT-4V and Med-PaLM M (84B). While not immediately ready for real-time clinical deployment, LLaVA-Rad is a scalable, privacy-preserving and cost-effective step towards clinically adaptable multimodal AI for radiology.

Suggested Citation

  • Juan Manuel Zambrano Chaves & Shih-Cheng Huang & Yanbo Xu & Hanwen Xu & Naoto Usuyama & Sheng Zhang & Fei Wang & Yujia Xie & Mahmoud Khademi & Ziyi Yang & Hany Awadalla & Julia Gong & Houdong Hu & Jia, 2025. "A clinically accessible small multimodal radiology model and evaluation metric for chest X-ray findings," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-58344-x
    DOI: 10.1038/s41467-025-58344-x
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-025-58344-x
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-025-58344-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Michael Moor & Oishi Banerjee & Zahra Shakeri Hossein Abad & Harlan M. Krumholz & Jure Leskovec & Eric J. Topol & Pranav Rajpurkar, 2023. "Foundation models for generalist medical artificial intelligence," Nature, Nature, vol. 616(7956), pages 259-265, April.
    2. Shih-Cheng Huang & Akshay S. Chaudhari & Curtis P. Langlotz & Nigam Shah & Serena Yeung & Matthew P. Lungren, 2022. "Developing medical imaging AI for emerging infectious diseases," Nature Communications, Nature, vol. 13(1), pages 1-6, December.
    3. Hanwen Xu & Naoto Usuyama & Jaspreet Bagga & Sheng Zhang & Rajesh Rao & Tristan Naumann & Cliff Wong & Zelalem Gero & Javier González & Yu Gu & Yanbo Xu & Mu Wei & Wenhui Wang & Shuming Ma & Furu Wei , 2024. "A whole-slide foundation model for digital pathology from real-world data," Nature, Nature, vol. 630(8015), pages 181-188, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Eduard Chelebian & Christophe Avenel & Carolina Wählby, 2025. "Combining spatial transcriptomics with tissue morphology," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
    2. Luyang Luo & Mingxiang Wu & Mei Li & Yi Xin & Qiong Wang & Varut Vardhanabhuti & Winnie CW Chu & Zhenhui Li & Juan Zhou & Pranav Rajpurkar & Hao Chen, 2025. "A large model for non-invasive and personalized management of breast cancer from multiparametric MRI," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    3. Pengcheng Qiu & Chaoyi Wu & Xiaoman Zhang & Weixiong Lin & Haicheng Wang & Ya Zhang & Yanfeng Wang & Weidi Xie, 2024. "Towards building multilingual language model for medicine," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    4. Maksim Makarenko & Arturo Burguete-Lopez & Qizhou Wang & Silvio Giancola & Bernard Ghanem & Luca Passone & Andrea Fratalocchi, 2024. "Hardware-accelerated integrated optoelectronic platform towards real-time high-resolution hyperspectral video understanding," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    5. Fasheng Xu & Jing Hou & Wei Chen & Karen Xie, 2025. "Generative AI and Organizational Structure in the Knowledge Economy," Papers 2506.00532, arXiv.org.
    6. Senliang Lu & Yehang Chen & Yuan Chen & Peijun Li & Junqi Sun & Changye Zheng & Yujian Zou & Bo Liang & Mingwei Li & Qinggeng Jin & Enming Cui & Wansheng Long & Bao Feng, 2025. "General lightweight framework for vision foundation model supporting multi-task and multi-center medical image analysis," Nature Communications, Nature, vol. 16(1), pages 1-16, December.
    7. Jingbo Liu & Fan Jiang & Shinichi Tashiro & Shujun Chen & Manabu Tanaka, 2025. "A physics-informed and data-driven framework for robotic welding in manufacturing," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    8. Marc Schmitt & Pantelis Koutroumpis, 2025. "Cyber Shadows: Neutralizing Security Threats with AI and Targeted Policy Measures," Papers 2501.09025, arXiv.org, revised Jan 2025.
    9. Zhaochang Yang & Ting Wei & Ying Liang & Xin Yuan & RuiTian Gao & Yujia Xia & Jie Zhou & Yue Zhang & Zhangsheng Yu, 2025. "A foundation model for generalizable cancer diagnosis and survival prediction from histopathological images," Nature Communications, Nature, vol. 16(1), pages 1-16, December.
    10. Gabriele Campanella & Shengjia Chen & Manbir Singh & Ruchika Verma & Silke Muehlstedt & Jennifer Zeng & Aryeh Stock & Matt Croken & Brandon Veremis & Abdulkadir Elmas & Ivan Shujski & Noora Neittaanmä, 2025. "A clinical benchmark of public self-supervised pathology foundation models," Nature Communications, Nature, vol. 16(1), pages 1-12, December.
    11. Yujin Oh & Sangjoon Park & Hwa Kyung Byun & Yeona Cho & Ik Jae Lee & Jin Sung Kim & Jong Chul Ye, 2024. "LLM-driven multimodal target volume contouring in radiation oncology," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    12. Chuang Niu & Qing Lyu & Christopher D. Carothers & Parisa Kaviani & Josh Tan & Pingkun Yan & Mannudeep K. Kalra & Christopher T. Whitlow & Ge Wang, 2025. "Medical multimodal multitask foundation model for lung cancer screening," Nature Communications, Nature, vol. 16(1), pages 1-16, December.
    13. Zhou, Wuping & Xu, Chunchun & Zhang, Lanyue & Fu, Hongqiao & Jian, Weiyan, 2025. "Behaviours and drivers of diagnosis-related group upcoding in China: A mixed-methods study," Social Science & Medicine, Elsevier, vol. 366(C).
    14. Junwei Cheng & Chaoran Huang & Jialong Zhang & Bo Wu & Wenkai Zhang & Xinyu Liu & Jiahui Zhang & Yiyi Tang & Hailong Zhou & Qiming Zhang & Min Gu & Jianji Dong & Xinliang Zhang, 2024. "Multimodal deep learning using on-chip diffractive optics with in situ training capability," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    15. Zhilong Weng & Alexander Seper & Alexey Pryalukhin & Fabian Mairinger & Claudia Wickenhauser & Marcus Bauer & Lennert Glamann & Hendrik Bläker & Thomas Lingscheidt & Wolfgang Hulla & Danny Jonigk & Si, 2024. "GrandQC: A comprehensive solution to quality control problem in digital pathology," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    16. Soroosh Tayebi Arasteh & Tianyu Han & Mahshad Lotfinia & Christiane Kuhl & Jakob Nikolas Kather & Daniel Truhn & Sven Nebelung, 2024. "Large language models streamline automated machine learning for clinical studies," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    17. Jiashu Han & Kunzan Liu & Keith B. Isaacson & Kristina Monakhova & Linda G. Griffith & Sixian You, 2025. "System- and sample-agnostic isotropic three-dimensional microscopy by weakly physics-informed, domain-shift-resistant axial deblurring," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    18. Weijian Huang & Cheng Li & Hong-Yu Zhou & Hao Yang & Jiarun Liu & Yong Liang & Hairong Zheng & Shaoting Zhang & Shanshan Wang, 2024. "Enhancing representation in radiography-reports foundation model: a granular alignment algorithm using masked contrastive learning," Nature Communications, Nature, vol. 15(1), pages 1-12, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-58344-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.