IDEAS home Printed from https://ideas.repec.org/p/wbk/wbrwps/11298.html

Does LLM Assistance Improve Healthcare Delivery ? An Evaluation Using On-Site Physicians and Laboratory Tests

Author

Listed:
  • Abaluck, Jason
  • Pless, Robert
  • Ravi, Nirmal
  • Sautmann, Anja
  • Schwartz, Aaron

Abstract

This study tests the effects of large language model (LLM) decision support on patient care at two outpatient clinics in Nigeria. Health workers were given the option to make revisions to their initial care plan based on LLM feedback. The unassisted and assisted plans are evaluated using (1) comparisons with independent care plans created by on-site physicians, (2) laboratory tests for malaria, anemia, and urinary tract infections, and (3) a blinded randomized assessment by the on-site physician who saw the same patient. In response to LLM feedback, health workers changed their prescribing for more than half of the patients and reported high satisfaction with the recommendations. In a selected sample, retrospective review by academic physicians also suggested improvements in care related to long-term risk management. However, the three metrics show mixed effects of LLM-assistance, with on average no significant improvement in diagnostic alignment with physicians, detection rates for the tested conditions, or physician subjective assessments. Health workers follow LLM recommendations that agree with the physician's decisions only slightly more often than those that do not. These results suggest that, despite some benefits, LLM-based frontline health worker support is not yet a public health priority in low- and middle-income countries.

Suggested Citation

  • Abaluck, Jason & Pless, Robert & Ravi, Nirmal & Sautmann, Anja & Schwartz, Aaron, 2026. "Does LLM Assistance Improve Healthcare Delivery ? An Evaluation Using On-Site Physicians and Laboratory Tests," Policy Research Working Paper Series 11298, The World Bank.
  • Handle: RePEc:wbk:wbrwps:11298
    as

    Download full text from publisher

    File URL: https://documents.worldbank.org/curated/en/099814501152625141/pdf/IDU-d099c5ba-e8b0-433e-9dbc-c23487ab9955.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Carolina Lopez & Anja Sautmann & Simone Schaner, 2022. "Does Patient Demand Contribute to the Overuse of Prescription Drugs?," American Economic Journal: Applied Economics, American Economic Association, vol. 14(1), pages 225-260, January.
    2. Sanjay Basu & Sepideh Modrek & Eran Bendavid, 2014. "Comparing Decisions for Malaria Testing and Presumptive Treatment," Medical Decision Making, , vol. 34(8), pages 996-1005, November.
    3. Tao Tu & Mike Schaekermann & Anil Palepu & Khaled Saab & Jan Freyberg & Ryutaro Tanno & Amy Wang & Brenna Li & Mohamed Amin & Yong Cheng & Elahe Vedadi & Nenad Tomasev & Shekoofeh Azizi & Karan Singha, 2025. "Towards conversational diagnostic artificial intelligence," Nature, Nature, vol. 642(8067), pages 442-450, June.
    4. Currie, Janet & Lin, Wanchuan & Meng, Juanjuan, 2014. "Addressing antibiotic abuse in China: An experimental audit study," Journal of Development Economics, Elsevier, vol. 110(C), pages 39-51.
    5. Jishnu Das & Alaka Holla & Aakash Mohpal & Karthik Muralidharan, 2016. "Quality and Accountability in Health Care Delivery: Audit-Study Evidence from Primary Care in India," American Economic Review, American Economic Association, vol. 106(12), pages 3765-3799, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Carolina Lopez & Anja Sautmann & Simone G. Schaner, 2024. "Do Patients Value High-Quality Medical Care? Experimental Evidence from Malaria Diagnosis and Treatment," NBER Working Papers 32075, National Bureau of Economic Research, Inc.
    2. He, Daixin & Lu, Fangwen & Yang, Jianan, 2023. "Impact of self- or social-regarding health messages: Experimental evidence based on antibiotics purchases," Journal of Development Economics, Elsevier, vol. 163(C).
    3. Maria Vittoria Levati & Ivan Soraperra & Saba Yifredew, 2023. "How to Curb Over-The-Counter Sales of Antibiotics? Evidence from a Field Experiment in Ethiopia," Working Papers 10/2023, University of Verona, Department of Economics.
    4. Hao Xue & Jennifer Hager & Qi An & Kai Liu & Jing Zhang & Emma Auden & Bingyan Yang & Jie Yang & Hongyan Liu & Jingchun Nie & Aiqin Wang & Chengchao Zhou & Yaojiang Shi & Sean Sylvia, 2018. "The Quality of Tuberculosis Care in Urban Migrant Clinics in China," IJERPH, MDPI, vol. 15(9), pages 1-15, September.
    5. Michael Allan Ribers & Hannes Ullrich, 2020. "Machine Predictions and Human Decisions with Variation in Payoffs and Skill," CESifo Working Paper Series 8702, CESifo.
    6. Michael Allan Ribers & Hannes Ullrich, 2019. "Battling antibiotic resistance: can machine learning improve prescribing?," CESifo Working Paper Series 7654, CESifo.
    7. Lagarde, Mylène & Blaauw, Duane, 2022. "Overtreatment and benevolent provider moral hazard: Evidence from South African doctors," Journal of Development Economics, Elsevier, vol. 158(C).
    8. Si, Yafei & Bateman, Hazel & Chen, Shu & Hanewald, Katja & Li, Bingqin & Su, Min & Zhou, Zhongliang, 2023. "Quantifying the financial impact of overuse in primary care in China: A standardised patient study," Social Science & Medicine, Elsevier, vol. 320(C).
    9. Jishnu Das & Alaka Holla & Aakash Mohpal & Karthik Muralidharan, 2016. "Quality and Accountability in Health Care Delivery: Audit-Study Evidence from Primary Care in India," American Economic Review, American Economic Association, vol. 106(12), pages 3765-3799, December.
    10. Yiwei Chen & Hui Ding & Min Yu & Jieming Zhong & Ruying Hu & Xiangyu Chen & Chunmei Wang & Kaixu Xie & Karen Eggleston, 2019. "The Effects of Primary Care Chronic-Disease Management in Rural China," NBER Working Papers 26100, National Bureau of Economic Research, Inc.
    11. Hartwig, Renate & Japaridze, Dimitri & Khetsuriani, Nana & Vollmer, Sebastian, 2024. "Information signal and medical diagnosis: Audit study evidence from Georgia," Ruhr Economic Papers 1112, RWI - Leibniz-Institut für Wirtschaftsforschung, Ruhr-University Bochum, TU Dortmund University, University of Duisburg-Essen.
    12. Michael Allan Ribers & Hannes Ullrich, 2023. "Machine learning and physician prescribing: a path to reduced antibiotic use," Berlin School of Economics Discussion Papers 0019, Berlin School of Economics.
    13. Silvia Angerer & Daniela Glätzle‐Rützler & Christian Waibel, 2021. "Monitoring institutions in healthcare markets: Experimental evidence," Health Economics, John Wiley & Sons, Ltd., vol. 30(5), pages 951-971, May.
    14. Lagarde, Mylène & Blaauw, Duane, 2022. "Overtreatment and benevolent provider moral hazard: evidence from South African doctors," LSE Research Online Documents on Economics 115383, London School of Economics and Political Science, LSE Library.
    15. Hannes Ullrich & Michael Allan Ribers, 2023. "Machine predictions and human decisions with variation in payoffs and skill: the case of antibiotic prescribing," Berlin School of Economics Discussion Papers 0027, Berlin School of Economics.
    16. Carolina Lopez & Anja Sautmann & Simone Schaner, 2018. "The Contribution of Patients and Providers to the Overuse of Prescription Drugs," NBER Working Papers 25284, National Bureau of Economic Research, Inc.
    17. Chen, Juhui & Bai, Junfei & Van Trijp, Hans C.M., 2024. "Savior or Driver? Retailer recommendation and pesticide overuse," 2024 Annual Meeting, July 28-30, New Orleans, LA 343584, Agricultural and Applied Economics Association.
    18. Ortiz, Santiago & Castelblanco, Geraldine & Mantilla, Cesar, 2020. "Perishability, dynamic pricing and price discrimination: evidence from flower markets in Bogotá," SocArXiv pv5kz, Center for Open Science.
    19. repec:ags:aaea22:343584 is not listed on IDEAS
    20. Carolina Lopez & Anja Sautmann & Simone Schaner, 2022. "Does Patient Demand Contribute to the Overuse of Prescription Drugs?," American Economic Journal: Applied Economics, American Economic Association, vol. 14(1), pages 225-260, January.
    21. Maxime Perodaud & Michela Chessa, 2026. "Hey, what did you expect ? Confirmation bias in credence goods markets: Theoretical and experimental analyses," Post-Print hal-05441370, HAL.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wbk:wbrwps:11298. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Roula I. Yazigi (email available below). General contact details of provider: https://edirc.repec.org/data/dvewbus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.