IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0248360.html

Integrating human services and criminal justice data with claims data to predict risk of opioid overdose among Medicaid beneficiaries: A machine-learning approach

Author

Listed:
  • Wei-Hsuan Lo-Ciganic
  • Julie M Donohue
  • Eric G Hulsey
  • Susan Barnes
  • Yuan Li
  • Courtney C Kuza
  • Qingnan Yang
  • Jeanine Buchanich
  • James L Huang
  • Christina Mair
  • Debbie L Wilson
  • Walid F Gellad

Abstract

Health system data incompletely capture the social risk factors for drug overdose. This study aimed to improve the accuracy of a machine-learning algorithm to predict opioid overdose risk by integrating human services and criminal justice data with health claims data to capture the social determinants of overdose risk. This prognostic study included Medicaid beneficiaries (n = 237,259) in Allegheny County, Pennsylvania enrolled between 2015 and 2018, randomly divided into training, testing, and validation samples. We measured 290 potential predictors (239 derived from Medicaid claims data) in 30-day periods, beginning with the first observed Medicaid enrollment date during the study period. Using a gradient boosting machine, we predicted a composite outcome (i.e., fatal or nonfatal opioid overdose constructed using medical examiner and claims data) in the subsequent month. We compared prediction performance between a Medicaid claims only model to one integrating human services and criminal justice data with Medicaid claims (i.e., integrated model) using several metrics (e.g., C-statistic, number needed to evaluate [NNE] to identify one overdose). Beneficiaries were stratified into risk-score decile subgroups. The samples (training = 79,087, testing = 79,086, validation = 79,086) had similar characteristics (age = 38±18 years, female = 56%, white = 48%, having at least one overdose = 1.7% during study period). Using the validation sample, the integrated model slightly improved on the Medicaid claims only model (C-statistic = 0.885; 95%CI = 0.877–0.892 vs. C-statistic = 0.871; 95%CI = 0.863–0.878), with small corresponding improvements in the NNE and positive predictive value. Nine of the top 30 most important predictors in the integrated model were human services and criminal justice variables. Using the integrated model, approximately 70% of individuals with overdoses were members of the top risk decile (overdose rates in the subsequent month = 47/10,000 beneficiaries). Few individuals in the bottom 9 deciles had overdose episodes (0-12/10,000). Machine-learning algorithms integrating claims and social service and criminal justice data modestly improved opioid overdose prediction among Medicaid beneficiaries for a large U.S. county heavily affected by the opioid crisis.

Suggested Citation

  • Wei-Hsuan Lo-Ciganic & Julie M Donohue & Eric G Hulsey & Susan Barnes & Yuan Li & Courtney C Kuza & Qingnan Yang & Jeanine Buchanich & James L Huang & Christina Mair & Debbie L Wilson & Walid F Gellad, 2021. "Integrating human services and criminal justice data with claims data to predict risk of opioid overdose among Medicaid beneficiaries: A machine-learning approach," PLOS ONE, Public Library of Science, vol. 16(3), pages 1-18, March.
  • Handle: RePEc:plo:pone00:0248360
    DOI: 10.1371/journal.pone.0248360
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0248360
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0248360&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0248360?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Takaya Saito & Marc Rehmsmeier, 2015. "The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-21, March.
    2. Paul Thottakkara & Tezcan Ozrazgat-Baslanti & Bradley B Hupf & Parisa Rashidi & Panos Pardalos & Petar Momcilovic & Azra Bihorac, 2016. "Application of Machine Learning Techniques to High-Dimensional Clinical Data to Forecast Postoperative Complications," PLOS ONE, Public Library of Science, vol. 11(5), pages 1-19, May.
    3. repec:abf:journl:v:31:y:2020:i:3:p:24253-24254 is not listed on IDEAS
    4. Dasgupta, N. & Beletsky, L. & Ciccarone, D., 2018. "Opioid Crisis: No Easy Fix to Its Social and Economic Determinants," American Journal of Public Health, American Public Health Association, vol. 108(2), pages 182-186.
    5. Sean F Altekruse & Candace M Cosgrove & William C Altekruse & Richard A Jenkins & Carlos Blanco, 2020. "Socioeconomic risk factors for fatal opioid overdoses in the United States: Findings from the Mortality Disparities in American Communities Study (MDAC)," PLOS ONE, Public Library of Science, vol. 15(1), pages 1-16, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lee, Heejin & Wilcox, Pamela & Chang, Won, 2025. "Interpretable Machine Learning and Criminological Theories: Global Evidence on Bullying Perpetration and Victimization (2001–2014)," Journal of Criminal Justice, Elsevier, vol. 100(C).
    2. Dimitris Bertsimas & Mohammad M. Fazel-Zarandi & Joshua Ivanhoe & Periklis Petridis, 2025. "Early Detection of Opioid Over-Procurement: A Semisupervised Machine Learning Approach," Manufacturing & Service Operations Management, INFORMS, vol. 27(6), pages 1889-1904, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wei-Hsuan Lo-Ciganic & James L Huang & Hao H Zhang & Jeremy C Weiss & C Kent Kwoh & Julie M Donohue & Adam J Gordon & Gerald Cochran & Daniel C Malone & Courtney C Kuza & Walid F Gellad, 2020. "Using machine learning to predict risk of incident opioid use disorder among fee-for-service Medicare beneficiaries: A prognostic study," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-16, July.
    2. Nicky J. Mehtani & Chika C. Chuku & Meredith C. Meacham & Eric Vittinghoff & Samantha E. Dilworth & Elise D. Riley, 2023. "Housing Instability Associated with Return to Stimulant Use among Previously Abstaining Women," IJERPH, MDPI, vol. 20(19), pages 1-10, September.
    3. Christopher J Greenwood & George J Youssef & Primrose Letcher & Jacqui A Macdonald & Lauryn J Hagg & Ann Sanson & Jenn Mcintosh & Delyse M Hutchinson & John W Toumbourou & Matthew Fuller-Tyszkiewicz &, 2020. "A comparison of penalised regression methods for informing the selection of predictive markers," PLOS ONE, Public Library of Science, vol. 15(11), pages 1-14, November.
    4. Jie-Huei Wang & Cheng-Yu Liu & You-Ruei Min & Zih-Han Wu & Po-Lin Hou, 2024. "Cancer Diagnosis by Gene-Environment Interactions via Combination of SMOTE-Tomek and Overlapped Group Screening Approaches with Application to Imbalanced TCGA Clinical and Genomic Data," Mathematics, MDPI, vol. 12(14), pages 1-24, July.
    5. Le, Hong Hanh & Viviani, Jean-Laurent, 2018. "Predicting bank failure: An improvement by implementing a machine-learning approach to classical financial ratios," Research in International Business and Finance, Elsevier, vol. 44(C), pages 16-25.
    6. Giulio Mattera & Mario Vozza & Joseph Polden & Luigi Nele & Zengxi Pan, 2025. "Frequency informed convolutional autoencoder for in situ anomaly detection in wire arc additive manufacturing," Journal of Intelligent Manufacturing, Springer, vol. 36(8), pages 5819-5834, December.
    7. Weiss, Max & Zoorob, Michael, 2021. "Political frames of public health crises: Discussing the opioid epidemic in the US Congress," Social Science & Medicine, Elsevier, vol. 281(C).
    8. João Chang Junior & Fábio Binuesa & Luiz Fernando Caneo & Aida Luiza Ribeiro Turquetto & Elisandra Cristina Trevisan Calvo Arita & Aline Cristina Barbosa & Alfredo Manoel da Silva Fernandes & Evelinda, 2020. "Improving preoperative risk-of-death prediction in surgery congenital heart defects using artificial intelligence model: A pilot study," PLOS ONE, Public Library of Science, vol. 15(9), pages 1-21, September.
    9. William Encinosa & Didem Bernard & Thomas M. Selden, 2022. "Opioid and non-opioid analgesic prescribing before and after the CDC’s 2016 opioid guideline," International Journal of Health Economics and Management, Springer, vol. 22(1), pages 1-52, March.
    10. Arthur De Sá Ferreira & Ney Meziat-Filho & Ana Paula Antunes Ferreira, 2021. "Double threshold receiver operating characteristic plot for three-modal continuous predictors," Computational Statistics, Springer, vol. 36(3), pages 2231-2245, September.
    11. Fan, Xudong & Wang, Xiaowei & Zhang, Xijin & ASCE Xiong (Bill) Yu, P.E.F., 2022. "Machine learning based water pipe failure prediction: The effects of engineering, geology, climate and socio-economic factors," Reliability Engineering and System Safety, Elsevier, vol. 219(C).
    12. Zhang, Han, 2021. "How Using Machine Learning Classification as a Variable in Regression Leads to Attenuation Bias and What to Do About It," SocArXiv 453jk, Center for Open Science.
    13. Zoe Lindenfeld & Diana Silver & José A Pagán & Donglan Stacy Zhang & Ji Eun Chang, 2024. "Examining the relationship between social determinants of health, measures of structural racism and county-level overdose deaths from 2017–2020," PLOS ONE, Public Library of Science, vol. 19(5), pages 1-11, May.
    14. Aaron R Kaufman & Eitan D Hersh, 2020. "The political consequences of opioid overdoses," PLOS ONE, Public Library of Science, vol. 15(8), pages 1-10, August.
    15. Ye, Yiwan & Shu, Xiaoling, 2025. "Unequal but widespread despairs: Social inequalities and self-rated health trends in the United States in 1972–2018," Social Science & Medicine, Elsevier, vol. 367(C).
    16. Masabho P Milali & Samson S Kiware & Nicodem J Govella & Fredros Okumu & Naveen Bansal & Serdar Bozdag & Jacques D Charlwood & Marta F Maia & Sheila B Ogoma & Floyd E Dowell & George F Corliss & Maggy, 2020. "An autoencoder and artificial neural network-based method to estimate parity status of wild mosquitoes from near-infrared spectra," PLOS ONE, Public Library of Science, vol. 15(6), pages 1-16, June.
    17. Daniel R Jeske, 2018. "Metrics Used When Evaluating the Performance of Statistical Classifiers," Biostatistics and Biometrics Open Access Journal, Juniper Publishers Inc., vol. 8(1), pages 7-9, August.
    18. Martijn van Hasselt & Christopher R. Bollinger & Jeremy W. Bray, 2022. "A Bayesian approach to account for misclassification in prevalence and trend estimation," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(2), pages 351-367, March.
    19. Juliet Chebet Moso & Stéphane Cormier & Cyril de Runz & Hacène Fouchal & John Mwangi Wandeto, 2021. "Anomaly Detection on Data Streams for Smart Agriculture," Agriculture, MDPI, vol. 11(11), pages 1-17, November.
    20. Deiana, Claudio & Giua, Ludovica & Nistico, Roberto, 2019. "The Economics behind the Epidemic: Afghan Opium Price and Prescription Opioids in the US," IZA Discussion Papers 12872, IZA Network @ LISER.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0248360. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.