IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0312046.html

Analyzing and predicting short-term substance use behaviors of persons who use drugs in the great plains of the U.S

Author

Listed:
  • Nguyen Thach
  • Patrick Habecker
  • Bergen Johnston
  • Lillianna Cervantes
  • Anika Eisenbraun
  • Alex Mason
  • Kimberly Tyler
  • Bilal Khan
  • Hau Chan

Abstract

Background: Substance use induces large economic and societal costs in the U.S. Understanding the change in substance use behaviors of persons who use drugs (PWUDs) over time, therefore, is important in order to inform healthcare providers, policymakers, and other stakeholders toward more efficient allocation of limited resources to at-risk PWUDs. Objective: This study examines the short-term (within a year) behavioral changes in substance use of PWUDs at the population and individual levels. Methods: 237 PWUDs in the Great Plains of the U.S. were recruited by our team. The sample provides us longitudinal survey data regarding their individual attributes, including drug use behaviors, at two separate time periods spanning 4-12 months. At the population level, we analyze our data quantitatively for 18 illicit drugs; then, at the individual level, we build interpretable machine learning logistic regression and decision tree models for identifying relevant attributes to predict, for a given PWUD, (i) which drug(s) they would likely use and (ii) which drug(s) they would likely increase usage within the next 12 months. All predictive models were evaluated by computing the (averaged) Area under the Receiver Operating Characteristic curve (AUROC) and Area under the Precision-Recall curve (AUPR) on multiple distinct sets of hold-out sample. Results: At the population level, the extent of usage change and the number of drugs exhibiting usage changes follow power-law distributions. At the individual level, AUROC’s of the models for the top-4 prevalent drugs (marijuana, methamphetamines, amphetamines, and cocaine) range 0.756-0.829 (+2.88-7.66% improvement with respect to baseline models using only current usage of the respective drugs as input) for (i) and 0.670-0.765 (+4.34-18.0%) for (ii). The corresponding AUPR’s of the said models range 0.729-0.947 (+2.49-13.6%) for (i) and 0.348-0.618 (+26.9-87.6%) for (ii). Conclusion: The observed qualitative changes in short-term substance usage and the trained predictive models for (i) and (ii) can potentially inform human decision-making toward efficient allocation of appropriate resources to PWUDs at highest risk.

Suggested Citation

  • Nguyen Thach & Patrick Habecker & Bergen Johnston & Lillianna Cervantes & Anika Eisenbraun & Alex Mason & Kimberly Tyler & Bilal Khan & Hau Chan, 2024. "Analyzing and predicting short-term substance use behaviors of persons who use drugs in the great plains of the U.S," PLOS ONE, Public Library of Science, vol. 19(11), pages 1-33, November.
  • Handle: RePEc:plo:pone00:0312046
    DOI: 10.1371/journal.pone.0312046
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0312046
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0312046&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0312046?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Robert Whelan & Richard Watts & Catherine A. Orr & Robert R. Althoff & Eric Artiges & Tobias Banaschewski & Gareth J. Barker & Arun L. W. Bokde & Christian Büchel & Fabiana M. Carvalho & Patricia J. C, 2014. "Neuropsychosocial profiles of current and future adolescent alcohol misusers," Nature, Nature, vol. 512(7513), pages 185-189, August.
    2. van Buuren, Stef & Groothuis-Oudshoorn, Karin, 2011. "mice: Multivariate Imputation by Chained Equations in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 45(i03).
    3. Fujimoto, Kayo & Valente, Thomas W., 2012. "Social network influences on adolescent substance use: Disentangling structural equivalence from cohesion," Social Science & Medicine, Elsevier, vol. 74(12), pages 1952-1960.
    4. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    5. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    6. Arteaga, Irma & Chen, Chin-Chih & Reynolds, Arthur J., 2010. "Childhood predictors of adult substance abuse," Children and Youth Services Review, Elsevier, vol. 32(8), pages 1108-1120, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christopher J Greenwood & George J Youssef & Primrose Letcher & Jacqui A Macdonald & Lauryn J Hagg & Ann Sanson & Jenn Mcintosh & Delyse M Hutchinson & John W Toumbourou & Matthew Fuller-Tyszkiewicz &, 2020. "A comparison of penalised regression methods for informing the selection of predictive markers," PLOS ONE, Public Library of Science, vol. 15(11), pages 1-14, November.
    2. Christopher Kath & Florian Ziel, 2018. "The value of forecasts: Quantifying the economic gains of accurate quarter-hourly electricity price forecasts," Papers 1811.08604, arXiv.org.
    3. Kath, Christopher & Ziel, Florian, 2018. "The value of forecasts: Quantifying the economic gains of accurate quarter-hourly electricity price forecasts," Energy Economics, Elsevier, vol. 76(C), pages 411-423.
    4. Chenguang Zhang & Masayuki Nigo & Shivani Patel & Duo Yu & Edward Septimus & Hulin Wu, 2024. "Use of Real-World EMR Data to Rapidly Evaluate Treatment Effects of Existing Drugs for Emerging Infectious Diseases: Remdesivir for COVID-19 Treatment as an Example," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 16(3), pages 604-633, December.
    5. Xiaojun Mao & Zhonglei Wang & Shu Yang, 2023. "Matrix completion under complex survey sampling," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(3), pages 463-492, June.
    6. Danhyang Lee & Jae Kwang Kim, 2022. "Semiparametric imputation using conditional Gaussian mixture models under item nonresponse," Biometrics, The International Biometric Society, vol. 78(1), pages 227-237, March.
    7. Liang, Lixing & Zhuang, Yipeng & Yu, Philip L.H., 2024. "Variable selection for high-dimensional incomplete data," Computational Statistics & Data Analysis, Elsevier, vol. 192(C).
    8. Ballering, Aranka V. & Bonvanie, Irma J. & Olde Hartman, Tim C. & Monden, Rei & Rosmalen, Judith G.M., 2020. "Gender and sex independently associate with common somatic symptoms and lifetime prevalence of chronic disease," Social Science & Medicine, Elsevier, vol. 253(C).
    9. Lee, Serim, 2025. "Development of deep learning auto-encoder algorithms for predicting alcohol use in Korean adolescents based on cross-sectional data," Social Science & Medicine, Elsevier, vol. 367(C).
    10. Halewijn M. Drent & Barbara van den Hoofdakker & Jan K. Buitelaar & Pieter J. Hoekstra & Andrea Dietrich, 2022. "Factors Related to Perceived Stigma in Parents of Children and Adolescents in Outpatient Mental Healthcare," IJERPH, MDPI, vol. 19(19), pages 1-14, October.
    11. Faisal Maqbool Zahid & Shahla Faisal & Christian Heumann, 2020. "Variable selection techniques after multiple imputation in high-dimensional data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 29(3), pages 553-580, September.
    12. Michael Bergrab & Christian Aßmann, 2024. "Automated Bayesian variable selection methods for binary regression models with missing covariate data," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 18(2), pages 203-244, June.
    13. David Neuhausler & Jungyeon Yoon & Richard A. Levine & Juanjuan Fan, 2025. "Prediction of Delisting Using a Machine Learning Ensemble," International Journal of Statistics and Probability, Canadian Center of Science and Education, vol. 14(3), pages 1-58, October.
    14. Stephanie Houle & Ryan Macdonald, 2023. "Identifying Nascent High-Growth Firms Using Machine Learning," Staff Working Papers 23-53, Bank of Canada.
    15. Hua Yun Chen & Hesen Li & Maria Argos & Victoria W. Persky & Mary E. Turyk, 2022. "Statistical Methods for Assessing the Explained Variation of a Health Outcome by a Mixture of Exposures," IJERPH, MDPI, vol. 19(5), pages 1-16, February.
    16. Dylan Clark-Boucher & Jonathan Boss & Maxwell Salvatore & Jennifer A Smith & Lars G Fritsche & Bhramar Mukherjee, 2022. "Assessing the added value of linking electronic health records to improve the prediction of self-reported COVID-19 testing and diagnosis," PLOS ONE, Public Library of Science, vol. 17(7), pages 1-19, July.
    17. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    18. Hauzenberger, Niko & Huber, Florian & Klieber, Karin & Marcellino, Massimiliano, 2025. "Bayesian neural networks for macroeconomic analysis," Journal of Econometrics, Elsevier, vol. 249(PC).
    19. Oxana Babecka Kucharcukova & Jan Bruha, 2016. "Nowcasting the Czech Trade Balance," Working Papers 2016/11, Czech National Bank, Research and Statistics Department.
    20. Carstensen, Kai & Heinrich, Markus & Reif, Magnus & Wolters, Maik H., 2020. "Predicting ordinary and severe recessions with a three-state Markov-switching dynamic factor model," International Journal of Forecasting, Elsevier, vol. 36(3), pages 829-850.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0312046. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.