IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2509.20634.html

Recidivism and Peer Influence with LLM Text Embeddings in Low Security Correctional Facilities

Authors

  • Shanjukta Nath
  • Jiwon Hong
  • Jae Ho Chang
  • Keith Warren
  • Subhadeep Paul

Abstract

We find AI embeddings obtained using a pre-trained transformer-based Large Language Model (LLM) of 80,000-120,000 written affirmations and correction exchanges among residents in low-security correctional facilities to be highly predictive of recidivism. The prediction accuracy is 30% higher with embedding vectors than with only pre-entry covariates. However, since the text embedding vectors are high-dimensional, we perform Zero-Shot classification of these texts to a low-dimensional vector of user-defined classes to aid interpretation while retaining the predictive power. To shed light on the social dynamics inside the correctional facilities, we estimate peer effects in these LLM-generated numerical representations of language with a multivariate peer effect model, adjusting for network endogeneity. We develop new methodology and theory for peer effect estimation that accommodate sparse networks, multivariate latent variables, and correlated multivariate outcomes. With these new methods, we find significant peer effects in language usage for interaction and feedback.
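
As a rough illustration of the dimension-reduction step the abstract describes, the sketch below maps one text embedding to a short probability vector over user-defined classes. It is not the authors' procedure: the abstract does not say how their Zero-Shot classifier works, so the cosine-similarity-plus-softmax scoring, the function names (`cosine`, `zero_shot_scores`), the `temperature` parameter, the class names, and the toy 4-dimensional vectors are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def zero_shot_scores(text_vec, class_vecs, temperature=0.1):
    """Map one text embedding to a probability vector over classes.

    class_vecs maps a class name to the embedding of its description.
    The scoring rule (softmax over scaled cosine similarities) is an
    assumed stand-in, not the method used in the paper.
    """
    names = list(class_vecs)
    logits = [cosine(text_vec, class_vecs[n]) / temperature for n in names]
    largest = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - largest) for z in logits]
    total = sum(exps)
    return {n: e / total for n, e in zip(names, exps)}


# Hypothetical class names and toy vectors standing in for real LLM embeddings.
class_vecs = {
    "support": [1.0, 0.0, 0.0, 0.0],
    "feedback": [0.0, 1.0, 0.0, 0.0],
    "rule_violation": [0.0, 0.0, 1.0, 0.0],
}
text_vec = [0.9, 0.3, 0.1, 0.2]

scores = zero_shot_scores(text_vec, class_vecs)
top_class = max(scores, key=scores.get)
print(top_class, {name: round(p, 3) for name, p in scores.items()})
```

Under these assumptions, each text is summarized by as many numbers as there are classes, which is what makes the representation easier to interpret than the raw high-dimensional embedding.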

Suggested Citation

  • Shanjukta Nath & Jiwon Hong & Jae Ho Chang & Keith Warren & Subhadeep Paul, 2025. "Recidivism and Peer Influence with LLM Text Embeddings in Low Security Correctional Facilities," Papers 2509.20634, arXiv.org.
  • Handle: RePEc:arx:papers:2509.20634

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2509.20634
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Yann Bramoullé & Habiba Djebbari & Bernard Fortin, 2020. "Peer Effects in Networks: A Survey," Annual Review of Economics, Annual Reviews, vol. 12(1), pages 603-629, August.
    2. Eleni Kalamara & Arthur Turrell & Chris Redl & George Kapetanios & Sujit Kapadia, 2022. "Making text count: Economic forecasting using newspaper text," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 896-919, August.
    3. Paul Brehm & Martin Saavedra, 2025. "Vaccines and Verdicts: How Smallpox Court Decisions Affect Anti-Vaccine Discourse and Mortality," The Economic Journal, Royal Economic Society, vol. 135(668), pages 1229-1260.
    4. Bramoullé, Yann & Djebbari, Habiba & Fortin, Bernard, 2009. "Identification of peer effects through social networks," Journal of Econometrics, Elsevier, vol. 150(1), pages 41-55, May.
    5. Kevin T. Schnepel, 2018. "Good Jobs and Recidivism," Economic Journal, Royal Economic Society, vol. 128(608), pages 447-469, February.
    6. Paul Goldsmith-Pinkham & Guido W. Imbens, 2013. "Social Networks and the Identification of Peer Effects," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 31(3), pages 253-264, July.
    7. Ida Johnsson & Hyungsik Roger Moon, 2021. "Estimation of Peer Effects in Endogenous Social Networks: Control Function Approach," The Review of Economics and Statistics, MIT Press, vol. 103(2), pages 328-345, May.
    8. Fangzheng Xie & Yanxun Xu, 2023. "Efficient Estimation for Random Dot Product Graphs via a One-Step Procedure," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(541), pages 651-664, January.
    9. Eric Auerbach, 2022. "Identification and Estimation of a Partially Linear Regression Model Using Network Data," Econometrica, Econometric Society, vol. 90(1), pages 347-365, January.
    10. Ethan Cohen‐Cole & Xiaodong Liu & Yves Zenou, 2018. "Multivariate choices and identification of social interactions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 33(2), pages 165-178, March.
    11. Quentin Lippmann, 2022. "Gender and lawmaking in times of quotas," Post-Print hal-04120482, HAL.
    12. Yukhnenko, Denis & Farouki, Leen & Fazel, Seena, 2023. "Criminal recidivism rates globally: A 6-year systematic review update," Journal of Criminal Justice, Elsevier, vol. 88(C).
    13. Crystal S. Yang, 2017. "Does Public Assistance Reduce Recidivism?," American Economic Review, American Economic Association, vol. 107(5), pages 551-555, May.
    14. Elliott Ash & Daniel L. Chen & Suresh Naidu, 2022. "Ideas Have Consequences : The Impact of Law and Economics on American Justice," Working Papers hal-03899739, HAL.
    15. Gloria Gennaro & Elliott Ash, 2022. "Emotion and Reason in Political Language," The Economic Journal, Royal Economic Society, vol. 132(643), pages 1037-1059.
    16. Logan M. Lee, 2023. "Halfway Home? Residential Housing and Reincarceration," American Economic Journal: Applied Economics, American Economic Association, vol. 15(3), pages 117-149, July.
    17. Zubin Jelveh & Bruce Kogut & Suresh Naidu, 2024. "Political Language in Economics," The Economic Journal, Royal Economic Society, vol. 134(662), pages 2439-2469.
    18. Cody Tuttle, 2019. "Snapping Back: Food Stamp Bans and Criminal Recidivism," American Economic Journal: Economic Policy, American Economic Association, vol. 11(2), pages 301-327, May.
    19. Edward McFowland & Cosma Rohilla Shalizi, 2023. "Estimating Causal Peer Influence in Homophilous Social Networks by Inferring Latent Locations," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(541), pages 707-718, January.
    20. Patrick Rubin‐Delanchy & Joshua Cape & Minh Tang & Carey E. Priebe, 2022. "A statistical interpretation of spectral embedding: The generalised random dot product graph," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(4), pages 1446-1473, September.
    21. Jennifer L. Doleac, 2023. "Encouraging Desistance from Crime," Journal of Economic Literature, American Economic Association, vol. 61(2), pages 383-427, June.
    22. Lippmann, Quentin, 2022. "Gender and lawmaking in times of quotas," Journal of Public Economics, Elsevier, vol. 207(C).
    23. Zhu, Xuening & Huang, Danyang & Pan, Rui & Wang, Hansheng, 2020. "Multivariate spatial autoregressive model for large scale social networks," Journal of Econometrics, Elsevier, vol. 215(2), pages 591-606.
    24. J Cape & M Tang & C E Priebe, 2019. "Signal-plus-noise matrix models: eigenvector deviations and fluctuations," Biometrika, Biometrika Trust, vol. 106(1), pages 243-250.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tiziano Arduini & Edoardo Rainone, 2024. "Partial identification of treatment response under complementarity and substitutability," Temi di discussione (Economic working papers) 1473, Bank of Italy, Economic Research and International Relations Area.
    2. Jochmans, Koen, 2023. "Peer effects and endogenous social interactions," Journal of Econometrics, Elsevier, vol. 235(2), pages 1203-1214.
    3. Nicolas Debarsy & Julie Le Gallo, 2024. "Identification of spatial spillovers: Do’s and don'ts," Working Papers hal-04549691, HAL.
    4. Diemer, Andreas, 2022. "Endogenous peer effects in diverse friendship networks: Evidence from Swedish classrooms," Economics of Education Review, Elsevier, vol. 89(C).
    5. Alejandro Sanchez-Becerra, 2022. "The Network Propensity Score: Spillovers, Homophily, and Selection into Treatment," Papers 2209.14391, arXiv.org.
    6. Yann Bramoullé & Habiba Djebbari & Bernard Fortin, 2020. "Peer Effects in Networks: A Survey," Annual Review of Economics, Annual Reviews, vol. 12(1), pages 603-629, August.
    7. William W. Wang & Ali Jadbabaie, 2025. "Weak Identification in Peer Effects Estimation," Papers 2508.04897, arXiv.org.
    8. Chih‐Sheng Hsieh & Lung‐Fei Lee & Vincent Boucher, 2020. "Specification and estimation of network formation and network interaction models with the exponential probability distribution," Quantitative Economics, Econometric Society, vol. 11(4), pages 1349-1390, November.
    9. Alejandra Agustina Martínez, 2023. "Raise your Voice! Activism and Peer Effects in Online Social Networks," Working Papers 277, Red Nacional de Investigadores en Economía (RedNIE).
    10. William C. Horrace & Hyunseok Jung & Jonathan L. Presler & Amy Ellen Schwartz, 2025. "What makes a classmate a peer? Examining which peers matter in NYC elementary schools," Journal of Population Economics, Springer;European Society for Population Economics, vol. 38(3), pages 1-28, September.
    11. Lan, Jing & Liu, Zhen, 2019. "Social network effect on income structure of SLCP participants: Evidence from Baitoutan Village, China," Forest Policy and Economics, Elsevier, vol. 106(C), pages 1-1.
    12. Gao, Wayne Yuan & Li, Ming & Xu, Sheng, 2023. "Logical differencing in dyadic network formation models with nontransferable utilities," Journal of Econometrics, Elsevier, vol. 235(1), pages 302-324.
    13. Wennberg, Karl & Norgren, Axel, 2021. "Models of Peer Effects in Education," Working Papers 21/3, Stockholm School of Economics, Center for Educational Leadership and Excellence.
    14. William C. Horrace & Hyunseok Jung & Shane Sanders, 2022. "Network Competition and Team Chemistry in the NBA," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(1), pages 35-49, January.
    15. Alejandra Agustina Martínez, 2023. "Raise your voice! Activism and peer effects in online social networks," Discussion Papers 2023-05, Nottingham Interdisciplinary Centre for Economic and Political Research (NICEP).
    16. Shuyang Sheng & Xiaoting Sun, 2023. "Social Interactions in Endogenous Groups," Papers 2306.01544, arXiv.org, revised May 2025.
    17. Brice Romuald Gueyap Kounga, 2023. "Identification and Estimation of a Semiparametric Logit Model using Network Data," Papers 2310.07151, arXiv.org, revised Jun 2024.
    18. Marguerite Burns & Laura Dague, 2023. "In-Kind Welfare Benefits and Reincarceration Risk: Evidence from Medicaid," NBER Working Papers 31394, National Bureau of Economic Research, Inc.
    19. Sadat Reza & Puneet Manchanda & Juin-Kuan Chong, 2021. "Identification and Estimation of Endogenous Peer Effects Using Partial Network Data from Multiple Reference Groups," Management Science, INFORMS, vol. 67(8), pages 5070-5105, August.
    20. Vazquez-Bare, Gonzalo, 2023. "Identification and estimation of spillover effects in randomized experiments," Journal of Econometrics, Elsevier, vol. 237(1).


    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.