IDEAS home Printed from https://ideas.repec.org/a/inm/ormsom/v27y2025i3p770-788.html

Policy Optimization for Personalized Interventions in Behavioral Health

Author

Listed:
  • Jackie Baek

    (Stern School of Business, New York University, New York, New York 10012)

  • Justin J. Boutilier

    (Telfer School of Management, University of Ottawa, Ottawa, Ontario K1N 9B9, Canada)

  • Vivek F. Farias

    (Sloan School of Management, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142)

  • Jónas Oddur Jónasson

    (Sloan School of Management, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142)

  • Erez Yoeli

    (Sloan School of Management, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142)

Abstract

Problem definition : Behavioral health interventions, delivered through digital platforms, have the potential to significantly improve health outcomes through education, motivation, reminders, and outreach. We study the problem of optimizing personalized interventions for patients to maximize a long-term outcome, in which interventions are costly and capacity constrained. We assume we have access to a historical data set collected from an initial pilot study. Methodology/results : We present a new approach for this problem that we dub DecompPI , which decomposes the state space for a system of patients to the individual level and then approximates one step of policy iteration. Implementing DecompPI simply consists of a prediction task using the data set, alleviating the need for online experimentation. DecompPI is a generic, model-free algorithm that can be used irrespective of the underlying patient behavior model. We derive theoretical guarantees on a simple, special case of the model that is representative of our problem setting. When the initial policy used to collect the data is randomized, we establish an approximation guarantee for DecompPI with respect to the improvement beyond a null policy that does not allocate interventions. We show that this guarantee is robust to estimation errors. We then conduct a rigorous empirical case study using real-world data from a mobile health platform for improving treatment adherence for tuberculosis. Using a validated simulation model, we demonstrate that DecompPI can provide the same efficacy as the status quo approach with approximately half the capacity of interventions. Managerial implications : DecompPI is simple and easy to implement for an organization aiming to improve long-term behavior through targeted interventions, and this paper demonstrates its strong performance both theoretically and empirically, particularly in resource-limited settings.

Suggested Citation

  • Jackie Baek & Justin J. Boutilier & Vivek F. Farias & Jónas Oddur Jónasson & Erez Yoeli, 2025. "Policy Optimization for Personalized Interventions in Behavioral Health," Manufacturing & Service Operations Management, INFORMS, vol. 27(3), pages 770-788, May.
  • Handle: RePEc:inm:ormsom:v:27:y:2025:i:3:p:770-788
    DOI: 10.1287/msom.2023.0548
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/msom.2023.0548
    Download Restriction: no

    File URL: https://libkey.io/10.1287/msom.2023.0548?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Sze-chuan Suen & Diana Negoescu & Joel Goh, 2022. "Design of Incentive Programs for Optimal Medication Adherence in the Presence of Observable Consumption," Operations Research, INFORMS, vol. 70(3), pages 1691-1716, May.
    2. Sze-chuan Suen & Margaret L. Brandeau & Jeremy D. Goldhaber-Fiebert, 2018. "Optimal timing of drug sensitivity testing for patients on first-line tuberculosis treatment," Health Care Management Science, Springer, vol. 21(4), pages 632-646, December.
    3. Josh C. D’Aeth & Shubhechyya Ghosal & Fiona Grimm & David Haw & Esma Koca & Krystal Lau & Huikang Liu & Stefano Moret & Dheeya Rizmie & Peter C. Smith & Giovanni Forchini & Marisa Miraldo & Wolfram Wi, 2023. "Optimal Hospital Care Scheduling During the SARS-CoV-2 Pandemic," Management Science, INFORMS, vol. 69(10), pages 5923-5947, October.
    4. Mills, Stuart, 2022. "Personalized nudging," Behavioural Public Policy, Cambridge University Press, vol. 6(1), pages 150-159, January.
    5. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    6. Ruggeri, Kai & Benzerga, Amel & Verra, Sanne & Folke, Tomas, 2023. "A behavioral approach to personalizing public health," Behavioural Public Policy, Cambridge University Press, vol. 7(2), pages 457-469, April.
    7. Daniel Adelman & Adam J. Mersereau, 2008. "Relaxations of Weakly Coupled Stochastic Dynamic Programs," Operations Research, INFORMS, vol. 56(3), pages 712-727, June.
    8. Justin J. Boutilier & Jónas Oddur Jónasson & Erez Yoeli, 2022. "Improving Tuberculosis Treatment Adherence Support: The Case for Targeted Behavioral Interventions," Manufacturing & Service Operations Management, INFORMS, vol. 24(6), pages 2925-2943, November.
    9. Aswani, Anil & Kaminsky, Philip & Mintz, Yonatan & Flowers, Elena & Fukuoka, Yoshimi, 2019. "Behavioral modeling in weight loss interventions," European Journal of Operational Research, Elsevier, vol. 272(3), pages 1058-1072.
    10. K.D. Glazebrook & H.M. Mitchell, 2002. "An index policy for a stochastic scheduling model with improving/deteriorating jobs," Naval Research Logistics (NRL), John Wiley & Sons, vol. 49(7), pages 706-721, October.
    11. P. S. Ansell & K. D. Glazebrook & J. Niño-Mora & M. O'Keeffe, 2003. "Whittle's index policy for a multi-class queueing system with convex holding costs," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 57(1), pages 21-39, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Emma Gibson & Sarang Deo & Jónas Oddur Jónasson & Mphatso Kachule & Kara Palamountain, 2023. "Redesigning Sample Transportation in Malawi Through Improved Data Sharing and Daily Route Optimization," Manufacturing & Service Operations Management, INFORMS, vol. 25(4), pages 1209-1226, July.
    2. Nicolas Gast & Bruno Gaujal & Kimang Khun, 2023. "Testing indexability and computing Whittle and Gittins index in subcubic time," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 97(3), pages 391-436, June.
    3. David B. Brown & Martin B. Haugh, 2017. "Information Relaxation Bounds for Infinite Horizon Markov Decision Processes," Operations Research, INFORMS, vol. 65(5), pages 1355-1379, October.
    4. José Niño-Mora, 2023. "Markovian Restless Bandits and Index Policies: A Review," Mathematics, MDPI, vol. 11(7), pages 1-27, March.
    5. K.D. Glazebrook & C. Kirkbride, 2004. "Index policies for the routing of background jobs," Naval Research Logistics (NRL), John Wiley & Sons, vol. 51(6), pages 856-872, September.
    6. Selvaprabu Nadarajah & Andre A. Cire, 2025. "Self-Adapting Network Relaxations for Weakly Coupled Markov Decision Processes," Management Science, INFORMS, vol. 71(2), pages 1779-1802, February.
    7. Leah A. Jacobs & Alec McClean & Zach Branson & Edward Kennedy & Alex Fixler, 2024. "Incremental Propensity Score Effects for Criminology: An Application Assessing the Relationship Between Homelessness, Behavioral Health Problems, and Recidivism," Journal of Quantitative Criminology, Springer, vol. 40(4), pages 707-726, December.
    8. Mao, Minghai & Raiola, Antonio & Yang, Da, 2025. "Double machine learning for Oaxaca-Blinder decomposition," Economics Letters, Elsevier, vol. 255(C).
    9. José Niño-Mora, 2006. "Restless Bandit Marginal Productivity Indices, Diminishing Returns, and Optimal Control of Make-to-Order/Make-to-Stock M/G/1 Queues," Mathematics of Operations Research, INFORMS, vol. 31(1), pages 50-84, February.
    10. Asanov, Anastasiya-Mariya & Asanov, Igor & Buenstorf, Guido, 2024. "A low-cost digital first aid tool to reduce psychological distress in refugees: A multi-country randomized controlled trial of self-help online in the first months after the invasion of Ukraine," Social Science & Medicine, Elsevier, vol. 362(C).
    11. Justin Whitehouse & Qizhao Chen & Morgane Austern & Vasilis Syrgkanis, 2025. "Inference on Optimal Policy Values and Other Irregular Functionals via Softmax Smoothing," Papers 2507.11780, arXiv.org, revised Mar 2026.
    12. Nicolaj N. Mühlbach, 2020. "Tree-based Synthetic Control Methods: Consequences of moving the US Embassy," CREATES Research Papers 2020-04, Department of Economics and Business Economics, Aarhus University.
    13. Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP72/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    14. Ruoxuan Xiong & Allison Koenecke & Michael Powell & Zhu Shen & Joshua T. Vogelstein & Susan Athey, 2021. "Federated Causal Inference in Heterogeneous Observational Data," Papers 2107.11732, arXiv.org, revised Apr 2023.
    15. Arne Henningsen & Guy Low & David Wuepper & Tobias Dalhaus & Hugo Storm & Dagim Belay & Stefan Hirsch, 2026. "Estimating Causal Effects With Observational Data: Guidelines for Agricultural and Applied Economists," Journal of Agricultural Economics, Wiley Blackwell, vol. 77(2), pages 356-382, June.
    16. Chou-Chun Wu & Yiwen Cao & Sze-chuan Suen & Eugene Lin, 2024. "Examining chronic kidney disease screening frequency among diabetics: a POMDP approach," Health Care Management Science, Springer, vol. 27(3), pages 391-414, September.
    17. Santiago R. Balseiro & David B. Brown & Chen Chen, 2021. "Dynamic Pricing of Relocating Resources in Large Networks," Management Science, INFORMS, vol. 67(7), pages 4075-4094, July.
    18. Khanh Duong, 2024. "Is meritocracy just? New evidence from Boolean analysis and Machine learning," Journal of Computational Social Science, Springer, vol. 7(2), pages 1795-1821, October.
    19. Jelena Bradic & Weijie Ji & Yuqian Zhang, 2021. "High-dimensional Inference for Dynamic Treatment Effects," Papers 2110.04924, arXiv.org, revised May 2023.
    20. Bingnan Guo & Yuren Qian & Xinyan Guo & Hao Zhang, 2025. "Impact of Zero-Waste City Pilot Policies on Urban Energy Consumption Intensity: Causal Inference Based on Double Machine Learning," Sustainability, MDPI, vol. 17(11), pages 1-25, May.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormsom:v:27:y:2025:i:3:p:770-788. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.