IDEAS home Printed from https://ideas.repec.org/a/inm/ormsom/v27y2025i3p770-788.html

Policy Optimization for Personalized Interventions in Behavioral Health

Author

Listed:
  • Jackie Baek

    (Stern School of Business, New York University, New York, New York 10012)

  • Justin J. Boutilier

    (Telfer School of Management, University of Ottawa, Ottawa, Ontario K1N 9B9, Canada)

  • Vivek F. Farias

    (Sloan School of Management, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142)

  • Jónas Oddur Jónasson

    (Sloan School of Management, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142)

  • Erez Yoeli

    (Sloan School of Management, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142)

Abstract

Problem definition : Behavioral health interventions, delivered through digital platforms, have the potential to significantly improve health outcomes through education, motivation, reminders, and outreach. We study the problem of optimizing personalized interventions for patients to maximize a long-term outcome, in which interventions are costly and capacity constrained. We assume we have access to a historical data set collected from an initial pilot study. Methodology/results : We present a new approach for this problem that we dub DecompPI , which decomposes the state space for a system of patients to the individual level and then approximates one step of policy iteration. Implementing DecompPI simply consists of a prediction task using the data set, alleviating the need for online experimentation. DecompPI is a generic, model-free algorithm that can be used irrespective of the underlying patient behavior model. We derive theoretical guarantees on a simple, special case of the model that is representative of our problem setting. When the initial policy used to collect the data is randomized, we establish an approximation guarantee for DecompPI with respect to the improvement beyond a null policy that does not allocate interventions. We show that this guarantee is robust to estimation errors. We then conduct a rigorous empirical case study using real-world data from a mobile health platform for improving treatment adherence for tuberculosis. Using a validated simulation model, we demonstrate that DecompPI can provide the same efficacy as the status quo approach with approximately half the capacity of interventions. Managerial implications : DecompPI is simple and easy to implement for an organization aiming to improve long-term behavior through targeted interventions, and this paper demonstrates its strong performance both theoretically and empirically, particularly in resource-limited settings.

Suggested Citation

  • Jackie Baek & Justin J. Boutilier & Vivek F. Farias & Jónas Oddur Jónasson & Erez Yoeli, 2025. "Policy Optimization for Personalized Interventions in Behavioral Health," Manufacturing & Service Operations Management, INFORMS, vol. 27(3), pages 770-788, May.
  • Handle: RePEc:inm:ormsom:v:27:y:2025:i:3:p:770-788
    DOI: 10.1287/msom.2023.0548
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/msom.2023.0548
    Download Restriction: no

    File URL: https://libkey.io/10.1287/msom.2023.0548?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Mills, Stuart, 2022. "Personalized nudging," Behavioural Public Policy, Cambridge University Press, vol. 6(1), pages 150-159, January.
    2. Ruggeri, Kai & Benzerga, Amel & Verra, Sanne & Folke, Tomas, 2023. "A behavioral approach to personalizing public health," Behavioural Public Policy, Cambridge University Press, vol. 7(2), pages 457-469, April.
    3. Aswani, Anil & Kaminsky, Philip & Mintz, Yonatan & Flowers, Elena & Fukuoka, Yoshimi, 2019. "Behavioral modeling in weight loss interventions," European Journal of Operational Research, Elsevier, vol. 272(3), pages 1058-1072.
    4. P. S. Ansell & K. D. Glazebrook & J. Niño-Mora & M. O'Keeffe, 2003. "Whittle's index policy for a multi-class queueing system with convex holding costs," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 57(1), pages 21-39, April.
    5. Sze-chuan Suen & Diana Negoescu & Joel Goh, 2022. "Design of Incentive Programs for Optimal Medication Adherence in the Presence of Observable Consumption," Operations Research, INFORMS, vol. 70(3), pages 1691-1716, May.
    6. Sze-chuan Suen & Margaret L. Brandeau & Jeremy D. Goldhaber-Fiebert, 2018. "Optimal timing of drug sensitivity testing for patients on first-line tuberculosis treatment," Health Care Management Science, Springer, vol. 21(4), pages 632-646, December.
    7. Josh C. D’Aeth & Shubhechyya Ghosal & Fiona Grimm & David Haw & Esma Koca & Krystal Lau & Huikang Liu & Stefano Moret & Dheeya Rizmie & Peter C. Smith & Giovanni Forchini & Marisa Miraldo & Wolfram Wi, 2023. "Optimal Hospital Care Scheduling During the SARS-CoV-2 Pandemic," Management Science, INFORMS, vol. 69(10), pages 5923-5947, October.
    8. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    9. Daniel Adelman & Adam J. Mersereau, 2008. "Relaxations of Weakly Coupled Stochastic Dynamic Programs," Operations Research, INFORMS, vol. 56(3), pages 712-727, June.
    10. Justin J. Boutilier & Jónas Oddur Jónasson & Erez Yoeli, 2022. "Improving Tuberculosis Treatment Adherence Support: The Case for Targeted Behavioral Interventions," Manufacturing & Service Operations Management, INFORMS, vol. 24(6), pages 2925-2943, November.
    11. K.D. Glazebrook & H.M. Mitchell, 2002. "An index policy for a stochastic scheduling model with improving/deteriorating jobs," Naval Research Logistics (NRL), John Wiley & Sons, vol. 49(7), pages 706-721, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nicolas Gast & Bruno Gaujal & Kimang Khun, 2023. "Testing indexability and computing Whittle and Gittins index in subcubic time," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 97(3), pages 391-436, June.
    2. Emma Gibson & Sarang Deo & Jónas Oddur Jónasson & Mphatso Kachule & Kara Palamountain, 2023. "Redesigning Sample Transportation in Malawi Through Improved Data Sharing and Daily Route Optimization," Manufacturing & Service Operations Management, INFORMS, vol. 25(4), pages 1209-1226, July.
    3. David B. Brown & Martin B. Haugh, 2017. "Information Relaxation Bounds for Infinite Horizon Markov Decision Processes," Operations Research, INFORMS, vol. 65(5), pages 1355-1379, October.
    4. José Niño-Mora, 2023. "Markovian Restless Bandits and Index Policies: A Review," Mathematics, MDPI, vol. 11(7), pages 1-27, March.
    5. K.D. Glazebrook & C. Kirkbride, 2004. "Index policies for the routing of background jobs," Naval Research Logistics (NRL), John Wiley & Sons, vol. 51(6), pages 856-872, September.
    6. Selvaprabu Nadarajah & Andre A. Cire, 2025. "Self-Adapting Network Relaxations for Weakly Coupled Markov Decision Processes," Management Science, INFORMS, vol. 71(2), pages 1779-1802, February.
    7. Leah A. Jacobs & Alec McClean & Zach Branson & Edward Kennedy & Alex Fixler, 2024. "Incremental Propensity Score Effects for Criminology: An Application Assessing the Relationship Between Homelessness, Behavioral Health Problems, and Recidivism," Journal of Quantitative Criminology, Springer, vol. 40(4), pages 707-726, December.
    8. José Niño-Mora, 2006. "Restless Bandit Marginal Productivity Indices, Diminishing Returns, and Optimal Control of Make-to-Order/Make-to-Stock M/G/1 Queues," Mathematics of Operations Research, INFORMS, vol. 31(1), pages 50-84, February.
    9. Asanov, Anastasiya-Mariya & Asanov, Igor & Buenstorf, Guido, 2024. "A low-cost digital first aid tool to reduce psychological distress in refugees: A multi-country randomized controlled trial of self-help online in the first months after the invasion of Ukraine," Social Science & Medicine, Elsevier, vol. 362(C).
    10. Justin Whitehouse & Qizhao Chen & Morgane Austern & Vasilis Syrgkanis, 2025. "Inference on Optimal Policy Values and Other Irregular Functionals via Softmax Smoothing," Papers 2507.11780, arXiv.org, revised Mar 2026.
    11. Santiago R. Balseiro & David B. Brown & Chen Chen, 2021. "Dynamic Pricing of Relocating Resources in Large Networks," Management Science, INFORMS, vol. 67(7), pages 4075-4094, July.
    12. Khanh Duong, 2024. "Is meritocracy just? New evidence from Boolean analysis and Machine learning," Journal of Computational Social Science, Springer, vol. 7(2), pages 1795-1821, October.
    13. Davide Viviano & Jelena Bradic, 2019. "Synthetic learner: model-free inference on treatments over time," Papers 1904.01490, arXiv.org, revised Aug 2022.
    14. Yoganathan, Vignesh & Osburg, Victoria-Sophie, 2024. "The mind in the machine: Estimating mind perception's effect on user satisfaction with voice-based conversational agents," Journal of Business Research, Elsevier, vol. 175(C).
    15. Sung Jae Jun & Sokbae Lee, 2024. "Causal Inference Under Outcome-Based Sampling with Monotonicity Assumptions," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 42(3), pages 998-1009, July.
    16. Guo, Jiaqi & Wang, Qiang & Li, Rongrong, 2024. "Can official development assistance promote renewable energy in sub-Saharan Africa countries? A matter of institutional transparency of recipient countries," Energy Policy, Elsevier, vol. 186(C).
    17. Konstantin Avrachenkov & Vivek S. Borkar & Pratik Shah, 2026. "Lagrangian index policy for restless bandits with average reward," Queueing Systems: Theory and Applications, Springer, vol. 110(1), pages 1-34, March.
    18. Tim Munday & James Brookes, 2021. "Mark my words: the transmission of central bank communication to the general public via the print media," Bank of England working papers 944, Bank of England.
    19. Juan Carlos Escanciano & Telmo P'erez-Izquierdo, 2023. "Automatic Locally Robust GMM with Machine-Learning-Generated Regressors," Papers 2301.10643, arXiv.org, revised Mar 2026.
    20. Wei Wen & Kangan Jiang & Xiaojing Shao, 2025. "The Impact of Big Data Pilot Zones on Urban Ecological Resilience: Evidence from a Machine Learning Approach," Sustainability, MDPI, vol. 17(7), pages 1-22, March.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormsom:v:27:y:2025:i:3:p:770-788. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.