IDEAS home Printed from https://ideas.repec.org/a/inm/oropre/v56y2008i6p1382-1392.html
   My bibliography  Save this article

Algorithmic Prediction of Health-Care Costs

Author

Listed:
  • Dimitris Bertsimas

    (Operations Research Center, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139)

  • Margrét V. Bjarnadóttir

    (Stanford Graduate School of Business, Stanford, California 94305)

  • Michael A. Kane

    (Medical Department, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139)

  • J. Christian Kryder

    (D2Hawkeye, Waltham, Massachusetts 02453)

  • Rudra Pandey

    (D2Hawkeye, Waltham, Massachusetts 02453)

  • Santosh Vempala

    (ARC ThinkTank, Georgia Institute of Technology, Atlanta, Georgia 30332)

  • Grant Wang

    (Electrical Engineering and Computer Science Department, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139)

Abstract

The rising cost of health care is one of the world's most important problems. Accordingly, predicting such costs with accuracy is a significant first step in addressing this problem. Since the 1980s, there has been research on the predictive modeling of medical costs based on (health insurance) claims data using heuristic rules and regression methods. These methods, however, have not been appropriately validated using populations that the methods have not seen. We utilize modern data-mining methods, specifically classification trees and clustering algorithms, along with claims data from over 800,000 insured individuals over three years, to provide rigorously validated predictions of health-care costs in the third year, based on medical and cost data from the first two years. We quantify the accuracy of our predictions using unseen (out-of-sample) data from over 200,000 members. The key findings are: (a) our data-mining methods provide accurate predictions of medical costs and represent a powerful tool for prediction of health-care costs, (b) the pattern of past cost data is a strong predictor of future costs, and (c) medical information only contributes to accurate prediction of medical costs of high-cost members.

Suggested Citation

  • Dimitris Bertsimas & Margrét V. Bjarnadóttir & Michael A. Kane & J. Christian Kryder & Rudra Pandey & Santosh Vempala & Grant Wang, 2008. "Algorithmic Prediction of Health-Care Costs," Operations Research, INFORMS, vol. 56(6), pages 1382-1392, December.
  • Handle: RePEc:inm:oropre:v:56:y:2008:i:6:p:1382-1392
    DOI: 10.1287/opre.1080.0619
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/opre.1080.0619
    Download Restriction: no

    File URL: https://libkey.io/10.1287/opre.1080.0619?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Kim H. & Loh W.Y., 2001. "Classification Trees With Unbiased Multiway Splits," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 589-604, June.
    2. Van de ven, Wynand P.M.M. & Ellis, Randall P., 2000. "Risk adjustment in competitive health plan markets," Handbook of Health Economics, in: A. J. Culyer & J. P. Newhouse (ed.), Handbook of Health Economics, edition 1, volume 1, chapter 14, pages 755-845, Elsevier.
    3. Manning, Willard G. & Mullahy, John, 2001. "Estimating log models: to transform or not to transform?," Journal of Health Economics, Elsevier, vol. 20(4), pages 461-494, July.
    4. A. J. Culyer & J. P. Newhouse (ed.), 2000. "Handbook of Health Economics," Handbook of Health Economics, Elsevier, edition 1, volume 1, number 1.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Adam Maidman & Lan Wang, 2018. "New semiparametric method for predicting high‐cost patients," Biometrics, The International Biometric Society, vol. 74(3), pages 1104-1111, September.
    2. Sriubaite, I. & Harris, A. & Jones, A.M. & Gabbe, B., 2020. "Economic Consequences of Road Traffic Injuries. Application of the Super Learner algorithm," Health, Econometrics and Data Group (HEDG) Working Papers 20/20, HEDG, c/o Department of Economics, University of York.
    3. Florian Buchner & Jürgen Wasem & Sonja Schillo, 2017. "Regression Trees Identify Relevant Interactions: Can This Improve the Predictive Performance of Risk Adjustment?," Health Economics, John Wiley & Sons, Ltd., vol. 26(1), pages 74-85, January.
    4. Yeongah Choi & Jiho An & Seiyoung Ryu & Jaekyeong Kim, 2022. "Development and Evaluation of Machine Learning-Based High-Cost Prediction Model Using Health Check-Up Data by the National Health Insurance Service of Korea," IJERPH, MDPI, vol. 19(20), pages 1-16, October.
    5. D. Cattel & R. C. Kleef & R. C. J. A. Vliet, 2017. "A method to simulate incentives for cost containment under various cost sharing designs: an application to a first-euro deductible and a doughnut hole," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 18(8), pages 987-1000, November.
    6. Sungchul Park & Anirban Basu, 2018. "Alternative evaluation metrics for risk adjustment methods," Health Economics, John Wiley & Sons, Ltd., vol. 27(6), pages 984-1010, June.
    7. Lennon, Conor, 2021. "Are the costs of employer-sponsored health insurance passed on to workers at the individual level?," Economics & Human Biology, Elsevier, vol. 41(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jones, A. & Lomas, J. & Rice, N., 2014. "Going Beyond the Mean in Healthcare Cost Regressions: a Comparison of Methods for Estimating the Full Conditional Distribution," Health, Econometrics and Data Group (HEDG) Working Papers 14/26, HEDG, c/o Department of Economics, University of York.
    2. Jones, A.M, 2010. "Models For Health Care," Health, Econometrics and Data Group (HEDG) Working Papers 10/01, HEDG, c/o Department of Economics, University of York.
    3. Jacob Glazer & Thomas G. McGuire & Julie Shi, 2016. "Risk Adjustment of Health Plan Payments to Correct Inefficient Plan Choice from Adverse Selection," NBER Chapters, in: Measuring and Modeling Health Care Costs, pages 379-418, National Bureau of Economic Research, Inc.
    4. R. Halbersma & M. Mikkers & E. Motchenkova & I. Seinen, 2011. "Market structure and hospital–insurer bargaining in the Netherlands," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 12(6), pages 589-603, December.
    5. Amir Marashi & Shima Ghassem Pour & Vincy Li & Chris Rissel & Federico Girosi, 2019. "The association between physical activity and hospital payments for acute admissions in the Australian population aged 45 and over," PLOS ONE, Public Library of Science, vol. 14(6), pages 1-16, June.
    6. Pilny, Adam & Wübker, Ansgar & Ziebarth, Nicolas R., 2017. "Introducing risk adjustment and free health plan choice in employer-based health insurance: Evidence from Germany," Journal of Health Economics, Elsevier, vol. 56(C), pages 330-351.
    7. Eijkenaar, Frank & van Vliet, René C.J.A., 2017. "Improving risk equalization for individuals with persistently high costs: Experiences from the Netherlands," Health Policy, Elsevier, vol. 121(11), pages 1169-1176.
    8. Hao Yu, 2017. "China’s medical savings accounts: an analysis of the price elasticity of demand for health care," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 18(6), pages 773-785, July.
    9. Cantoni, Eva & Ronchetti, Elvezio, 2006. "A robust approach for skewed and heavy-tailed outcomes in the analysis of health care expenditures," Journal of Health Economics, Elsevier, vol. 25(2), pages 198-213, March.
    10. Belli, Paolo, 2001. "How adverse selection affects the health insurance market," Policy Research Working Paper Series 2574, The World Bank.
    11. Beck, Konstantin & Trottmann, Maria & Zweifel, Peter, 2010. "Risk adjustment in health insurance and its long-term effectiveness," Journal of Health Economics, Elsevier, vol. 29(4), pages 489-498, July.
    12. Kathryn Antioch & Randall Ellis & Steve Gillett & Daniel Borovnicar & Ric Marshall, 2007. "Risk adjustment policy options for casemix funding: international lessons in financing reform," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 8(3), pages 195-212, September.
    13. Lehmann, Hansjorg & Zweifel, Peter, 2004. "Innovation and risk selection in deregulated social health insurance," Journal of Health Economics, Elsevier, vol. 23(5), pages 997-1012, September.
    14. Simón Ramírez Amaya & Adolfo J. Quiroz & Álvaro José Riascos Villegas, 2019. "Regression by clustering using metropolis-hastings," Documentos de Trabajo 18180, Quantil.
    15. Keane, Michael, 2004. "Modeling Health Insurance Choice Using the Heterogeneous Logit Model," MPRA Paper 55203, University Library of Munich, Germany.
    16. Karen Eggleston & Randall P. Ellis & Mingshan Lu, 2007. "Prevention and Dynamic Risk Adjustment," Boston University - Department of Economics - Working Papers Series WP2007-023, Boston University - Department of Economics.
    17. repec:clg:wpaper:2007-06 is not listed on IDEAS
    18. Karen Eggleston & Brian K. Chen & Chih-Hung Chen & Ying Isabel Chen & Talitha Feenstra & Toshiaki Iizuka & Janet Tin Kei Lam & Gabriel M. Leung & Jui-fen Rachel Lu & Beatriz Rodriguez-Sanchez & Jeroen, 2020. "Are quality-adjusted medical prices declining for chronic disease? Evidence from diabetes care in four health systems," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 21(5), pages 689-702, July.
    19. Ayman Fouda & Gianluca Fiorentini & Francesco Paolucci, 2017. "Competitive Health Markets and Risk Equalisation in Australia: Lessons Learnt from Other Countries," Applied Health Economics and Health Policy, Springer, vol. 15(6), pages 745-754, December.
    20. Alice Sanwald & Engelbert Theurl, 2017. "Out-of-pocket expenditures for pharmaceuticals: lessons from the Austrian household budget survey," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 18(4), pages 435-447, May.
    21. Grossman, Michael, 2004. "The demand for health, 30 years later: a very personal retrospective and prospective reflection," Journal of Health Economics, Elsevier, vol. 23(4), pages 629-636, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:oropre:v:56:y:2008:i:6:p:1382-1392. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.