IDEAS home Printed from https://ideas.repec.org/p/jgu/wpaper/2026.html
   My bibliography  Save this paper

A Random Forest a Day Keeps the Doctor Away

Author

Listed:
  • Markus Eyting

    (Johannes Gutenberg University)

Abstract

Using a unique dataset from a German health check-up provider including detailed individual questionnaire data as well as medical test data, I apply a random forest to predict several health risk factors. I evaluate the prediction performance using various metrics and find decent prediction qualities across all outcomes. By identifying the most relevant predictor variables, I compile concise and validated questionnaire tools to identify individuals’ blood pressure, blood glucose, and cholesterol levels, their risk of a coronary heart disease, whether or not they suffer from plaque or a metabolic syndrome as well as their relative fitness levels. In a second step, I compare the prediction results to physician predictions of the same patient observations. I find that the random forest outperforms the physicians if predictions are based on the same information set. When additionally providing the physicians with the random forest predictions for a particular patient observation, the physicians align with the random forest predictions. Finally, while the random forest considers various psychological scales, the physicians focus on family health history information instead.

Suggested Citation

  • Markus Eyting, 2020. "A Random Forest a Day Keeps the Doctor Away," Working Papers 2026, Gutenberg School of Management and Economics, Johannes Gutenberg-Universität Mainz.
  • Handle: RePEc:jgu:wpaper:2026
    as

    Download full text from publisher

    File URL: https://download.uni-mainz.de/RePEc/pdf/Discussion_Paper_2026.pdf
    File Function: First version, 2020
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Joshua Schwartzstein, 2014. "Selective Attention And Learning," Journal of the European Economic Association, European Economic Association, vol. 12(6), pages 1423-1452, December.
    2. Björn Bartling & Ernst Fehr & Daniel Schunk, 2012. "Health effects on children’s willingness to compete," Experimental Economics, Springer;Economic Science Association, vol. 15(1), pages 58-70, March.
    3. Stigler, George J., 2011. "Economics of Information," Ekonomicheskaya Politika / Economic Policy, Russian Presidential Academy of National Economy and Public Administration, vol. 5, pages 35-49.
    4. Xavier Gabaix, 2014. "A Sparsity-Based Model of Bounded Rationality," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 129(4), pages 1661-1710.
    5. Victor Chernozhukov & Mert Demirer & Esther Duflo & Iván Fernández-Val, 2018. "Generic Machine Learning Inference on Heterogeneous Treatment Effects in Randomized Experiments, with an Application to Immunization in India," NBER Working Papers 24678, National Bureau of Economic Research, Inc.
    6. Pedro Bordalo & Nicola Gennaioli & Andrei Shleifer, 2013. "Salience and Consumer Choice," Journal of Political Economy, University of Chicago Press, vol. 121(5), pages 803-843.
    7. Sendhil Mullainathan & Ziad Obermeyer, 2017. "Does Machine Learning Automate Moral Hazard and Error?," American Economic Review, American Economic Association, vol. 107(5), pages 476-480, May.
    8. Benjamin Enke & Florian Zimmermann, 2019. "Correlation Neglect in Belief Formation," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 86(1), pages 313-332.
    9. Stefan Wager & Susan Athey, 2018. "Estimation and Inference of Heterogeneous Treatment Effects using Random Forests," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1228-1242, July.
    10. George Laking & Joanne Lord & Alastair Fischer, 2006. "The economics of diagnosis," Health Economics, John Wiley & Sons, Ltd., vol. 15(10), pages 1109-1120, October.
    11. Hendrik Jürges, 2007. "True health vs response styles: exploring cross‐country differences in self‐reported health," Health Economics, John Wiley & Sons, Ltd., vol. 16(2), pages 163-178, February.
    12. Botond Koszegi & Adam Szeidl, 2013. "A Model of Focusing in Economic Choice," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 128(1), pages 53-104.
    13. Benjamin Handel & Joshua Schwartzstein, 2018. "Frictions or Mental Gaps: What's Behind the Information We (Don't) Use and When Do We Care?," Journal of Economic Perspectives, American Economic Association, vol. 32(1), pages 155-178, Winter.
    14. Susan Athey, 2018. "The Impact of Machine Learning on Economics," NBER Chapters, in: The Economics of Artificial Intelligence: An Agenda, pages 507-547, National Bureau of Economic Research, Inc.
    15. Chen, Daniel L. & Schonger, Martin & Wickens, Chris, 2016. "oTree—An open-source platform for laboratory, online, and field experiments," Journal of Behavioral and Experimental Finance, Elsevier, vol. 9(C), pages 88-97.
    16. Daron Acemoglu & Pascual Restrepo, 2018. "The Race between Man and Machine: Implications of Technology for Growth, Factor Shares, and Employment," American Economic Review, American Economic Association, vol. 108(6), pages 1488-1542, June.
    17. Andrew Caplin & Mark Dean, 2015. "Revealed Preference, Rational Inattention, and Costly Information Acquisition," American Economic Review, American Economic Association, vol. 105(7), pages 2183-2203, July.
    18. Sendhil Mullainathan & Jann Spiess, 2017. "Machine Learning: An Applied Econometric Approach," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 87-106, Spring.
    19. Sims, Christopher A., 2003. "Implications of rational inattention," Journal of Monetary Economics, Elsevier, vol. 50(3), pages 665-690, April.
    20. Su-In Lee & Safiye Celik & Benjamin A. Logsdon & Scott M. Lundberg & Timothy J. Martins & Vivian G. Oehler & Elihu H. Estey & Chris P. Miller & Sylvia Chien & Jin Dai & Akanksha Saxena & C. Anthony Bl, 2018. "A machine learning approach to integrate big data for precision medicine in acute myeloid leukemia," Nature Communications, Nature, vol. 9(1), pages 1-13, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. George Loewenstein & Zachary Wojtowicz, 2023. "The Economics of Attention," CESifo Working Paper Series 10712, CESifo.
    2. Xavier Gabaix, 2017. "Behavioral Inattention," NBER Working Papers 24096, National Bureau of Economic Research, Inc.
    3. Bartosz Maćkowiak & Filip Matějka & Mirko Wiederholt, 2023. "Rational Inattention: A Review," Journal of Economic Literature, American Economic Association, vol. 61(1), pages 226-273, March.
    4. Emmanuel Farhi & Xavier Gabaix, 2020. "Optimal Taxation with Behavioral Agents," American Economic Review, American Economic Association, vol. 110(1), pages 298-336, January.
    5. Jan Engelmann & Alejandro Hirmas & Joël van der Weele, 2021. "Top Down or Bottom Up? Disentangling the Channels of Attention in Risky Choice," Tinbergen Institute Discussion Papers 21-031/I, Tinbergen Institute.
    6. Philippe Jehiel, 2022. "Analogy-Based Expectation Equilibrium and Related Concepts:Theory, Applications, and Beyond," Working Papers halshs-03735680, HAL.
    7. Brocas, Isabelle & Carrillo, Juan D., 2021. "Value computation and modulation: A neuroeconomic theory of self-control as constrained optimization," Journal of Economic Theory, Elsevier, vol. 198(C).
    8. Rema Hanna & Sendhi Mullainathan & Josh Schwartstein, 2012. "Learning Through Noticing: Theory and Experimental Evidence in Farming," CID Working Papers 245, Center for International Development at Harvard University.
    9. Roc Armenter & Michèle Müller-Itten & Zachary Stangebye, 2020. "Rational Inattention via Ignorance Equivalence," Working Papers 20-24, Federal Reserve Bank of Philadelphia.
    10. Canidio, Andrea, 2015. "Focusing effect and the poverty trap," European Economic Review, Elsevier, vol. 76(C), pages 222-238.
    11. Dertwinkel-Kalt, Markus & Wenzel, Tobias, 2019. "Focusing and framing of risky alternatives," Journal of Economic Behavior & Organization, Elsevier, vol. 159(C), pages 289-304.
    12. Momsen, Katharina & Ohndorf, Markus, 2023. "Information avoidance: Self-image concerns, inattention, and ideology," Journal of Economic Behavior & Organization, Elsevier, vol. 211(C), pages 386-400.
    13. Benjamin Handel & Joshua Schwartzstein, 2018. "Frictions or Mental Gaps: What's Behind the Information We (Don't) Use and When Do We Care?," Journal of Economic Perspectives, American Economic Association, vol. 32(1), pages 155-178, Winter.
    14. Marcelo Arbex & Enlinson Mattos, 2020. "Tax Preferences and Optimal Income Taxation," Working Papers 2006, University of Windsor, Department of Economics.
    15. Andrew Ellis & Yusufcan Masatlioglu, 2022. "Choice with Endogenous Categorization [The Adaptive Nature of Human Categorization]," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 89(1), pages 240-278.
    16. Frank Huettner & Tamer Boyacı & Yalçın Akçay, 2019. "Consumer Choice Under Limited Attention When Alternatives Have Different Information Costs," Operations Research, INFORMS, vol. 67(3), pages 671-699, May.
    17. Pedro Bordalo & John Conlon & Nicola Gennaioli & Spencer Kwon & Andrei Shleifer, 2023. "How People Use Statistics," Working Papers 699, IGIER (Innocenzo Gasparini Institute for Economic Research), Bocconi University.
    18. Miao Liu, 2022. "Assessing Human Information Processing in Lending Decisions: A Machine Learning Approach," Journal of Accounting Research, Wiley Blackwell, vol. 60(2), pages 607-651, May.
    19. Matějka, Filip & Mackowiak, Bartosz & Wiederholt, Mirko, 2018. "Survey: Rational Inattention, a Disciplined Behavioral Model," CEPR Discussion Papers 13243, C.E.P.R. Discussion Papers.
    20. Ashesh Rambachan, 2022. "Identifying Prediction Mistakes in Observational Data," NBER Chapters, in: Economics of Artificial Intelligence, National Bureau of Economic Research, Inc.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jgu:wpaper:2026. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Research Unit IPP (email available below). General contact details of provider: https://edirc.repec.org/data/vlmaide.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.