IDEAS home Printed from https://ideas.repec.org/a/eee/tefoso/v199y2024ics0040162523007655.html
   My bibliography  Save this article

Machine learning and the optimization of prediction-based policies

Author

Listed:
  • Battiston, Pietro
  • Gamba, Simona
  • Santoro, Alessandro

Abstract

We present a procedure for the optimal implementation of public policies that involve predicting an individual behavior or characteristic. By linking prediction errors of any given classification model to the resulting social welfare, we provide a simple measure to rank different models and select the optimal one. Such measure is defined as the difference between the social welfare of a given policy and that of an error-free policy, and it is related to the ROC curve employed in the Machine Learning literature. We extend the cost isometrics approach described in the literature by considering the case of heterogeneous costs of type I and II errors. We apply our approach to the prediction of inaccurate tax returns issued by Italian self-employed and sole proprietorships. We show that the approach can result in substantial increases in revenues, and that random forest models, beyond providing comparatively good predictions, yield important insights. In our case, they both provide empirical support for existing theories on tax evasion — highlighting, for instance, cross-sectoral heterogeneity — and extend our understanding of the phenomenon — such as the role of bunching.

Suggested Citation

  • Battiston, Pietro & Gamba, Simona & Santoro, Alessandro, 2024. "Machine learning and the optimization of prediction-based policies," Technological Forecasting and Social Change, Elsevier, vol. 199(C).
  • Handle: RePEc:eee:tefoso:v:199:y:2024:i:c:s0040162523007655
    DOI: 10.1016/j.techfore.2023.123080
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0040162523007655
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.techfore.2023.123080?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jon Kleinberg & Jens Ludwig & Sendhil Mullainathan & Ziad Obermeyer, 2015. "Prediction Policy Problems," American Economic Review, American Economic Association, vol. 105(5), pages 491-495, May.
    2. Albert Meijer & Martijn Wessels, 2019. "Predictive Policing: Review of Benefits and Drawbacks," International Journal of Public Administration, Taylor & Francis Journals, vol. 42(12), pages 1031-1039, September.
    3. Toru Kitagawa & Aleksey Tetenov, 2018. "Who Should Be Treated? Empirical Welfare Maximization Methods for Treatment Choice," Econometrica, Econometric Society, vol. 86(2), pages 591-616, March.
    4. Miguel Almunia & David Lopez-Rodriguez, 2018. "Under the Radar: The Effects of Monitoring Firms on Tax Compliance," American Economic Journal: Economic Policy, American Economic Association, vol. 10(1), pages 1-38, February.
    5. Saavedra, Santiago & Romero, Mauricio, 2021. "Local incentives and national tax evasion: The response of illegal mining to a tax reform in Colombia," European Economic Review, Elsevier, vol. 138(C).
    6. Henrik Jacobsen Kleven & Wojciech Kopczuk, 2011. "Transfer Program Complexity and the Take-Up of Social Benefits," American Economic Journal: Economic Policy, American Economic Association, vol. 3(1), pages 54-90, February.
    7. Joel Slemrod, 1989. "The Return To Tax Simplification: an Econometric Analysis," Public Finance Review, , vol. 17(1), pages 3-27, January.
    8. John Muschelli, 2020. "ROC and AUC with a Binary Predictor: a Potentially Misleading Metric," Journal of Classification, Springer;The Classification Society, vol. 37(3), pages 696-708, October.
    9. Keen, Michael & Slemrod, Joel, 2017. "Optimal tax administration," Journal of Public Economics, Elsevier, vol. 152(C), pages 133-142.
    10. Susan Athey & Stefan Wager, 2021. "Policy Learning With Observational Data," Econometrica, Econometric Society, vol. 89(1), pages 133-161, January.
    11. A. B. Atkinson & N. H. Stern, 1974. "Pigou, Taxation and Public Goods," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 41(1), pages 119-128.
    12. Michele Rabasco & Pietro Battiston, 2023. "Predicting the deterrence effect of tax audits. A machine learning approach," Metroeconomica, Wiley Blackwell, vol. 74(3), pages 531-556, July.
    13. Jonah E. Rockoff & Brian A. Jacob & Thomas J. Kane & Douglas O. Staiger, 2011. "Can You Recognize an Effective Teacher When You Recruit One?," Education Finance and Policy, MIT Press, vol. 6(1), pages 43-74, January.
    14. Sebastian Beer & Matthias Kasper & Erich Kirchler & Brian Erard, 0. "Do Audits Deter or Provoke Future Tax Noncompliance? Evidence on Self-Employed Taxpayers," CESifo Economic Studies, CESifo Group, vol. 66(3), pages 248-264.
    15. Jon Kleinberg & Himabindu Lakkaraju & Jure Leskovec & Jens Ludwig & Sendhil Mullainathan, 2018. "Human Decisions and Machine Predictions," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 133(1), pages 237-293.
    16. Alberto Abadie & Maximilian Kasy, 2019. "Choosing Among Regularized Estimators in Empirical Economics: The Risk of Machine Learning," The Review of Economics and Statistics, MIT Press, vol. 101(5), pages 743-762, December.
    17. Amy Finkelstein & Matthew J Notowidigdo, 2019. "Take-Up and Targeting: Experimental Evidence from SNAP," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 134(3), pages 1505-1556.
    18. Maximilian Kasy & Anja Sautmann, 2021. "Adaptive Treatment Assignment in Experiments for Policy Choice," Econometrica, Econometric Society, vol. 89(1), pages 113-132, January.
    19. Sendhil Mullainathan & Jann Spiess, 2017. "Machine Learning: An Applied Econometric Approach," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 87-106, Spring.
    20. Karen Yeung, 2018. "Algorithmic regulation: A critical interrogation," Regulation & Governance, John Wiley & Sons, vol. 12(4), pages 505-523, December.
    21. Mayshar, Joram, 1991. " Taxation with Costly Administration," Scandinavian Journal of Economics, Wiley Blackwell, vol. 93(1), pages 75-88.
    22. Andini, Monica & Ciani, Emanuele & de Blasio, Guido & D'Ignazio, Alessio & Salvestrini, Viola, 2018. "Targeting with machine learning: An application to a tax rebate program in Italy," Journal of Economic Behavior & Organization, Elsevier, vol. 156(C), pages 86-102.
    23. Miriam Steurer & Robert J. Hill & Norbert Pfeifer, 2021. "Metrics for evaluating the performance of machine learning based automated valuation models," Journal of Property Research, Taylor & Francis Journals, vol. 38(2), pages 99-129, April.
    24. Hal R. Varian, 2014. "Big Data: New Tricks for Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 3-28, Spring.
    25. Alessandro Santoro & Carlo V. Fiorio, 2011. "Taxpayer Behavior When Audit Rules Are Known: Evidence from Italy," Public Finance Review, , vol. 39(1), pages 103-123, January.
    26. Dana Chandler & Steven D. Levitt & John A. List, 2011. "Predicting and Preventing Shootings among At-Risk Youth," American Economic Review, American Economic Association, vol. 101(3), pages 288-292, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pietro Battiston & Simona Gamba & Alessandro Santoro, 2020. "Optimizing Tax Administration Policies with Machine Learning," Working Papers 436, University of Milano-Bicocca, Department of Economics, revised Mar 2020.
    2. de Blasio, Guido & D'Ignazio, Alessio & Letta, Marco, 2022. "Gotham city. Predicting ‘corrupted’ municipalities with machine learning," Technological Forecasting and Social Change, Elsevier, vol. 184(C).
    3. Guido de Blasio & Alessio D'Ignazio & Marco Letta, 2020. "Predicting Corruption Crimes with Machine Learning. A Study for the Italian Municipalities," Working Papers 16/20, Sapienza University of Rome, DISS.
    4. Andini, Monica & Ciani, Emanuele & de Blasio, Guido & D'Ignazio, Alessio & Salvestrini, Viola, 2018. "Targeting with machine learning: An application to a tax rebate program in Italy," Journal of Economic Behavior & Organization, Elsevier, vol. 156(C), pages 86-102.
    5. Monica Andini & Emanuele Ciani & Guido de Blasio & Alessio D'Ignazio & Viola Salvestrini, 2017. "Targeting policy-compliers with machine learning: an application to a tax rebate programme in Italy," Temi di discussione (Economic working papers) 1158, Bank of Italy, Economic Research and International Relations Area.
    6. Elliott Ash & Sergio Galletta & Tommaso Giommoni, 2021. "A Machine Learning Approach to Analyze and Support Anti-Corruption Policy," CESifo Working Paper Series 9015, CESifo.
    7. Tobias Cagala & Ulrich Glogowsky & Johannes Rincke & Anthony Strittmatter, 2021. "Optimal Targeting in Fundraising: A Causal Machine-Learning Approach," Papers 2103.10251, arXiv.org, revised Sep 2021.
    8. Garbero, Alessandra & Sakos, Grayson & Cerulli, Giovanni, 2023. "Towards data-driven project design: Providing optimal treatment rules for development projects," Socio-Economic Planning Sciences, Elsevier, vol. 89(C).
    9. Tobias Cagala & Ulrich Glogowsky & Johannes Rincke & Anthony Strittmatter, 2021. "Optimal Targeting in Fundraising: A Machine-Learning Approach," CESifo Working Paper Series 9037, CESifo.
    10. Andini, Monica & Boldrini, Michela & Ciani, Emanuele & de Blasio, Guido & D'Ignazio, Alessio & Paladini, Andrea, 2022. "Machine learning in the service of policy targeting: The case of public credit guarantees," Journal of Economic Behavior & Organization, Elsevier, vol. 198(C), pages 434-475.
    11. Alessandra Garbero & Marco Letta, 2022. "Predicting household resilience with machine learning: preliminary cross-country tests," Empirical Economics, Springer, vol. 63(4), pages 2057-2070, October.
    12. Cerqua, Augusto & Letta, Marco, 2022. "Local inequalities of the COVID-19 crisis," Regional Science and Urban Economics, Elsevier, vol. 92(C).
    13. Sophie-Charlotte Klose & Johannes Lederer, 2020. "A Pipeline for Variable Selection and False Discovery Rate Control With an Application in Labor Economics," Papers 2006.12296, arXiv.org, revised Jun 2020.
    14. McKenzie, David & Sansone, Dario, 2017. "Man vs. Machine in Predicting Successful Entrepreneurs: Evidence from a Business Plan Competition in Nigeria," CEPR Discussion Papers 12523, C.E.P.R. Discussion Papers.
    15. Juan Carlos Perdomo, 2023. "The Relative Value of Prediction in Algorithmic Decision Making," Papers 2312.08511, arXiv.org, revised May 2024.
    16. McKenzie, David & Sansone, Dario, 2019. "Predicting entrepreneurial success is hard: Evidence from a business plan competition in Nigeria," Journal of Development Economics, Elsevier, vol. 141(C).
    17. Jorge Mejia & Shawn Mankad & Anandasivam Gopal, 2019. "A for Effort? Using the Crowd to Identify Moral Hazard in New York City Restaurant Hygiene Inspections," Information Systems Research, INFORMS, vol. 30(4), pages 1363-1386, December.
    18. Susan Athey, 2018. "The Impact of Machine Learning on Economics," NBER Chapters, in: The Economics of Artificial Intelligence: An Agenda, pages 507-547, National Bureau of Economic Research, Inc.
    19. Erik Heilmann & Janosch Henze & Heike Wetzel, 2021. "Machine learning in energy forecasts with an application to high frequency electricity consumption data," MAGKS Papers on Economics 202135, Philipps-Universität Marburg, Faculty of Business Administration and Economics, Department of Economics (Volkswirtschaftliche Abteilung).
    20. Delogu, Marco & Lagravinese, Raffaele & Paolini, Dimitri & Resce, Giuliano, 2024. "Predicting dropout from higher education: Evidence from Italy," Economic Modelling, Elsevier, vol. 130(C).

    More about this item

    Keywords

    Prediction; Public policy; ROC curve; Machine learning; Tax behavior;
    All these keywords.

    JEL classification:

    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • D78 - Microeconomics - - Analysis of Collective Decision-Making - - - Positive Analysis of Policy Formulation and Implementation
    • H50 - Public Economics - - National Government Expenditures and Related Policies - - - General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:tefoso:v:199:y:2024:i:c:s0040162523007655. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.sciencedirect.com/science/journal/00401625 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.