IDEAS home Printed from https://ideas.repec.org/a/eee/indorg/v101y2025ics0167718725000414.html

Unlevel playing field? Machine learning meets state aid regulation

Author

Listed:
  • Barone, Guglielmo
  • Letta, Marco

Abstract

The regulation of State Aid is crucial for a well-functioning European Union Single Market. However, both non-compliance of Member States and subsidies from abroad can jeopardize the level playing field. This paper uses machine learning techniques applied to financial statements data to detect potentially distortive public subsidies to companies in the European Union Single Market. We achieve high out-of-sample predictive accuracy and use the machine predictions to flag suspect cases of hidden recipients and explore the characteristics of these firms.

Suggested Citation

  • Barone, Guglielmo & Letta, Marco, 2025. "Unlevel playing field? Machine learning meets state aid regulation," International Journal of Industrial Organization, Elsevier, vol. 101(C).
  • Handle: RePEc:eee:indorg:v:101:y:2025:i:c:s0167718725000414
    DOI: 10.1016/j.ijindorg.2025.103175
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167718725000414
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ijindorg.2025.103175?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Silveira, Douglas & de Moraes, Lucas B. & Fiuza, Eduardo P.S. & Cajueiro, Daniel O., 2023. "Who are you? Cartel detection using unlabeled data," International Journal of Industrial Organization, Elsevier, vol. 88(C).
    2. Sigrist, Fabio & Hirnschall, Christoph, 2019. "Grabit: Gradient tree-boosted Tobit models for default prediction," Journal of Banking & Finance, Elsevier, vol. 102(C), pages 177-192.
    3. Max Kuhn & Kjell Johnson, 2013. "Applied Predictive Modeling," Springer Books, Springer, number 978-1-4614-6849-3, March.
    4. Timothy I Cannings & Yingying Fan & Richard J Samworth, 2020. "Classification with imperfect training labels," Biometrika, Biometrika Trust, vol. 107(2), pages 311-330.
    5. Myrto Kalouptsidi, 2018. "Detection and Impact of Industrial Subsidies: The Case of Chinese Shipbuilding," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 85(2), pages 1111-1158.
    6. Sendhil Mullainathan & Jann Spiess, 2017. "Machine Learning: An Applied Econometric Approach," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 87-106, Spring.
    7. Tomasz Kamil Michalski & Guillaume Stoltz, 2010. "Do countries falsify economic date strategically? Some evidence that they do," Working Papers hal-00540794, HAL.
    8. Tomasz Michalski & Gilles Stoltz, 2013. "Do Countries Falsify Economic Data Strategically? Some Evidence That They Might," The Review of Economics and Statistics, MIT Press, vol. 95(2), pages 591-616, May.
    9. Weili Wu & Zhao Zhang & Wonjun Lee & Ding-Zhu Du, 2020. "Optimal Coverage in Wireless Sensor Networks," Springer Optimization and Its Applications, Springer, number 978-3-030-52824-9, January.
    10. Li, Cong-Cong & Xu, Hai-Chuan & Zhou, Wei-Xing, 2020. "News coverage and portfolio returns: Evidence from China," Pacific-Basin Finance Journal, Elsevier, vol. 60(C).
    11. Weili Wu & Zhao Zhang & Wonjun Lee & Ding-Zhu Du, 2020. "Lifetime of Coverage," Springer Optimization and Its Applications, in: Optimal Coverage in Wireless Sensor Networks, chapter 0, pages 67-86, Springer.
    12. Matej Bajgar & Giuseppe Berlingieri & Sara Calligaris & Chiara Criscuolo & Jonathan Timmis, 2020. "Coverage and representativeness of Orbis data," OECD Science, Technology and Industry Working Papers 2020/06, OECD Publishing.
    13. Falck, Oliver & Gollier, Christian & Woessmann, Ludger (ed.), 2011. "Industrial Policy for National Champions," MIT Press Books, The MIT Press, edition 1, volume 1, number 026201601x, December.
    14. Robert H. Porter & J. Douglas Zona, 1999. "Ohio School Milk Markets: An Analysis of Bidding," RAND Journal of Economics, The RAND Corporation, vol. 30(2), pages 263-288, Summer.
    15. James, Robert & Leung, Henry & Prokhorov, Artem, 2023. "A machine learning attack on illegal trading," Journal of Banking & Finance, Elsevier, vol. 148(C).
    16. Tai, Chung-Ching & Lin, Hung-Wen & Chie, Bin-Tzong & Tung, Chen-Yuan, 2019. "Predicting the failures of prediction markets: A procedure of decision making using classification models," International Journal of Forecasting, Elsevier, vol. 35(1), pages 297-312.
    17. Gallego, Jorge & Rivero, Gonzalo & Martínez, Juan, 2021. "Preventing rather than punishing: An early warning model of malfeasance in public procurement," International Journal of Forecasting, Elsevier, vol. 37(1), pages 360-377.
    18. Weili Wu & Zhao Zhang & Wonjun Lee & Ding-Zhu Du, 2020. "Probabilistic Coverage," Springer Optimization and Its Applications, in: Optimal Coverage in Wireless Sensor Networks, chapter 0, pages 203-207, Springer.
    19. Şebnem Kalemli- Özcan & Bent E. Sørensen & Carolina Villegas-Sanchez & Vadym Volosovych & Sevcan Yeşiltaş, 2024. "How to Construct Nationally Representative Firm-Level Data from the Orbis Global Database: New Facts on SMEs and Aggregate Implications for Industry Concentration," American Economic Journal: Macroeconomics, American Economic Association, vol. 16(2), pages 353-374, April.
    20. Huber, Martin & Imhof, David, 2019. "Machine learning with screens for detecting bid-rigging cartels," International Journal of Industrial Organization, Elsevier, vol. 65(C), pages 277-301.
    21. Weili Wu & Zhao Zhang & Wonjun Lee & Ding-Zhu Du, 2020. "Barrier Coverage," Springer Optimization and Its Applications, in: Optimal Coverage in Wireless Sensor Networks, chapter 0, pages 159-181, Springer.
    22. Weili Wu & Zhao Zhang & Wonjun Lee & Ding-Zhu Du, 2020. "Partial Coverage," Springer Optimization and Its Applications, in: Optimal Coverage in Wireless Sensor Networks, chapter 0, pages 193-202, Springer.
    23. Vitezslav Titl & Deni Mazrekaj & Fritz Schiltz, 2024. "Identifying Politically Connected Firms: A Machine Learning Approach," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 86(1), pages 137-155, February.
    24. Mark J. Nigrini, 2005. "An Assessment of the Change in the Incidence of Earnings Management Around the Enron‐Andersen Episode," Review of Accounting and Finance, Emerald Group Publishing Limited, vol. 4(1), pages 92-110, January.
    25. Huber, Martin & Imhof, David, 2023. "Flagging cartel participants with deep learning based on convolutional neural networks," International Journal of Industrial Organization, Elsevier, vol. 89(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bedri Kamil Onur Tas, 2024. "A machine learning approach to detect collusion in public procurement with limited information," Journal of Computational Social Science, Springer, vol. 7(2), pages 1913-1935, October.
    2. Aaltio, Aapo & Buri, Riku & Jokelainen, Antto & Lundberg, Johan, 2025. "Complementary bidding and cartel detection: Evidence from Nordic asphalt markets," International Journal of Industrial Organization, Elsevier, vol. 98(C).
    3. Brown, David P. & Eckert, Andrew & Silveira, Douglas, 2023. "Screening for collusion in wholesale electricity markets: A literature review," Utilities Policy, Elsevier, vol. 85(C).
    4. David Imhof & Emanuel W Viklund & Martin Huber, 2025. "Catching Bid-rigging Cartels with Graph Attention Neural Networks," Papers 2507.12369, arXiv.org, revised Jul 2025.
    5. Hoang, Daniel & Wiegratz, Kevin, 2022. "Machine learning methods in finance: Recent applications and prospects," Working Paper Series in Economics 158, Karlsruhe Institute of Technology (KIT), Department of Economics and Management.
    6. Aineas Kostas Mallios, 2023. "Manipulation in reported dividends: Empirical evidence from US banks," Economics Bulletin, AccessEcon, vol. 43(1), pages 441-461.
    7. Ronelle Burger & Canh Thien Dang & Trudy Owens, 2017. "Better performing NGOs do report more accurately: Evidence from investigating Ugandan NGO financial accounts," Discussion Papers 2017-10, University of Nottingham, CREDIT.
    8. Hannes Wallimann & Silvio Sticher, 2023. "On suspicious tracks: machine-learning based approaches to detect cartels in railway-infrastructure procurement," Papers 2304.11888, arXiv.org.
    9. Saad Ahmad & Jeffrey Bergstrand & Jordi Paniagua & Heather Wickramarachi, 2025. "The Multinational Revenue, Employment, and Investment Database (MREID)," Review of International Economics, Wiley Blackwell, vol. 33(4), pages 817-836, September.
    10. Banu Demir Pakel & Beata Smarzynska Javorcik & Beata Smarzynska Javorcik, 2018. "Forensics, Elasticities and Benford's Law," CESifo Working Paper Series 7266, CESifo.
    11. Dang, Canh Thien & Owens, Trudy, 2020. "Does transparency come at the cost of charitable services? Evidence from investigating British charities," Journal of Economic Behavior & Organization, Elsevier, vol. 172(C), pages 314-343.
    12. repec:osf:socarx:48g5c_v1 is not listed on IDEAS
    13. Dan Amiram & Zahn Bozanic & Ethan Rouen, 2015. "Financial statement errors: evidence from the distributional properties of financial statement numbers," Review of Accounting Studies, Springer, vol. 20(4), pages 1540-1593, December.
    14. Biswas, Amit K. & von Hagen, Jürgen & Sarkar, Sandip, 2022. "FDI Mismatch, trade Mis-reporting, and hidden capital Movements: The USA - China case," Journal of International Money and Finance, Elsevier, vol. 120(C).
    15. Brutscher, Philipp-Bastian & Coali, Andrea & Delanote, Julie & Harasztosi, Peter, 2020. "EIB Group Survey on Investment and Investment Finance: A technical note on data quality," EIB Working Papers 2020/08, European Investment Bank (EIB).
    16. Caravaggio, Nicola & Resce, Giuliano & Vaquero-Piñeiro, Cristina, 2025. "Predicting policy funding allocation with Machine Learning," Socio-Economic Planning Sciences, Elsevier, vol. 98(C).
    17. Bovin, Andreas & Bos, Iwan, 2023. "Market Shares as Collusive Marker: Evidence from the European Truck Industry," Research Memorandum 011, Maastricht University, Graduate School of Business and Economics (GSBE).
    18. Carl Benedikt Frey & Giorgio Presidente, 2024. "Privacy regulation and firm performance: Estimating the GDPR effect globally," Economic Inquiry, Western Economic Association International, vol. 62(3), pages 1074-1089, July.
    19. Thomas Stoerk, 2015. "Statistical corruption in Beijing’s air quality data has likely ended in 2012," GRI Working Papers 194, Grantham Research Institute on Climate Change and the Environment.
    20. Mukherjee, Abhiroop & Panayotov, George & Shon, Janghoon, 2021. "Eye in the sky: Private satellites and government macro data," Journal of Financial Economics, Elsevier, vol. 141(1), pages 234-254.
    21. Crispino, Marta & Conteduca, Francesco Paolo, 2024. "It's a match! Linking foreign counterparts in Italian customs data to their balance sheets," International Economics, Elsevier, vol. 179(C).

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    JEL classification:

    • L49 - Industrial Organization - - Antitrust Issues and Policies - - - Other
    • H25 - Public Economics - - Taxation, Subsidies, and Revenue - - - Business Taxes and Subsidies
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:indorg:v:101:y:2025:i:c:s0167718725000414. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/505551 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.