IDEAS home Printed from https://ideas.repec.org/p/osf/socarx/ruxkh_v1.html
   My bibliography  Save this paper

Governing Synthetic Data in the Financial Sector

Author

Listed:
  • Spears, Taylor C.

    (University of Edinburgh)

  • Hansen, Kristian Bondo
  • Xu, Ruowen
  • Millo, Yuval

Abstract

Synthetic datasets, artificially generated to mimic real-world data while maintaining anonymization, have emerged as a promising technology in the financial sector, attracting support from regulators and market participants as a solution to data privacy and scarcity challenges limiting machine learning deployment. This paper argues that synthetic data's effects on financial markets depend critically on how these technologies are embedded within existing machine learning infrastructural ``stacks'' rather than on their intrinsic properties. We identify three key tensions that will determine whether adoption proves beneficial or harmful: (1) data circulability versus opacity, particularly the "double opacity" problem arising from stacked machine learning systems, (2) model-induced scattering versus model-induced herding in market participant behaviour, and (3) flattening versus deepening of data platform power. These tensions directly correspond to core regulatory priorities around model risk management, systemic risk, and competition policy. Using financial audit as a case study, we demonstrate how these tensions interact in practice and propose governance frameworks, including a synthetic data labelling regime to preserve contextual information when datasets cross organizational boundaries.

Suggested Citation

  • Spears, Taylor C. & Hansen, Kristian Bondo & Xu, Ruowen & Millo, Yuval, 2025. "Governing Synthetic Data in the Financial Sector," SocArXiv ruxkh_v1, Center for Open Science.
  • Handle: RePEc:osf:socarx:ruxkh_v1
    DOI: 10.31219/osf.io/ruxkh_v1
    as

    Download full text from publisher

    File URL: https://osf.io/download/68baa89d28a718439fa6c109/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/ruxkh_v1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Farmer, J. Doyne & Axtell, Robert L., 2022. "Agent-Based Modeling in Economics and Finance: Past, Present, and Future," INET Oxford Working Papers 2022-10, Institute for New Economic Thinking at the Oxford Martin School, University of Oxford.
    2. Johannes Petry, 2021. "From National Marketplaces to Global Providers of Financial Infrastructures: Exchanges, Infrastructures and Structural Power in Global Finance," New Political Economy, Taylor & Francis Journals, vol. 26(4), pages 574-597, July.
    3. Pinzur, David, 2016. "Making the grade: infrastructural semiotics and derivative market outcomes on the Chicago Board of Trade and New Orleans Cotton Exchange, 1856–1909," LSE Research Online Documents on Economics 102988, London School of Economics and Political Science, LSE Library.
    4. Sisson, Charles A., 1979. "The Synthetic Micro Data File: A New Tool for Economists," Journal of Agricultural Economics Research, United States Department of Agriculture, Economic Research Service, vol. 31(3), pages 1-10, July.
    5. Hans Buhler & Blanka Horvath & Terry Lyons & Imanol Perez Arribas & Ben Wood, 2020. "A Data-driven Market Simulator for Small Data Environments," Papers 2006.14498, arXiv.org.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cheng Wang & Chuwen Wang & Shirong Zeng & Jianguo Liu & Changjun Jiang, 2025. "Advanced simulation paradigm of human behaviour unveils complex financial systemic projection," Papers 2503.20787, arXiv.org, revised May 2025.
    2. Sebastian Jaimungal, 2022. "Reinforcement learning and stochastic optimisation," Finance and Stochastics, Springer, vol. 26(1), pages 103-129, January.
    3. Michele Vodret & Iacopo Mastromatteo & Bence Tóth & Michael Benzaquen, 2023. "Microfounding GARCH models and beyond: a Kyle-inspired model with adaptive agents," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 18(3), pages 599-625, July.
    4. Denis Koshelev & Alexey Ponomarenko & Sergei Seleznev, 2023. "Amortized Neural Networks for Agent-Based Model Forecasting," Bank of Russia Working Paper Series wps115, Bank of Russia.
    5. Aleksei Pastushkov, 2024. "Market efficiency, informational asymmetry and pseudo-collusion of adaptively learning agents," Papers 2411.05032, arXiv.org.
    6. Luca Grilli & Domenico Santoro, 2022. "Forecasting financial time series with Boltzmann entropy through neural networks," Computational Management Science, Springer, vol. 19(4), pages 665-681, October.
    7. Hans Buehler & Blanka Horvath & Yannick Limmer & Thorsten Schmidt, 2025. "Uncertainty-Aware Strategies: A Model-Agnostic Framework for Robust Financial Optimization through Subsampling," Papers 2506.07299, arXiv.org.
    8. Yannick Limmer & Blanka Horvath, 2023. "Robust Hedging GANs," Papers 2307.02310, arXiv.org.
    9. Jizhou Wang & Xiaodan Fang & Lei Huang & Yongfeng Huang, 2025. "TaxAgent: How Large Language Model Designs Fiscal Policy," Papers 2506.02838, arXiv.org.
    10. Mohamed Hamdouche & Pierre Henry-Labordere & Huy^en Pham, 2023. "Generative modeling for time series via Schr{\"o}dinger bridge," Papers 2304.05093, arXiv.org.
    11. Marco Catola & Silvia Leoni, 2025. "Pollution Abatement and Lobbying in a Cournot Game: An Agent-Based Modelling Approach," Computational Economics, Springer;Society for Computational Economics, vol. 65(2), pages 637-664, February.
    12. Demis Legrenzi & Emanuele Ciola & Davide Bazzana, 2025. "Adaptation to climate-induced macrofinancial risks: top-down and bottom-up solutions," Working Papers 2025.18, Fondazione Eni Enrico Mattei.
    13. Kaspereit, Thomas, 2024. "Mandatory disclosure of open-ended real estate fund shares that are registered for redemption?," International Review of Law and Economics, Elsevier, vol. 80(C).
    14. Christa Cuchiero & Philipp Schmocker & Josef Teichmann, 2023. "Global universal approximation of functional input maps on weighted spaces," Papers 2306.03303, arXiv.org, revised Nov 2025.
    15. Arthur, W. Brian, 2023. "Economics in nouns and verbs," Journal of Economic Behavior & Organization, Elsevier, vol. 205(C), pages 638-647.
    16. Aaron Wheeler & Jeffrey D. Varner, 2023. "Scalable Agent-Based Modeling for Complex Financial Market Simulations," Papers 2312.14903, arXiv.org, revised Jan 2024.
    17. Benjamin Patrick Evans & Sumitra Ganesh, 2024. "Learning and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-Agent Reinforcement Learning," Papers 2402.00787, arXiv.org.
    18. Darie Flavius Cosmin & Miron Alexandra Dorina & Ciurea Iulia Cristina, 2024. "Geoeconomic Dynamics in a New Economic Global Order from West to East," Proceedings of the International Conference on Business Excellence, Sciendo, vol. 18(1), pages 469-481.
    19. Fulin Guo, 2023. "GPT in Game Theory Experiments," Papers 2305.05516, arXiv.org, revised Dec 2023.
    20. Kshama Dwarakanath & Jialin Dong & Svitlana Vyetrenko, 2024. "Tax Credits and Household Behavior: The Roles of Myopic Decision-Making and Liquidity in a Simulated Economy," Papers 2408.10391, arXiv.org, revised Oct 2024.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:socarx:ruxkh_v1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://arabixiv.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.