LLM Personas as a Substitute for Field Experiments in Method Benchmarking

LLM Personas as a Substitute for Field Experiments in Method Benchmarking

Author

Listed:

Enoch Hyunwook Kang

Abstract

Field experiments (A/B tests) are often the most credible benchmark for methods (algorithms) in societal systems, but their cost and latency bottleneck rapid methodological progress. LLM-based persona simulation offers a cheap synthetic alternative, yet it is unclear whether replacing humans with personas preserves the benchmark interface that adaptive methods optimize against. We prove an if-and-only-if characterization: when (i) methods observe only the aggregate outcome (aggregate-only observation) and (ii) evaluation depends only on the submitted artifact and not on the method's identity or provenance (method-blind evaluation), swapping humans for personas is just panel change from the method's point of view, indistinguishable from changing the evaluation population (e.g., New York to Jakarta). Furthermore, we move from validity to usefulness: we define an information-theoretic discriminability of the induced aggregate channel and show that making persona benchmarking as decision-relevant as a field experiment is fundamentally a sample-size question, yielding explicit bounds on the number of independent persona evaluations required to reliably distinguish meaningfully different methods at a chosen resolution.

Suggested Citation

Enoch Hyunwook Kang, 2025. "LLM Personas as a Substitute for Field Experiments in Method Benchmarking," Papers 2512.21080, arXiv.org, revised Jan 2026.

Handle: RePEc:arx:papers:2512.21080

Download full text from publisher

References listed on IDEAS

George Gui & Seungwoo Kim, 2025. "Leveraging LLMs to Improve Experimental Design: A Generative Stratification Approach," Papers 2509.25709, arXiv.org.
Oriana Bandiera & Iwan Barankay & Imran Rasul, 2011. "Field Experiments with Firms," Journal of Economic Perspectives, American Economic Association, vol. 25(3), pages 63-82, Summer.
- Bandiera, Oriana & Barankay, Iwan & Rasul, Imran, 2011. "Field Experiments with Firms," CEPR Discussion Papers 8412, Centre for Economic Policy Research.
- Oriana Bandiera & Iwan Baranky & Imran Rasul, 2011. "Field Experiments with Firms," STICERD - Economic Organisation and Public Policy Discussion Papers Series 028, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
- Bandiera, Oriana & Barankay, Iwan & Rasul, Imran, 2011. "Field Experiments with Firms," IZA Discussion Papers 5723, IZA Network @ LISER.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Guido Friebel & Matthias Heinz & Miriam Krueger & Nikolay Zubanov, 2017. "Team Incentives and Performance: Evidence from a Retail Chain," American Economic Review, American Economic Association, vol. 107(8), pages 2168-2203, August.
- Friebel, Guido & Zubanov, Nick & Heinz, Matthias & KrÃ¼ger, Miriam, 2015. "Team incentives and performance: Evidence from a retail chain," CEPR Discussion Papers 10796, Centre for Economic Policy Research.
- Friebel, Guido & Heinz, Matthias & Krueger, Miriam & Zubanov, Nikolay, 2017. "Team incentives and performance: Evidence from a retail chain," VfS Annual Conference 2017 (Vienna): Alternative Structures for Money and Banking 168285, Verein für Socialpolitik / German Economic Association.
- Friebel, Guido & Heinz, Matthias & Krüger, Miriam & Zubanov, Nick, 2015. "Team Incentives and Performance: Evidence from a Retail Chain," IZA Discussion Papers 9316, IZA Network @ LISER.
Pedro Carneiro & Sokbae Lee & Daniel Wilhelm, 2020. "Optimal data collection for randomized control trials," The Econometrics Journal, Royal Economic Society, vol. 23(1), pages 1-31.
- Pedro Carneiro & Sokbae (Simon) Lee & Daniel Wilhelm, 2016. "Optimal data collection for randomized control trials," CeMMAP working papers CWP15/16, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Pedro Carneiro & Sokbae (Simon) Lee & Daniel Wilhelm, 2017. "Optimal data collection for randomized control trials," CeMMAP working papers 15/17, Institute for Fiscal Studies.
- Pedro Carneiro & Sokbae (Simon) Lee & Daniel Wilhelm, 2017. "Optimal data collection for randomized control trials," CeMMAP working papers 45/17, Institute for Fiscal Studies.
- Pedro Carneiro & Sokbae (Simon) Lee & Daniel Wilhelm, 2017. "Optimal data collection for randomized control trials," CeMMAP working papers CWP15/17, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Carneiro, Pedro & Lee, Sokbae & Wilhelm, Daniel, 2016. "Optimal Data Collection for Randomized Control Trials," IZA Discussion Papers 9908, IZA Network @ LISER.
- Pedro Carneiro & Sokbae (Simon) Lee & Daniel Wilhelm, 2019. "Optimal Data Collection for Randomized Control Trials," CeMMAP working papers CWP21/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Pedro Carneiro & Sokbae (Simon) Lee & Daniel Wilhelm, 2016. "Optimal data collection for randomized control trials," CeMMAP working papers 15/16, Institute for Fiscal Studies.
- Pedro Carneiro & Sokbae (Simon) Lee & Daniel Wilhelm, 2017. "Optimal data collection for randomized control trials," CeMMAP working papers CWP45/17, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Pedro Carneiro & Sokbae Lee & Daniel Wilhelm, 2016. "Optimal Data Collection for Randomized Control Trials," Papers 1603.03675, arXiv.org, revised Aug 2016.
Ethan Ilzetzki & Saverio Simonelli, 2017. "Measuring Productivity Dispersion: Lessons From Counting One-Hundred Million Ballots," CSEF Working Papers 483, Centre for Studies in Economics and Finance (CSEF), University of Naples, Italy.
- Ilzetzki, Ethan & Simonelli, Saverio, 2017. "Measuring Productivity Dispersion: Lessons From Counting One-Hundred Million Ballots," CEPR Discussion Papers 12273, Centre for Economic Policy Research.
- Ethan Ilzetzki & Saverio Simonelli, 2017. "Measuring Productivity Dispersion: Lessons from counting one-hundred million ballots," Discussion Papers 1725, Centre for Macroeconomics (CFM).
- Ilzetzki, Ethan & Simonelli, Saverio, 2017. "Measuring productivity dispersion:Lessons from counting one-hundred million ballots," LSE Research Online Documents on Economics 86150, London School of Economics and Political Science, LSE Library.
Petri Böckerman & Alex Bryson & Pekka Ilmakunnas, 2013. "Does high involvement management lead to higher pay?," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 176(4), pages 861-885, October.
- Dr Alex Bryson, 2011. "Does High Involvement Management Lead to Higher Pay?," National Institute of Economic and Social Research (NIESR) Discussion Papers 376, National Institute of Economic and Social Research.
- Böckerman, Petri & Bryson, Alex & Ilmakunnas, Pekka, 2011. "Does high involvement management lead to higher pay?," MPRA Paper 28711, University Library of Munich, Germany.
- Petri Böckerman & Alex Bryson & Pekka Ilmakunnas, 2012. "Does High Involvement Management Lead to Higher Pay?," Working Papers 1285, Tampere University, Faculty of Management and Business, Economics.
- Böckerman, Petri & Bryson, Alex & Ilmakunnas, Pekka, 2011. "Does high involvement management lead to higher pay?," LSE Research Online Documents on Economics 38575, London School of Economics and Political Science, LSE Library.
- Alex Bryson & Petri Böckerman & Pekka Ilmakunnas, 2011. "Does High Involvement Management Lead to Higher Pay?," CEP Discussion Papers dp1046, Centre for Economic Performance, LSE.
Jakob Alfitian & Dirk Sliwka & Timo Vogelsang, 2024. "When Bonuses Backfire: Evidence from the Workplace," Management Science, INFORMS, vol. 70(9), pages 6395-6414, September.
Stijn Baert & Sunčica Vujić, 2018. "Does it pay to care? Volunteering and employment opportunities," Journal of Population Economics, Springer;European Society for Population Economics, vol. 31(3), pages 819-836, July.
Kathrin Manthei & Dirk Sliwka & Timo Vogelsang, 2021. "Performance Pay and Prior Learning—Evidence from a Retail Chain," Management Science, INFORMS, vol. 67(11), pages 6998-7022, November.
Eric Floyd & John A. List, 2016. "Using Field Experiments in Accounting and Finance," Journal of Accounting Research, John Wiley & Sons, Ltd., vol. 54(2), pages 437-475, May.
- Eric Floyd & John List, 2016. "Using Field Experiments in Accounting and Finance," Artefactual Field Experiments 00410, The Field Experiments Website.
Abebe, Girum & Caria, Stefano & Fafchamps, Marcel & Falco, Paolo & Franklin, Simon & Quinn, Simon & Shilpi, Forhad, 2017. "Matching firms and workers in a field experiment in Ethiopia," LSE Research Online Documents on Economics 86572, London School of Economics and Political Science, LSE Library.
- Girum Abebe & Stefano Caria & Marcel Fafchamps & Paolo Falco & Simon Franklin & Simon Quinn & Forhad Shilpi, 2017. "Matching Firms and Workers in a Field Experiment in Ethiopia," SERC Discussion Papers 0225, Centre for Economic Performance, LSE.
Manthei, Kathrin & Sliwka, Dirk & Vogelsang, Timo, 2017. "Performance Pay May Not Raise Performance – A Cautionary Tale Based On Evidence from Large Scale Field Experiments in a Retail Chain," VfS Annual Conference 2017 (Vienna): Alternative Structures for Money and Banking 168287, Verein für Socialpolitik / German Economic Association.
Olivier Armantier & Amadou Boly, 2015. "Framing Of Incentives And Effort Provision," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 56(3), pages 917-938, August.
Greer K. Gosnell & John A. List & Robert Metcalfe, 2016. "A New Approach to an Age-Old Problem: Solving Externalities by Incenting Workers Directly," NBER Working Papers 22316, National Bureau of Economic Research, Inc.
- Greer Gosnell & John List & Robert Metcalfe, 2016. "A New Approach to an Age-Old Problem: Solving Externalities by Incenting Workers Directly," Framed Field Experiments 00412, The Field Experiments Website.
- Greer Gosnell & John List & Robert Metcalfe, 2017. "A new approach to an age-old problem: solving externalities by incenting workers directly," GRI Working Papers 262, Grantham Research Institute on Climate Change and the Environment.
- Gosnell, Greer & Metcalfe, Robert & List, John A, 2016. "A new approach to an age-old problem: solving externalities by incenting workers directly," LSE Research Online Documents on Economics 84331, London School of Economics and Political Science, LSE Library.
Paweł Doligalski & Abdoulaye Ndiaye & Nicolas Werquin, 2023. "Redistribution with Performance Pay," Journal of Political Economy Macroeconomics, University of Chicago Press, vol. 1(2), pages 371-402.
- Doligalski, Pawel & Ndiaye, Abdoulaye & Werquin, Nicolas, 2020. "Redistribution with Performance Pay," MPRA Paper 102652, University Library of Munich, Germany.
- Doligalski, Pawel & Werquin, Nicolas & Ndiaye, Abdoulaye, 2020. "Redistribution with Performance Pay," TSE Working Papers 20-1092, Toulouse School of Economics (TSE).
- Doligalski, Pawel & Ndiaye, Abdoulaye & Werquin, Nicolas, 2022. "Redistribution with Performance Pay," CEPR Discussion Papers 14648, Centre for Economic Policy Research.
- Pawel Doligalski & Abdoulaye Ndiaye & Nicolas Werquin, 2020. "Redistribution with Performance Pay," CESifo Working Paper Series 8267, CESifo.
- Pawel Doligalski & Abdoulaye Ndiaye & Nicolas Werquin, 2020. "Redistribution with Performance Pay," Bristol Economics Discussion Papers 20/721, School of Economics, University of Bristol, UK.
Simon Wiederhold, 2012. "The Role of Public Procurement in Innovation: Theory and Empirical Evidence," ifo Beiträge zur Wirtschaftsforschung, ifo Institute - Leibniz Institute for Economic Research at the University of Munich, number 43, April.
Marcel Fafchamps & Simon Quinn, 2018. "Networks and Manufacturing Firms in Africa: Results from a Randomized Field Experiment," The World Bank Economic Review, World Bank, vol. 32(3), pages 656-675.
- Marcel Fafchamps & Simon Quinn, 2014. "Networks and Manufacturing Firms in Africa: Results from a Randomized Field Experiment," CSAE Working Paper Series 2014-25, Centre for the Study of African Economies, University of Oxford.
- Marcel Fafchamps & Simon R. Quinn, 2015. "Networks and Manufacturing Firms in Africa: Results from a Randomized Field Experiment," NBER Working Papers 21132, National Bureau of Economic Research, Inc.
- Marcel Fafchamps & Simon R. Quinn, 2015. "Networks and Manufacturing Firms in Africa: Results from a Randomized Field Experiment," NBER Working Papers 21132, National Bureau of Economic Research, Inc.
John Pencavel, 2013. "The Productivity Of Working Hours," Discussion Papers 13-006, Stanford Institute for Economic Policy Research.
- Pencavel, John H., 2014. "The Productivity of Working Hours," IZA Discussion Papers 8129, IZA Network @ LISER.
Robert Gibbons & John Roberts, 2012. "Introduction [The Handbook of Organizational Economics]," Introductory Chapters,, Princeton University Press.
Loureiro, Maria & Labandeira, Xavier, 2019. "Exploring Energy Use in Retail Stores: A Field Experiment," Energy Economics, Elsevier, vol. 84(S1).

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-CMP-2026-01-19 (Computational Economics)
NEP-EXP-2026-01-19 (Experimental Economics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.21080. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

LLM Personas as a Substitute for Field Experiments in Method Benchmarking

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data