Using Large Language Models for Text Classification in Experimental Economics

Author

Can Celebi
(University of Mannheim)
Stefan Penczynski
(School of Economics and Centre for Behavioural and Experimental Social Science, University of East Anglia)

Abstract

In our study, we compare the classification capabilities of GPT-3.5 and GPT-4 with human annotators using text data from economic experiments. We analysed four text corpora, focusing on two domains: promises and strategic reasoning. Starting with prompts close to those given to human annotators, we subsequently explored alternative prompts to investigate the effect of varying classification instructions and degrees of background information on the models' classification performance. Additionally, we varied the number of examples in a prompt (few-shot vs zero-shot) and the use of the zero-shot "Chain of Thought" prompting technique. Our findings show that GPT-4's performance is comparable to human annotators, achieving accuracy levels near or over 90% in three tasks, and in the most challenging task of classifying strategic thinking in asymmetric coordination games, it reaches an accuracy level above 70%.
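The prompt variations described in the abstract (zero-shot vs few-shot, with or without zero-shot "Chain of Thought") and the comparison against human annotations can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the prompt layout, the example labels, and the trigger phrase "Let's think step by step." are all assumptions chosen for illustration, and no model API is called.

```python
from typing import Sequence, Tuple

# Assumed trigger phrase for zero-shot Chain of Thought; the paper's exact
# wording is not given in the abstract.
COT_TRIGGER = "Let's think step by step."


def build_prompt(
    instructions: str,
    message: str,
    examples: Sequence[Tuple[str, str]] = (),
    chain_of_thought: bool = False,
) -> str:
    """Assemble a classification prompt (hypothetical layout).

    No examples gives a zero-shot prompt; one or more (text, label) pairs
    give a few-shot prompt. With chain_of_thought=True the prompt ends with
    the trigger phrase instead of an open "Label:" slot.
    """
    parts = [instructions.strip()]
    for text, label in examples:
        parts.append(f"Message: {text}\nLabel: {label}")
    ending = COT_TRIGGER if chain_of_thought else "Label:"
    parts.append(f"Message: {message}\n{ending}")
    return "\n\n".join(parts)


def accuracy(model_labels: Sequence[str], human_labels: Sequence[str]) -> float:
    """Share of items where the model label equals the human annotation."""
    if len(model_labels) != len(human_labels):
        raise ValueError("label sequences must have the same length")
    if not human_labels:
        raise ValueError("at least one labelled item is required")
    matches = sum(m == h for m, h in zip(model_labels, human_labels))
    return matches / len(human_labels)


if __name__ == "__main__":
    # Hypothetical few-shot prompt for a promise-classification task.
    print(
        build_prompt(
            "Classify the message as 'promise' or 'no promise'.",
            "I will choose the cooperative option.",
            examples=[("Good luck!", "no promise")],
        )
    )
    # Three of four hypothetical model labels match the human labels.
    print(
        accuracy(
            ["promise", "no promise", "promise", "promise"],
            ["promise", "no promise", "no promise", "promise"],
        )
    )
```

Treating the human annotation as the reference label is itself an assumption of this sketch; the abstract reports accuracy but does not spell out how agreement was computed.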

Suggested Citation

Can Celebi & Stefan Penczynski, 2024. "Using Large Language Models for Text Classification in Experimental Economics," Working Paper series, University of East Anglia, Centre for Behavioural and Experimental Social Science (CBESS) 24-01, School of Economics, University of East Anglia, Norwich, UK.

Handle: RePEc:uea:wcbess:24-01

Citations

Cited by:

Simeon Schudy & Susanna Grundmann & Lisa Spantig, 2024. "Individual Preferences for Truth-Telling," CESifo Working Paper Series 11521, CESifo.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Julian Junyan Wang & Victor Xiaoqi Wang, 2025. "Assessing Consistency and Reproducibility in the Outputs of Large Language Models: Evidence Across Diverse Finance and Accounting Tasks," Papers 2503.16974, arXiv.org, revised Jun 2025.
Dong, Mengming Michael & Stratopoulos, Theophanis C. & Wang, Victor Xiaoqi, 2024. "A scoping review of ChatGPT research in accounting and finance," International Journal of Accounting Information Systems, Elsevier, vol. 55(C).
- Mengming Michael Dong & Theophanis C. Stratopoulos & Victor Xiaoqi Wang, 2024. "A Scoping Review of ChatGPT Research in Accounting and Finance," Papers 2412.05731, arXiv.org.
Jiabin Wu, 2018. "Indirect higher order beliefs and cooperation," Experimental Economics, Springer;Economic Science Association, vol. 21(4), pages 858-876, December.
- Wu, Jiabin, 2016. "Indirect Higher Order Beliefs and Cooperation," MPRA Paper 69600, University Library of Munich, Germany.
Baethge, Caroline, 2016. "Performance in the beauty contest: How strategic discussion enhances team reasoning," Passauer Diskussionspapiere, Betriebswirtschaftliche Reihe B-17-16, University of Passau, Faculty of Business and Economics.
Ronald Bosman & Heike Hennig-Schmidt & Frans Winden, 2006. "Exploring group decision making in a power-to-take experiment," Experimental Economics, Springer;Economic Science Association, vol. 9(1), pages 35-51, April.
van Dijk, Frans & Sonnemans, Joep & Bauw, Eddy, 2014. "Judicial error by groups and individuals," Journal of Economic Behavior & Organization, Elsevier, vol. 108(C), pages 224-235.
- Frans van Dijk & Joep H. Sonnemans & Ed Bauw, 2012. "Judicial Error by Groups and Individuals," Tinbergen Institute Discussion Papers 12-029/3, Tinbergen Institute.
Auerswald, Heike & Schmidt, Carsten & Thum, Marcel & Torsvik, Gaute, 2018. "Teams in a public goods experiment with punishment," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 72(C), pages 28-39.
Jacob K. Goeree & Leeat Yariv, 2009. "An experimental study of jury deliberation," IEW - Working Papers 438, Institute for Empirical Research in Economics - University of Zurich.
Andres, Maximilian & Bruttel, Lisa & Friedrichsen, Jana, 2023. "How communication makes the difference between a cartel and tacit collusion: A machine learning approach," European Economic Review, Elsevier, vol. 152(C).
- Andres, Maximilian & Bruttel, Lisa & Friedrichsen, Jana, 2023. "How communication makes the difference between a cartel and tacit collusion: A machine learning approach," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 152, pages 1-1.
- Maximilian Andres & Lisa Bruttel & Jana Friedrichsen, 2022. "How Communication Makes the Difference between a Cartel and Tacit Collusion: A Machine Learning Approach," CESifo Working Paper Series 10024, CESifo.
- Maximilian Andres & Lisa Bruttel & Jana Friedrichsen, 2022. "How communication makes the difference between a cartel and tacit collusion: a machine learning approach," CEPA Discussion Papers 53, Center for Economic Policy Analysis.
- Maximilian Andres & Lisa Bruttel & Jana Friedrichsen, 2022. "How Communication Makes the Difference between a Cartel and Tacit Collusion: A Machine Learning Approach," Discussion Papers of DIW Berlin 2000, DIW Berlin, German Institute for Economic Research.
Nick Feltovich & Yasuyo Hamaguchi, 2018. "The Effect of Whistle‐Blowing Incentives on Collusion: An Experimental Study of Leniency Programs," Southern Economic Journal, John Wiley & Sons, vol. 84(4), pages 1024-1049, April.
David J. Cooper & Ian Krajbich & Charles N. Noussair, 2019. "Choice-Process Data in Experimental Economics," Journal of the Economic Science Association, Springer;Economic Science Association, vol. 5(1), pages 1-13, August.
Wang, Siyu & Houser, Daniel, 2019. "Demanding or deferring? An experimental analysis of the economic value of communication with attitude," Games and Economic Behavior, Elsevier, vol. 115(C), pages 381-395.
W. Viscusi & Owen Phillips & Stephan Kroll, 2011. "Risky investment decisions: How are individuals influenced by their groups?," Journal of Risk and Uncertainty, Springer, vol. 43(2), pages 81-106, October.
Andres, Maximilian & Bruttel, Lisa & Friedrichsen, Jana, 2021. "How do sanctions work? The choice between cartel formation and tacit collusion," VfS Annual Conference 2021 (Virtual Conference): Climate Economics 242372, Verein für Socialpolitik / German Economic Association.
Ito, Arata & Sato, Masahiro & Ota, Rui, 2025. "A novel content-based approach to measuring monetary policy uncertainty using fine-tuned LLMs," Finance Research Letters, Elsevier, vol. 75(C).
Boris Ginzburg & José-Alberto Guerra, 2017. "When Ignorance is Bliss: Theory and Experiment on Collective Learning," Documentos CEDE 15377, Universidad de los Andes, Facultad de Economía, CEDE.
Elten, Jonas van & Penczynski, Stefan P., 2020. "Coordination games with asymmetric payoffs: An experimental study with intra-group communication," Journal of Economic Behavior & Organization, Elsevier, vol. 169(C), pages 158-188.
Nobuyuki Hanaki & Ali I. Ozkes, 2023. "Strategic environment effect and communication," Experimental Economics, Springer;Economic Science Association, vol. 26(3), pages 588-621, July.
Maximilian Andres & Lisa Bruttel & Jana Friedrichsen, 2020. "Choosing between explicit cartel formation and tacit collusion – An experiment," CEPA Discussion Papers 19, Center for Economic Policy Analysis.
Arata ITO & Masahiro SATO & Rui OTA, 2024. "Content-based Metric on Monetary Policy Uncertainty by Using Large Language Models," Discussion papers 24080, Research Institute of Economy, Trade and Industry (RIETI).

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-AIN-2024-07-15 (Artificial Intelligence)
NEP-BIG-2024-07-15 (Big Data)
NEP-CMP-2024-07-15 (Computational Economics)
NEP-EXP-2024-07-15 (Experimental Economics)
