IDEAS home Printed from https://ideas.repec.org/p/uea/wcbess/24-01.html

Using Large Language Models for Text Classification in Experimental Economics

Author

Listed:
  • Can Celebi

    (University of Mannheim)

  • Stefan Penczynski

    (School of Economics and Centre for Behavioural and Experimental Social Science, University of East Anglia)

Abstract

In our study, we compare the classification capabilities of GPT-3.5 and GPT-4 with human annotators using text data from economic experiments. We analysed four text corpora, focusing on two domains: promises and strategic reasoning. Starting with prompts close to those given to human annotators, we subsequently explored alternative prompts to investigate the effect of varying classification instructions and degrees of background information on the models' classification performance. Additionally, we varied the number of examples in a prompt (few-shot vs zero-shot) and the use of the zero-shot "Chain of Thought" prompting technique. Our findings show that GPT-4's performance is comparable to human annotators, achieving accuracy levels near or over 90% in three tasks, and in the most challenging task of classifying strategic thinking in asymmetric coordination games, it reaches an accuracy level above 70%.

Suggested Citation

  • Can Celebi & Stefan Penczynski, 2024. "Using Large Language Models for Text Classification in Experimental Economics," Working Paper series, University of East Anglia, Centre for Behavioural and Experimental Social Science (CBESS) 24-01, School of Economics, University of East Anglia, Norwich, UK..
  • Handle: RePEc:uea:wcbess:24-01
    as

    Download full text from publisher

    File URL: https://ueaeco.github.io/working-papers/papers/cbess/UEA-CBESS-24-01.pdf
    File Function: main text
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Alonso-Robisco, Andres & Carbó, José Manuel, 2023. "Analysis of CBDC narrative by central banks using large language models," Finance Research Letters, Elsevier, vol. 58(PC).
    2. Guarnaschelli, Serena & McKelvey, Richard D. & Palfrey, Thomas R., 2000. "An Experimental Study of Jury Decision Rules," American Political Science Review, Cambridge University Press, vol. 94(2), pages 407-423, June.
    3. Paul Glasserman & Caden Lin, 2023. "Assessing Look-Ahead Bias in Stock Return Predictions Generated By GPT Sentiment Analysis," Papers 2309.17322, arXiv.org.
    4. Smales, Lee A., 2023. "Classification of RBA monetary policy announcements using ChatGPT," Finance Research Letters, Elsevier, vol. 58(PC).
    5. Huseyn Ismayilov & Jan Potters, 2016. "Why do promises affect trustworthiness, or do they?," Experimental Economics, Springer;Economic Science Association, vol. 19(2), pages 382-393, June.
    6. David J. Cooper & John H. Kagel, 2005. "Are Two Heads Better Than One? Team versus Individual Play in Signaling Games," American Economic Review, American Economic Association, vol. 95(3), pages 477-509, June.
    7. Daniel Houser & Erte Xiao, 2011. "Classification of natural language messages using a coordination game," Experimental Economics, Springer;Economic Science Association, vol. 14(1), pages 1-14, March.
    8. Christoph Vanberg, 2008. "Why Do People Keep Their Promises? An Experimental Test of Two Explanations -super-1," Econometrica, Econometric Society, vol. 76(6), pages 1467-1480, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. repec:ces:ceswps:_11521 is not listed on IDEAS
    2. Bruttel, Lisa & Nithammer, Juri, 2025. "Opinion Piece: How to pre-register experimental studies that involve machine learning for text data analysis," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 118(C).
    3. Nina Xue & Lata Gangadharan & Philip J. Grossman, 2025. "Are more heads more motivated than one? The role of communication in group belief updating," Department of Economics Working Papers wuwp375, Vienna University of Economics and Business, Department of Economics.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zakaria Babutsidze & Nobuyuki Hanaki & Adam Zylbersztejn, 2021. "Nonverbal content and trust: An experiment on digital communication," Economic Inquiry, Western Economic Association International, vol. 59(4), pages 1517-1532, October.
    2. Xiangdong Qin & Siyu Wang & Mike Zhiren Wu, 2024. "Is it what you say or how you say it?," Experimental Economics, Springer;Economic Science Association, vol. 27(4), pages 874-921, September.
    3. Tebbe, Eva & Wegener, Benjamin, 2022. "Is natural language processing the cheap charlie of analyzing cheap talk? A horse race between classifiers on experimental communication data," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 96(C).
    4. Julian Junyan Wang & Victor Xiaoqi Wang, 2025. "Assessing Consistency and Reproducibility in the Outputs of Large Language Models: Evidence Across Diverse Finance and Accounting Tasks," Papers 2503.16974, arXiv.org, revised Sep 2025.
    5. Dong, Mengming Michael & Stratopoulos, Theophanis C. & Wang, Victor Xiaoqi, 2024. "A scoping review of ChatGPT research in accounting and finance," International Journal of Accounting Information Systems, Elsevier, vol. 55(C).
    6. Zakaria Babutsidze & Nobuyuki Hanaki & Adam Zylbersztejn, 2020. "Nonverbal content and swift trust: An experiment on digital communication," Working Papers 2008, Groupe d'Analyse et de Théorie Economique Lyon St-Etienne (GATE Lyon St-Etienne), Université de Lyon.
    7. Zakaria Babutsidze & Nobuyuki Hanaki & Adam Zylbersztejn, 2019. "Digital Communication and Swift Trust," Post-Print halshs-02409314, HAL.
    8. Sorravich Kingsuwankul & Chloe Tergiman & Marie Claire Villeval, 2023. "Why do oaths work? Image concerns and credibility in promise keeping," Working Papers hal-04209489, HAL.
    9. Faralla, Valeria & Borà, Guido & Innocenti, Alessandro & Novarese, Marco, 2020. "Promises in group decision making," Research in Economics, Elsevier, vol. 74(1), pages 1-11.
    10. Jiabin Wu, 2018. "Indirect higher order beliefs and cooperation," Experimental Economics, Springer;Economic Science Association, vol. 21(4), pages 858-876, December.
    11. Baethge, Caroline, 2016. "Performance in the beauty contest: How strategic discussion enhances team reasoning," Passauer Diskussionspapiere, Betriebswirtschaftliche Reihe B-17-16, University of Passau, Faculty of Business and Economics.
    12. Cary Deck & Maroš Servátka & Steven Tucker, 2013. "An examination of the effect of messages on cooperation under double-blind and single-blind payoff procedures," Experimental Economics, Springer;Economic Science Association, vol. 16(4), pages 597-607, December.
    13. Dufwenberg, Martin & Feldman, Paul & Servátka, Maroš & Tarrasó, Jorge & Vadovič, Radovan, 2023. "Honesty in the city," Games and Economic Behavior, Elsevier, vol. 139(C), pages 15-25.
      • Dufwenberg, Martin & Servátka, Maroš & Tarrasó, Jorge & Vadovič, Radovan, 2021. "Honesty in the City," MPRA Paper 106256, University Library of Munich, Germany.
      • Martin Dufwenberg & Paul Feldman & Maros Servatka & Jorge Tarraso & Radovan Vadovic, 2022. "Honesty in the City," Working Papers 2022-03, University of Alaska Anchorage, Department of Economics.
      • Dufwenberg, Martin & Feldman, Paul & Servátka, Maroš & Tarrasó, Jorge & Vadovič, Radovan, 2022. "Honesty in the city," MPRA Paper 115044, University Library of Munich, Germany.
    14. Manski, Charles F. & Neri, Claudia, 2013. "First- and second-order subjective expectations in strategic decision-making: Experimental evidence," Games and Economic Behavior, Elsevier, vol. 81(C), pages 232-254.
    15. repec:hal:wpaper:halshs-03620418 is not listed on IDEAS
    16. Jingnan Chen & Daniel Houser, 2017. "Promises and lies: can observers detect deception in written messages," Experimental Economics, Springer;Economic Science Association, vol. 20(2), pages 396-419, June.
    17. Antinyan, Armenak & Corazzini, Luca & D'Agostino, Elena & Pavesi, Filippo, 2023. "Watch your words: An experimental study on communication and the opportunity cost of delegation," Journal of Economic Behavior & Organization, Elsevier, vol. 214(C), pages 216-232.
    18. Giovanni Di Bartolomeo & Martin Dufwenberg & Stefano Papa, 2023. "Promises and partner-switch," Journal of the Economic Science Association, Springer;Economic Science Association, vol. 9(1), pages 77-89, June.
    19. Koukoumelis, Anastasios & Levati, M. Vittoria & Weisser, Johannes, 2012. "Leading by words: A voluntary contribution experiment with one-way communication," Journal of Economic Behavior & Organization, Elsevier, vol. 81(2), pages 379-390.
    20. Attanasi, Giuseppe & Rimbaud, Claire & Villeval, Marie Claire, 2023. "Guilt aversion in (new) games: Does partners' payoff vulnerability matter?," Games and Economic Behavior, Elsevier, vol. 142(C), pages 690-717.
    21. Breitmoser, Yves & Valasek, Justin, 2023. "Why do committees work?," Discussion Paper Series in Economics 18/2023, Norwegian School of Economics, Department of Economics.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:uea:wcbess:24-01. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Cara Liggins (email available below). General contact details of provider: https://edirc.repec.org/data/esueauk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.