IDEAS home Printed from https://ideas.repec.org/p/osf/socarx/y74mu_v1.html

Using large language models as a source of human behavioral data in social science experiments

Author

Listed:
  • van Loon, Austin
  • Kanopka, Klint

    (New York University)

Abstract

Large language models (LLMs) have prompted proposals to replace human subjects in social science experiments with simulated responses. Empirical evaluations suggest that this practice---often called silicon sampling---can sometimes approximate human behavior but is unreliable. We delineate where this approach may still provide value and where it may not, but primarily study an alternative approach: one in which model-based predictions are used not as substitutes for human data, but as auxiliary measurements within randomized experiments. We formalize the inference of causal estimands from mixed-subjects randomized controlled trials, in which outcomes are observed for a subset of units while predictions are available for all units. Under transparent design conditions, we derive a family of estimators that remain unbiased for the average treatment effect in finite samples while exploiting predictions to reduce variance. We characterize when prediction-powered, calibration-based, arm-specifically tuned, and difference-in-predictions estimators improve precision, and we provide a software package which operationalizes these results and aids researchers to jointly select estimators and allocate budgets between human data collection and prediction generation. Together, our results show how generative artificial intelligence can improve experimental social science without compromising scientific validity.

Suggested Citation

  • van Loon, Austin & Kanopka, Klint, 2026. "Using large language models as a source of human behavioral data in social science experiments," SocArXiv y74mu_v1, Center for Open Science.
  • Handle: RePEc:osf:socarx:y74mu_v1
    DOI: 10.31219/osf.io/y74mu_v1
    as

    Download full text from publisher

    File URL: https://osf.io/download/69cebb3f98a7740c34744970/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/y74mu_v1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. John J. Horton & Apostolos Filippas & Benjamin S. Manning, 2023. "Large Language Models as Simulated Economic Agents: What Can We Learn from Homo Silicus?," NBER Working Papers 31122, National Bureau of Economic Research, Inc.
    2. Arel-Bundock, Vincent & Briggs, Ryan C. & Doucouliagos, Hristos & Mendoza Aviña, Marco & Stanley, Tom D., 2022. "Quantitative Political Science Research is Greatly Underpowered," I4R Discussion Paper Series 6, The Institute for Replication (I4R).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Felix Chopra & Ingar Haaland, 2023. "Conducting qualitative interviews with AI," CEBI working paper series 23-06, University of Copenhagen. Department of Economics. The Center for Economic Behavior and Inequality (CEBI).
    2. Hua Li & Qifang Wang & Ye Wu, 2025. "From Mobile Media to Generative AI: The Evolutionary Logic of Computational Social Science Across Data, Methods, and Theory," Mathematics, MDPI, vol. 13(19), pages 1-17, September.
    3. Ben Weidmann & Yixian Xu & David J. Deming, 2025. "Measuring Human Leadership Skills with Artificially Intelligent Agents," Papers 2508.02966, arXiv.org.
    4. Wenjun Cao, 2025. "Black Box Absorption: LLMs Undermining Innovative Ideas," Papers 2510.20612, arXiv.org.
    5. Iñaki Aldasoro & Ajit Desai, 2025. "Money Talks: AI Agents for Cash Management in Payment Systems," Staff Working Papers 25-35, Bank of Canada.
    6. Joy Buchanan & Stephen Hill & Olga Shapoval, 2024. "ChatGPT Hallucinates Non-existent Citations: Evidence from Economics," The American Economist, Sage Publications, vol. 69(1), pages 80-87, March.
    7. Siting Estee Lu, 2024. "Strategic Interactions between Large Language Models-based Agents in Beauty Contests," Papers 2404.08492, arXiv.org, revised Oct 2024.
    8. Felipe A. Csaszar & Harsh Ketkar & Hyunjin Kim, 2024. "Artificial Intelligence and Strategic Decision-Making: Evidence from Entrepreneurs and Investors," Strategy Science, INFORMS, vol. 9(4), pages 322-345, December.
    9. Alejandro Lopez-Lira & Yuehua Tang & Mingyin Zhu, 2025. "The Memorization Problem: Can We Trust LLMs' Economic Forecasts?," Papers 2504.14765, arXiv.org, revised Dec 2025.
    10. Shumiao Ouyang & Hayong Yun & Xingjian Zheng, 2024. "AI as Decision-Maker: Ethics and Risk Preferences of LLMs," Papers 2406.01168, arXiv.org, revised Jun 2025.
    11. Thomas Henning & Siddhartha M. Ojha & Ross Spoon & Jiatong Han & Colin F. Camerer, 2025. "LLM Agents Do Not Replicate Human Market Traders: Evidence From Experimental Finance," Papers 2502.15800, arXiv.org, revised Oct 2025.
    12. Kevin Leyton-Brown & Paul Milgrom & Neil Newman & Ilya Segal, 2024. "Artificial Intelligence and Market Design: Lessons Learned from Radio Spectrum Reallocation," NBER Chapters, in: New Directions in Market Design, pages 119-151, National Bureau of Economic Research, Inc.
    13. Yuqi Nie & Yaxuan Kong & Xiaowen Dong & John M. Mulvey & H. Vincent Poor & Qingsong Wen & Stefan Zohren, 2024. "A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges," Papers 2406.11903, arXiv.org.
    14. C. Monica Capra & Thomas J. Kniesner, 2025. "Daniel Kahneman’s underappreciated last published paper: Empirical implications for benefit-cost analysis and a chat session discussion with bots," Journal of Risk and Uncertainty, Springer, vol. 71(1), pages 29-51, August.
    15. Cagno, Daniela Di & Lin, Lihui, 2025. "How do individuals interact with an AI advisor in strategic reasoning? An experimental study in beauty contest," Journal of Economic Behavior & Organization, Elsevier, vol. 237(C).
    16. Ayato Kitadai & Sinndy Dayana Rico Lugo & Yudai Tsurusaki & Yusuke Fukasawa & Nariaki Nishino, 2024. "Can AI with High Reasoning Ability Replicate Human-like Decision Making in Economic Experiments?," Papers 2406.11426, arXiv.org.
    17. Navid Ghaffarzadegan & Aritra Majumdar & Ross Williams & Niyousha Hosseinichimeh, 2024. "Generative agent‐based modeling: an introduction and tutorial," System Dynamics Review, System Dynamics Society, vol. 40(1), January.
    18. Kevin He & Ran Shorrer & Mengjia Xia, 2025. "Human Misperception of Generative-AI Alignment:A Laboratory Experiment," PIER Working Paper Archive 25-019, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
    19. Hongshen Sun & Juanjuan Zhang, 2025. "From Model Choice to Model Belief: Establishing a New Measure for LLM-Based Research," Papers 2512.23184, arXiv.org.
    20. Felipe A. Csaszar & Aticus Peterson & Daniel Wilde, 2026. "The Strategic Foresight of LLMs: Evidence from a Fully Prospective Venture Tournament," Papers 2602.01684, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:socarx:y74mu_v1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://arabixiv.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.