Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questions

My bibliography Save this paper

Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questions

Author

Listed:

Olivier Toubia
George Z. Gui
Tianyi Peng
Daniel J. Merlau
Ang Li
Haozhe Chen

Registered:

Abstract

LLM-based digital twin simulation, where large language models are used to emulate individual human behavior, holds great promise for research in AI, social science, and digital experimentation. However, progress in this area has been hindered by the scarcity of real, individual-level datasets that are both large and publicly available. This lack of high-quality ground truth limits both the development and validation of digital twin methodologies. To address this gap, we introduce a large-scale, public dataset designed to capture a rich and holistic view of individual human behavior. We survey a representative sample of $N = 2,058$ participants (average 2.42 hours per person) in the US across four waves with 500 questions in total, covering a comprehensive battery of demographic, psychological, economic, personality, and cognitive measures, as well as replications of behavioral economics experiments and a pricing survey. The final wave repeats tasks from earlier waves to establish a test-retest accuracy baseline. Initial analyses suggest the data are of high quality and show promise for constructing digital twins that predict human behavior well at the individual and aggregate levels. By making the full dataset publicly available, we aim to establish a valuable testbed for the development and benchmarking of LLM-based persona simulations. Beyond LLM applications, due to its unique breadth and scale the dataset also enables broad social science research, including studies of cross-construct correlations and heterogeneous treatment effects.

Suggested Citation

Olivier Toubia & George Z. Gui & Tianyi Peng & Daniel J. Merlau & Ang Li & Haozhe Chen, 2025. "Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questions," Papers 2505.17479, arXiv.org.

Handle: RePEc:arx:papers:2505.17479

Download full text from publisher

References listed on IDEAS

Gergana Y. Nenkov & Maureen Morrin & Andrew Ward & Barry Schwartz & John Hulland, 2008. "A short form of the Maximization Scale: Factor structure, reliability and validity studies," Judgment and Decision Making, Society for Judgment and Decision Making, vol. 3, pages 371-388, June.
John J. Horton, 2023. "Large Language Models as Simulated Economic Agents: What Can We Learn from Homo Silicus?," NBER Working Papers 31122, National Bureau of Economic Research, Inc.
Richard H. Thaler, 2008. "Mental Accounting and Consumer Choice," Marketing Science, INFORMS, vol. 27(1), pages 15-25, 01-02.
- Richard Thaler, 1985. "Mental Accounting and Consumer Choice," Marketing Science, INFORMS, vol. 4(3), pages 199-214.
repec:cup:judgdm:v:3:y:2008:i::p:371-388 is not listed on IDEAS
Scott I. Rick & Cynthia E. Cryder & George Loewenstein, 2008. "Tightwads and Spendthrifts," Journal of Consumer Research, Journal of Consumer Research Inc., vol. 34(6), pages 767-782, October.
John J. Horton, 2023. "Large Language Models as Simulated Economic Agents: What Can We Learn from Homo Silicus?," Papers 2301.07543, arXiv.org.
Fabio Motoki & Valdemar Pinho Neto & Victor Rodrigues, 2024. "More human than human: measuring ChatGPT political bias," Public Choice, Springer, vol. 198(1), pages 3-23, January.
Nenkov, Gergana Y. & Morrin, Maureen & Ward, Andrew & Schwartz, Barry & Hulland, John, 2008. "A short form of the Maximization Scale: Factor structure, reliability and validity studies," Judgment and Decision Making, Cambridge University Press, vol. 3(5), pages 371-388, June.
Guth, Werner & Schmittberger, Rolf & Schwarze, Bernd, 1982. "An experimental analysis of ultimatum bargaining," Journal of Economic Behavior & Organization, Elsevier, vol. 3(4), pages 367-388, December.
Eric J Johnson & Stephan Meier & Olivier Toubia, 2019. "What’s the Catch? Suspicion of Bank Motives and Sluggish Refinancing," The Review of Financial Studies, Society for Financial Studies, vol. 32(2), pages 467-495.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

George Gui & Seungwoo Kim, 2025. "Leveraging LLMs to Improve Experimental Design: A Generative Stratification Approach," Papers 2509.25709, arXiv.org.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

George Gui & Seungwoo Kim, 2025. "Leveraging LLMs to Improve Experimental Design: A Generative Stratification Approach," Papers 2509.25709, arXiv.org.
Augusto Gonzalez-Bonorino & Monica Capra & Emilio Pantoja, 2025. "LLMs Model Non-WEIRD Populations: Experiments with Synthetic Cultural Agents," Papers 2501.06834, arXiv.org.
Yingnan Yan & Tianming Liu & Yafeng Yin, 2025. "Valuing Time in Silicon: Can Large Language Model Replicate Human Value of Travel Time," Papers 2507.22244, arXiv.org.
Yizhao Jiang, 2022. "The Influence of Payment Method: Do Consumers Pay More with Mobile Payment?," Papers 2210.14631, arXiv.org.
Andreas Leibbrandt, 2016. "Behavioral Constraints on Pricing: Experimental Evidence on Price Discrimination and Customer Antagonism," CESifo Working Paper Series 6214, CESifo.
Kevin Leyton-Brown & Paul Milgrom & Neil Newman & Ilya Segal, 2024. "Artificial Intelligence and Market Design: Lessons Learned from Radio Spectrum Reallocation," NBER Chapters, in: New Directions in Market Design, National Bureau of Economic Research, Inc.
Thomas Wagner, 1998. "Reciprocity And Efficiency," Rationality and Society, , vol. 10(3), pages 347-375, August.
C. Monica Capra & Thomas J. Kniesner, 2025. "Daniel Kahneman’s underappreciated last published paper: Empirical implications for benefit-cost analysis and a chat session discussion with bots," Journal of Risk and Uncertainty, Springer, vol. 71(1), pages 29-51, August.
- Capra, C. Monica & Kniesner, Thomas J., 2025. "Daniel Kahneman’s Underappreciated Last Published Paper: Empirical Implications for Benefit-Cost Analysis and a Chat Session Discussion with Bots," IZA Discussion Papers 17841, Institute of Labor Economics (IZA).
Ho, Cony Ming-Shen & Chin, Shih-Chun (Daniel) & Wang, TzuShuo Ryan, 2025. "Recurring versus one-time donation requests: The toll on attracting donors," Journal of Business Research, Elsevier, vol. 192(C).
Francisco Gomes & Michael Haliassos & Tarun Ramadorai, 2021. "Household Finance," Journal of Economic Literature, American Economic Association, vol. 59(3), pages 919-1000, September.
- Haliassos, Michael & Gomes, Francisco, 2020. "Household Finance," CEPR Discussion Papers 14502, C.E.P.R. Discussion Papers.
- Gomes, Francisco J. & Haliassos, Michael & Ramadorai, Tarun, 2020. "Household finance," IMFS Working Paper Series 138, Goethe University Frankfurt, Institute for Monetary and Financial Stability (IMFS).
Kirshner, Samuel N., 2024. "GPT and CLT: The impact of ChatGPT's level of abstraction on consumer recommendations," Journal of Retailing and Consumer Services, Elsevier, vol. 76(C).
Wen, Tong & Leung, Xi Y. & Li, Bin & Hu, Lingyan, 2021. "Examining framing effect in travel package purchase: An application of double-entry mental accounting theory," Annals of Tourism Research, Elsevier, vol. 90(C).
Shu Wang & Zijun Yao & Shuhuai Zhang & Jianuo Gai & Tracy Xiao Liu & Songfa Zhong, 2025. "When Experimental Economics Meets Large Language Models: Evidence-based Tactics," Papers 2505.21371, arXiv.org, revised Jul 2025.
Rabin, Matthew, 1993. "Incorporating Fairness into Game Theory and Economics," American Economic Review, American Economic Association, vol. 83(5), pages 1281-1302, December.
- Matthew Rabin., 1992. "Incorporating Fairness into Game Theory and Economics," Economics Working Papers 92-199, University of California at Berkeley.
- M. Rabin, 2001. "Incorporating Fairness into Game Theory and Economics," Levine's Working Paper Archive 511, David K. Levine.
Zengqing Wu & Run Peng & Xu Han & Shuyuan Zheng & Yixin Zhang & Chuan Xiao, 2023. "Smart Agent-Based Modeling: On the Use of Large Language Models in Computer Simulations," Papers 2311.06330, arXiv.org, revised Dec 2023.
repec:osf:osfxxx:udz28_v1 is not listed on IDEAS
Hui Chen & Antoine Didisheim & Luciano Somoza & Hanqing Tian, 2025. "A Financial Brain Scan of the LLM," Papers 2508.21285, arXiv.org.
Joshua C. Yang & Damian Dailisan & Marcin Korecki & Carina I. Hausladen & Dirk Helbing, 2024. "LLM Voting: Human Choices and AI Collective Decision Making," Papers 2402.01766, arXiv.org, revised Aug 2024.
Elif Akata & Lion Schulz & Julian Coda-Forno & Seong Joon Oh & Matthias Bethge & Eric Schulz, 2025. "Playing repeated games with large language models," Nature Human Behaviour, Nature, vol. 9(7), pages 1380-1390, July.
Nir Chemaya & Daniel Martin, 2024. "Perceptions and detection of AI use in manuscript preparation for academic journals," PLOS ONE, Public Library of Science, vol. 19(7), pages 1-16, July.
- Nir Chemaya & Daniel Martin, 2023. "Perceptions and Detection of AI Use in Manuscript Preparation for Academic Journals," Papers 2311.14720, arXiv.org, revised Jan 2024.
Lijia Ma & Xingchen Xu & Yong Tan, 2024. "Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based Search Engines," Papers 2402.19421, arXiv.org.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-AIN-2025-06-16 (Artificial Intelligence)
NEP-CMP-2025-06-16 (Computational Economics)
NEP-EXP-2025-06-16 (Experimental Economics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2505.17479. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questions

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data