From Model Choice to Model Belief: Establishing a New Measure for LLM-Based Research

From Model Choice to Model Belief: Establishing a New Measure for LLM-Based Research

Author

Listed:

Hongshen Sun
Juanjuan Zhang

Abstract

Large language models (LLMs) are increasingly used to simulate human behavior, but common practices to use LLM-generated data are inefficient. Treating an LLM's output ("model choice") as a single data point underutilizes the information inherent to the probabilistic nature of LLMs. This paper introduces and formalizes "model belief," a measure derived from an LLM's token-level probabilities that captures the model's belief distribution over choice alternatives in a single generation run. The authors prove that model belief is asymptotically equivalent to the mean of model choices (a non-trivial property) but forms a more statistically efficient estimator, with lower variance and a faster convergence rate. Analogous properties are shown to hold for smooth functions of model belief and model choice often used in downstream applications. The authors demonstrate the performance of model belief through a demand estimation study, where an LLM simulates consumer responses to different prices. In practical settings with limited numbers of runs, model belief explains and predicts ground-truth model choice better than model choice itself, and reduces the computation needed to reach sufficiently accurate estimates by roughly a factor of 20. The findings support using model belief as the default measure to extract more information from LLM-generated data.

Suggested Citation

Hongshen Sun & Juanjuan Zhang, 2025. "From Model Choice to Model Belief: Establishing a New Measure for LLM-Based Research," Papers 2512.23184, arXiv.org.

Handle: RePEc:arx:papers:2512.23184

Download full text from publisher

References listed on IDEAS

Peter M. Guadagni & John D. C. Little, 1983. "A Logit Model of Brand Choice Calibrated on Scanner Data," Marketing Science, INFORMS, vol. 2(3), pages 203-238.
John R. Hauser & Steven M. Shugan, 1980. "Intensity Measures of Consumer Preference," Operations Research, INFORMS, vol. 28(2), pages 278-320, April.
Jens Ludwig & Sendhil Mullainathan & Ashesh Rambachan, 2024. "Large Language Models: An Applied Econometric Framework," Papers 2412.07031, arXiv.org, revised Dec 2025.
- Jens Ludwig & Sendhil Mullainathan & Ashesh Rambachan, 2025. "Large Language Models: An Applied Econometric Framework," NBER Working Papers 33344, National Bureau of Economic Research, Inc.
Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555, November.
- Kenneth Train, 2003. "Discrete Choice Methods with Simulation," Online economics textbooks, SUNY-Oswego, Department of Economics, number emetr2, December.
- Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521747387, Enero-Abr.
Artem Timoshenko & Chengfeng Mao & John R. Hauser, 2025. "Transforming the Voice of the Customer: Large Language Models for Identifying Customer Needs," Papers 2503.01870, arXiv.org, revised Apr 2026.
Song Lin & Juanjuan Zhang & John R. Hauser, 2015. "Learning from Experience, Simply," Marketing Science, INFORMS, vol. 34(1), pages 1-19, January.
Daniel McFadden, 1986. "The Choice Theory Approach to Market Research," Marketing Science, INFORMS, vol. 5(4), pages 275-297.
Nitin Mehta & Surendra Rajiv & Kannan Srinivasan, 2003. "Price Uncertainty and Consumer Search: A Structural Model of Consideration Set Formation," Marketing Science, INFORMS, vol. 22(1), pages 58-84, June.
Zikun Ye & Hema Yoganarasimhan & Yufeng Zheng, 2025. "LOLA: LLM-Assisted Online Learning Algorithm for Content Experiments," Marketing Science, INFORMS, vol. 44(5), pages 995-1016, September.
Yinheng Li & Shaofei Wang & Han Ding & Hang Chen, 2023. "Large Language Models in Finance: A Survey," Papers 2311.10723, arXiv.org, revised Jul 2024.
Ali Goli & Amandeep Singh, 2024. "Frontiers: Can Large Language Models Capture Human Preferences?," Marketing Science, INFORMS, vol. 43(4), pages 709-722, July.
Charness, Gary & Gneezy, Uri & Kuhn, Michael A., 2012. "Experimental methods: Between-subject and within-subject design," Journal of Economic Behavior & Organization, Elsevier, vol. 81(1), pages 1-8.
Monica Wadhwa & Kuangjie Zhang, 2015. "This Number Just Feels Right: The Impact of Roundedness of Price Numbers on Product Evaluations," Journal of Consumer Research, Journal of Consumer Research Inc., vol. 41(5), pages 1172-1185.
Argyle, Lisa P. & Busby, Ethan C. & Fulda, Nancy & Gubler, Joshua R. & Rytting, Christopher & Wingate, David, 2023. "Out of One, Many: Using Language Models to Simulate Human Samples," Political Analysis, Cambridge University Press, vol. 31(3), pages 337-351, July.
Ilia Shumailov & Zakhar Shumaylov & Yiren Zhao & Nicolas Papernot & Ross Anderson & Yarin Gal, 2024. "AI models collapse when trained on recursively generated data," Nature, Nature, vol. 631(8022), pages 755-759, July.
Hauser, John R & Wernerfelt, Birger, 1990. "An Evaluation Cost Model of Consideration Sets," Journal of Consumer Research, Journal of Consumer Research Inc., vol. 16(4), pages 393-408, March.
Pradeep K. Chintagunta & Harikesh S. Nair, 2011. "Structural Workshop Paper --Discrete-Choice Models of Consumer Demand in Marketing," Marketing Science, INFORMS, vol. 30(6), pages 977-996, November.
John J. Horton & Apostolos Filippas & Benjamin S. Manning, 2023. "Large Language Models as Simulated Economic Agents: What Can We Learn from Homo Silicus?," NBER Working Papers 31122, National Bureau of Economic Research, Inc.
John J. Horton & Apostolos Filippas & Benjamin S. Manning, 2023. "Large Language Models as Simulated Economic Agents: What Can We Learn from Homo Silicus?," Papers 2301.07543, arXiv.org, revised Feb 2026.
Peiyao Li & Noah Castelo & Zsolt Katona & Miklos Sarvary, 2024. "Frontiers: Determining the Validity of Large Language Models for Automated Perceptual Analysis," Marketing Science, INFORMS, vol. 43(2), pages 254-266, March.
George Gui & Olivier Toubia, 2023. "The Challenge of Using LLMs to Simulate Human Behavior: A Causal Inference Perspective," Papers 2312.15524, arXiv.org, revised Nov 2025.
Abhijit V. Banerjee, 1992. "A Simple Model of Herd Behavior," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 107(3), pages 797-817.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

George Gui & Seungwoo Kim, 2025. "Leveraging LLMs to Improve Experimental Design: A Generative Stratification Approach," Papers 2509.25709, arXiv.org.
Koji Takahashi & Joon Suk Park, 2025. "Generative AI for Surveys on Payment Apps: AIs' View on Privacy and Technology," IMES Discussion Paper Series 25-E-13, Institute for Monetary and Economic Studies, Bank of Japan.
Hui Chen & Antoine Didisheim & Mohammad & Pourmohammadi & Luciano Somoza & Hanqing Tian, 2025. "A Financial Brain Scan of the LLM," Papers 2508.21285, arXiv.org, revised Feb 2026.
Yuan Gao & Dokyun Lee & Gordon Burtch & Sina Fazelpour, 2024. "Take Caution in Using LLMs as Human Surrogates: Scylla Ex Machina," Papers 2410.19599, arXiv.org, revised Jan 2025.
Matthew O. Jackson & Qiaozhu Me & Stephanie W. Wang & Yutong Xie & Walter Yuan & Seth Benzell & Erik Brynjolfsson & Colin F. Camerer & James Evans & Brian Jabarian & Jon Kleinberg & Juanjuan Meng & Se, 2025. "AI Behavioral Science," Papers 2509.13323, arXiv.org.
Elisabeth Honka & Pradeep Chintagunta, 2017. "Simultaneous or Sequential? Search Strategies in the U.S. Auto Insurance Industry," Marketing Science, INFORMS, vol. 36(1), pages 21-42, January.
Anne Lundgaard Hansen & Seung Jung Lee, 2025. "Financial Stability Implications of Generative AI: Taming the Animal Spirits," Papers 2510.01451, arXiv.org.
Ali Aouad & Danny Segev, 2021. "Display Optimization for Vertically Differentiated Locations Under Multinomial Logit Preferences," Management Science, INFORMS, vol. 67(6), pages 3519-3550, June.
Seung Jung Lee & Anne Lundgaard Hansen, 2025. "Financial Stability Implications of Generative AI: Taming the Animal Spirits," Finance and Economics Discussion Series 2025-090, Board of Governors of the Federal Reserve System (U.S.).
Wayne Gao & Sukjin Han & Annie Liang, 2026. "How Well Do LLMs Predict Human Behavior? A Measure of their Pretrained Knowledge," Papers 2601.12343, arXiv.org.
Ferraz, Vinícius & Olah, Tamas & Sazedul, Ratin & Schmidt, Robert & Schwieren, Christiane, 2025. "When Artificial Minds Negotiate: Dark Personality and the Ultimatum Game in Large Language Models," Working Papers 0768, University of Heidelberg, Department of Economics.
Paola Cillo & Gaia Rubera, 2025. "Generative AI in innovation and marketing processes: A roadmap of research opportunities," Journal of the Academy of Marketing Science, Springer, vol. 53(3), pages 684-701, May.
Hortense Fong & George Gui, 2024. "Modeling Story Expectations to Understand Engagement: A Generative Framework Using LLMs," Papers 2412.15239, arXiv.org, revised Jul 2025.
Joseph Pancras, 2010. "A Framework to Determine the Value of Consumer Consideration Set Information for Firm Pricing Strategies," Computational Economics, Springer;Society for Computational Economics, vol. 35(3), pages 269-300, March.
Nikoleta Anesti & Edward Hill & Andreas Joseph, 2025. "Inflation Attitudes of Large Language Models," Papers 2512.14306, arXiv.org.
Andrés Elberg & Pedro M. Gardete & Rosario Macera & Carlos Noton, 2019. "Dynamic effects of price promotions: field evidence, consumer search, and supply-side implications," Quantitative Marketing and Economics (QME), Springer, vol. 17(1), pages 1-58, March.
Herhausen, Dennis & Ludwig, Stephan & Abedin, Ehsan & Haque, Nasim Ul & de Jong, David, 2025. "From words to insights: Text analysis in business research," Journal of Business Research, Elsevier, vol. 198(C).
Shu Wang & Zijun Yao & Shuhuai Zhang & Jianuo Gai & Tracy Xiao Liu & Songfa Zhong, 2025. "When Experimental Economics Meets Large Language Models: Evidence-based Tactics," Papers 2505.21371, arXiv.org, revised Jul 2025.
Peter Stüttgen & Peter Boatwright & Robert T. Monroe, 2012. "A Satisficing Choice Model," Marketing Science, INFORMS, vol. 31(6), pages 878-899, November.
Andrew T. Ching & Tülin Erdem & Michael P. Keane, 2020. "How much do consumers know about the quality of products? Evidence from the diaper market," The Japanese Economic Review, Springer, vol. 71(4), pages 541-569, October.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-AIN-2026-01-12 (Artificial Intelligence)
NEP-CMP-2026-01-12 (Computational Economics)
NEP-DCM-2026-01-12 (Discrete Choice Models)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2512.23184. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

From Model Choice to Model Belief: Establishing a New Measure for LLM-Based Research

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data