
Decoding Consumer Preferences Using Attention-Based Language Models

Authors

  • Joshua Foster
  • Fredrik Odegaard

Abstract

This paper proposes a new demand estimation method using attention-based language models. An encoder-only language model is trained in a two-stage process to analyze the natural language descriptions of used cars from a large US-based online auction marketplace. The approach enables semi-nonparametric estimation of the demand primitives of a structural model representing the private valuations and market size for each vehicle listing. In the first stage, the language model is fine-tuned to encode the target auction outcomes using the natural language vehicle descriptions. In the second stage, the trained language model's encodings are projected into the parameter space of the structural model. The model's capability to conduct counterfactual analyses within the trained market space is validated using a subsample of withheld auction data, which includes a set of unique "zero-shot" instances.
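
As a rough illustration of the second stage described in the abstract, the sketch below maps a fixed-length text encoding into a small parameter vector through a linear projection. The abstract does not state the functional form of the structural model, so everything specific here is an assumption made for illustration only: the three parameters (`mu` and `sigma` for a hypothetical log-valuation distribution, `lam` for a hypothetical mean number of bidders), the softplus constraint, and the helper names `softplus` and `project_to_params`. It is not the authors' specification.

```python
import math


def softplus(x):
    """Smooth map from the real line to (0, inf), used to keep a parameter positive."""
    # Numerically stable form: max(x, 0) + log(1 + exp(-|x|))
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def project_to_params(encoding, weights, bias):
    """Linearly project a text encoding into three hypothetical structural parameters.

    encoding: list of floats (the language model's output for one listing)
    weights:  three rows, each the same length as `encoding` (assumed learned)
    bias:     three floats (assumed learned)
    """
    raw = [
        sum(w * e for w, e in zip(row, encoding)) + b
        for row, b in zip(weights, bias)
    ]
    return {
        "mu": raw[0],               # assumed location of log-valuations (unconstrained)
        "sigma": softplus(raw[1]),  # assumed scale of log-valuations (must be > 0)
        "lam": softplus(raw[2]),    # assumed mean number of bidders (must be > 0)
    }


# Usage with a toy two-dimensional encoding and made-up weights.
params = project_to_params(
    encoding=[1.0, -1.0],
    weights=[[1.0, 2.0], [3.0, 0.0], [0.0, 0.5]],
    bias=[0.0, 0.0, 0.0],
)
print(params)
```

The positivity constraint is the main design point: an unconstrained linear output can be negative, which would be invalid for a scale or a rate parameter, so the sketch passes those two outputs through softplus before treating them as model parameters.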

Suggested Citation

  • Joshua Foster & Fredrik Odegaard, 2025. "Decoding Consumer Preferences Using Attention-Based Language Models," Papers 2507.17564, arXiv.org.
  • Handle: RePEc:arx:papers:2507.17564

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2507.17564
    File Function: Latest version
    Download Restriction: no


