IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2507.17564.html

Decoding Consumer Preferences Using Attention-Based Language Models

Author

Listed:
  • Joshua Foster
  • Fredrik Odegaard

Abstract

This paper proposes a new demand estimation method using attention-based language models. An encoder-only language model is trained in a two-stage process to analyze the natural language descriptions of used cars from a large US-based online auction marketplace. The approach enables semi-nonparametrically estimation for the demand primitives of a structural model representing the private valuations and market size for each vehicle listing. In the first stage, the language model is fine-tuned to encode the target auction outcomes using the natural language vehicle descriptions. In the second stage, the trained language model's encodings are projected into the parameter space of the structural model. The model's capability to conduct counterfactual analyses within the trained market space is validated using a subsample of withheld auction data, which includes a set of unique "zero shot" instances.

Suggested Citation

  • Joshua Foster & Fredrik Odegaard, 2025. "Decoding Consumer Preferences Using Attention-Based Language Models," Papers 2507.17564, arXiv.org.
  • Handle: RePEc:arx:papers:2507.17564
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2507.17564
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Susan Athey & Guido W. Imbens, 2019. "Machine Learning Methods That Economists Should Know About," Annual Review of Economics, Annual Reviews, vol. 11(1), pages 685-725, August.
    2. Athey, Susan & Imbens, Guido W., 2019. "Machine Learning Methods Economists Should Know About," Research Papers 3776, Stanford University, Graduate School of Business.
    3. Jens Ludwig & Sendhil Mullainathan, 2024. "Machine Learning as a Tool for Hypothesis Generation," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 139(2), pages 751-827.
    4. Suprateek Sarker & Hillol Bala & Yili Hong & Atreyi Kankanhalli & Matti Rossi & Bin Gu & Gal Oestreicher-Singer, 2025. "Advancing Next-Generation Multimethod Research in Information Systems: A Framework and Some Recommendations for Authors and Evaluators," Information Systems Research, INFORMS, vol. 36(2), pages 647-668, June.
    5. Sameer Mehta & Milind Dawande & Ganesh Janakiraman & Vijay Mookerjee, 2021. "How to Sell a Data Set? Pricing Policies for Data Monetization," Information Systems Research, INFORMS, vol. 32(4), pages 1281-1297, December.
    6. Artem Timoshenko & John R. Hauser, 2019. "Identifying Customer Needs from User-Generated Content," Marketing Science, INFORMS, vol. 38(1), pages 1-20, January.
    7. Kai Yang & Raymond Y. K. Lau & Ahmed Abbasi, 2023. "Getting Personal: A Deep Learning Artifact for Text-Based Measurement of Personality," Information Systems Research, INFORMS, vol. 34(1), pages 194-222, March.
    8. Fedor Iskhakov & John Rust & Bertel Schjerning, 2020. "Machine learning and structural econometrics: contrasts and synergies," The Econometrics Journal, Royal Economic Society, vol. 23(3), pages 81-124.
    9. Ahmed Abbasi & Jeffrey Parsons & Gautam Pant & Olivia R. Liu Sheng & Suprateek Sarker, 2024. "Pathways for Design Research on Artificial Intelligence," Information Systems Research, INFORMS, vol. 35(2), pages 441-459, June.
    10. Gallant, A Ronald & Nychka, Douglas W, 1987. "Semi-nonparametric Maximum Likelihood Estimation," Econometrica, Econometric Society, vol. 55(2), pages 363-390, March.
    11. Martin Bichler & Alok Gupta & Wolfgang Ketter, 2010. "Research Commentary ---Designing Smart Markets," Information Systems Research, INFORMS, vol. 21(4), pages 688-699, December.
    12. van Giffen, Benjamin & Herhausen, Dennis & Fahse, Tobias, 2022. "Overcoming the pitfalls and perils of algorithms: A classification of machine learning biases and mitigation methods," Journal of Business Research, Elsevier, vol. 144(C), pages 93-106.
    13. Fenton, Victor M. & Gallant, A. Ronald, 1996. "Qualitative and asymptotic performance of SNP density estimators," Journal of Econometrics, Elsevier, vol. 74(1), pages 77-118, September.
    14. Robert F. Easley & Rafael Tenorio, 2004. "Jump Bidding Strategies in Internet Auctions," Management Science, INFORMS, vol. 50(10), pages 1407-1419, October.
    15. Sendhil Mullainathan & Jann Spiess, 2017. "Machine Learning: An Applied Econometric Approach," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 87-106, Spring.
    16. Wolfstetter, Elmar, 1996. "Auctions: An Introduction," Journal of Economic Surveys, Wiley Blackwell, vol. 10(4), pages 367-420, December.
    17. Yanzhen Chen & Huaxia Rui & Andrew B. Whinston, 2025. "Conversation Analytics: Can Machines Read Between the Lines in Real-Time Strategic Conversations?," Information Systems Research, INFORMS, vol. 36(1), pages 440-455, March.
    18. Pedro Aceves & James A. Evans, 2024. "Mobilizing Conceptual Spaces: How Word Embedding Models Can Inform Measurement and Theory Within Organization Science," Organization Science, INFORMS, vol. 35(3), pages 788-814, May.
    19. Zenan Chen & Jason Chan, 2024. "Large Language Model in Creative Work: The Role of Collaboration Modality and User Expertise," Management Science, INFORMS, vol. 70(12), pages 9101-9117, December.
    20. Dinesh Puranam & Vrinda Kadiyali & Vishal Narayan, 2021. "The Impact of Increase in Minimum Wages on Consumer Perceptions of Service: A Transformer Model of Online Restaurant Reviews," Marketing Science, INFORMS, vol. 40(5), pages 985-1004, September.
    21. Yixin Lu & Alok Gupta & Wolfgang Ketter & Eric van Heck, 2019. "Information Transparency in Business-to-Business Auction Markets: The Role of Winner Identity Disclosure," Management Science, INFORMS, vol. 65(9), pages 4261-4279, September.
    22. Fenton, Victor M & Gallant, A Ronald, 1996. "Erratum [Convergence Rates of SNP Density Estimators]," Econometrica, Econometric Society, vol. 64(6), pages 1493-1493, November.
    23. Ali Goli & Amandeep Singh, 2024. "Frontiers: Can Large Language Models Capture Human Preferences?," Marketing Science, INFORMS, vol. 43(4), pages 709-722, July.
    24. Keyon Vafa & Susan Athey & David M. Blei, 2025. "Estimating wage disparities using foundation models," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 122(22), pages 2427298122-, June.
    25. Hammaad Adam & Pu He & Fanyin Zheng, 2024. "Machine Learning for Demand Estimation in Long Tail Markets," Management Science, INFORMS, vol. 70(8), pages 5040-5065, August.
    26. De Liu & Adib Bagh, 2020. "Preserving Bidder Privacy in Assignment Auctions: Design and Measurement," Management Science, INFORMS, vol. 66(7), pages 3162-3182, July.
    27. Shawn McCarthy & Gita Alaghband, 2024. "Fin-ALICE: Artificial Linguistic Intelligence Causal Econometrics," JRFM, MDPI, vol. 17(12), pages 1-21, November.
    28. Elliott Ash & Stephen Hansen, 2023. "Text Algorithms in Economics," Annual Review of Economics, Annual Reviews, vol. 15(1), pages 659-688, September.
    29. Christopher Avery, 1998. "Strategic Jump Bidding in English Auctions," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 65(2), pages 185-210.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sophie-Charlotte Klose & Johannes Lederer, 2020. "A Pipeline for Variable Selection and False Discovery Rate Control With an Application in Labor Economics," Papers 2006.12296, arXiv.org, revised Jun 2020.
    2. Hurmeranta, Risto & Lyytikäinen, Teemu, 2025. "Nominal Loss Aversion in the Housing Market and Household Mobility," Working Papers 178, VATT Institute for Economic Research.
    3. Dang, Hai-Anh & Carleto, Gero & Gourlay, Sydney & Abanokova, Kseniya, 2023. "Addressing Soil Quality Data Gaps with Imputation: Evidence from Ethiopia and Uganda," 2023 Annual Meeting, July 23-25, Washington D.C. 335648, Agricultural and Applied Economics Association.
    4. Arenas, Andreu & Calsamiglia, Caterina, 2022. "Gender Differences in High-Stakes Performance and College Admission Policies," IZA Discussion Papers 15550, Institute of Labor Economics (IZA).
    5. Rama K. Malladi, 2024. "Benchmark Analysis of Machine Learning Methods to Forecast the U.S. Annual Inflation Rate During a High-Decile Inflation Period," Computational Economics, Springer;Society for Computational Economics, vol. 64(1), pages 335-375, July.
    6. Tranos, Emmanouil & Incera, Andre Carrascal & Willis, George, 2022. "Using the web to predict regional trade flows: data extraction, modelling, and validation," OSF Preprints 9bu5z, Center for Open Science.
    7. Michael Lechner, 2023. "Causal Machine Learning and its use for public policy," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 159(1), pages 1-15, December.
    8. Blankenship, Brian & Aklin, Michaël & Urpelainen, Johannes & Nandan, Vagisha, 2022. "Jobs for a just transition: Evidence on coal job preferences from India," Energy Policy, Elsevier, vol. 165(C).
    9. Andrei Dubovik & Adam Elbourne & Bram Hendriks & Mark Kattenberg, 2022. "Forecasting World Trade Using Big Data and Machine Learning Techniques," CPB Discussion Paper 441, CPB Netherlands Bureau for Economic Policy Analysis.
    10. Donna B. Gilleskie, 2021. "In sickness and in health, until death do us part: A case for theory," Southern Economic Journal, John Wiley & Sons, vol. 87(3), pages 753-768, January.
    11. Askitas, Nikos, 2024. "A Hands-on Machine Learning Primer for Social Scientists: Math, Algorithms and Code," IZA Discussion Papers 17014, Institute of Labor Economics (IZA).
    12. Arthur Charpentier & Romuald Élie & Carl Remlinger, 2023. "Reinforcement Learning in Economics and Finance," Computational Economics, Springer;Society for Computational Economics, vol. 62(1), pages 425-462, June.
    13. Jinjuan Yang & Jiayuan Xin & Yan Zeng & Pei Jose Liu, 2025. "Signaling and perceiving on equity crowdfunding decisions — a machine learning approach," Small Business Economics, Springer, vol. 65(1), pages 315-356, June.
    14. Liesenfeld, Roman & Breitung, Jörg, 1998. "Simulation based methods of moments in empirical finance," Tübinger Diskussionsbeiträge 136, University of Tübingen, School of Business and Economics.
    15. Ibrahima Sarr & Hai-Anh H. Dang & Carlos Santiago Guzman Gutierrez & Theresa Beltramo & Paolo Verme, 2025. "Using Cross-Survey Imputation to Estimate Poverty for Venezuelan Refugees in Colombia," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 177(1), pages 207-251, March.
    16. Mona Aghdaee & Bonny Parkinson & Kompal Sinha & Yuanyuan Gu & Rajan Sharma & Emma Olin & Henry Cutler, 2022. "An examination of machine learning to map non‐preference based patient reported outcome measures to health state utility values," Health Economics, John Wiley & Sons, Ltd., vol. 31(8), pages 1525-1557, August.
    17. Lily Davies & Mark Kattenberg & Benedikt Vogt, 2023. "Predicting Firm Exits with Machine Learning: Implications for Selection into COVID-19 Support and Productivity Growth," CPB Discussion Paper 444, CPB Netherlands Bureau for Economic Policy Analysis.
    18. Domonkos F. Vamossy, 2024. "Social Media Emotions and Market Behavior," Papers 2404.03792, arXiv.org.
    19. Filmer,Deon P. & Nahata,Vatsal & Sabarwal,Shwetlena, 2021. "Preparation, Practice, and Beliefs : A Machine Learning Approach to Understanding Teacher Effectiveness," Policy Research Working Paper Series 9847, The World Bank.
    20. Bas Bosma & Arjen Witteloostuijn, 2024. "Machine learning in international business," Journal of International Business Studies, Palgrave Macmillan;Academy of International Business, vol. 55(6), pages 676-702, August.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2507.17564. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.