IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2503.20711.html
   My bibliography  Save this paper

Demand Estimation with Text and Image Data

Author

Listed:
  • Giovanni Compiani
  • Ilya Morozov
  • Stephan Seiler

Abstract

We propose a demand estimation method that leverages unstructured text and image data to infer substitution patterns. Using pre-trained deep learning models, we extract embeddings from product images and textual descriptions and incorporate them into a random coefficients logit model. This approach enables researchers to estimate demand even when they lack data on product attributes or when consumers value hard-to-quantify attributes, such as visual design or functional benefits. Using data from a choice experiment, we show that our approach outperforms standard attribute-based models in counterfactual predictions of consumers' second choices. We also apply it across 40 product categories on Amazon and consistently find that text and image data help identify close substitutes within each category.

Suggested Citation

  • Giovanni Compiani & Ilya Morozov & Stephan Seiler, 2025. "Demand Estimation with Text and Image Data," Papers 2503.20711, arXiv.org, revised Mar 2025.
  • Handle: RePEc:arx:papers:2503.20711
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2503.20711
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Amil Petrin, 2002. "Quantifying the Benefits of New Products: The Case of the Minivan," Journal of Political Economy, University of Chicago Press, vol. 110(4), pages 705-729, August.
    2. Giovanni Compiani, 2022. "Market counterfactuals and the specification of multiproduct demand: A nonparametric approach," Quantitative Economics, Econometric Society, vol. 13(2), pages 545-591, May.
    3. Koppelman, Frank S. & Wen, Chieh-Hua, 2000. "The paired combinatorial logit model: properties, estimation and application," Transportation Research Part B: Methodological, Elsevier, vol. 34(2), pages 75-89, February.
    4. Vivek Bhattacharya & Gastón Illanes & David Stillerman, 2023. "Merger Effects and Antitrust Enforcement: Evidence from US Consumer Packaged Goods," NBER Working Papers 31123, National Bureau of Economic Research, Inc.
    5. Hunt Allcott & Benjamin B Lockwood & Dmitry Taubinsky, 2019. "Regressive Sin Taxes, with an Application to the Optimal Soda Tax," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 134(3), pages 1557-1626.
    6. Jerry A. Hausman, 1996. "Valuation of New Goods under Perfect and Imperfect Competition," NBER Chapters, in: The Economics of New Goods, pages 207-248, National Bureau of Economic Research, Inc.
    7. Kelvin J. Lancaster, 1966. "A New Approach to Consumer Theory," Journal of Political Economy, University of Chicago Press, vol. 74(2), pages 132-132.
    8. Pinkse, Joris & Slade, Margaret E., 2004. "Mergers, brand competition, and the price of a pint," European Economic Review, Elsevier, vol. 48(3), pages 617-643, June.
    9. Christopher Conlon & Julie Holland Mortimer, 2021. "Empirical properties of diversion ratios," RAND Journal of Economics, RAND Corporation, vol. 52(4), pages 693-726, December.
    10. Hendrik Döpper & Alexander MacKay & Nathan H. Miller & Joel Stiebale, 2024. "Rising Markups and the Role of Consumer Preferences," NBER Working Papers 32739, National Bureau of Economic Research, Inc.
    11. Günter J. Hitsch & Ali Hortaçsu & Xiliang Lin, 2021. "Prices and promotions in U.S. retail markets," Quantitative Marketing and Economics (QME), Springer, vol. 19(3), pages 289-368, December.
    12. Daniel McFadden & Kenneth Train, 2000. "Mixed MNL models for discrete response," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 15(5), pages 447-470.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mogens Fosgerau & Julien Monardo & André de Palma, 2024. "The Inverse Product Differentiation Logit Model," American Economic Journal: Microeconomics, American Economic Association, vol. 16(4), pages 329-370, November.
    2. Steven T. Berry & Philip A. Haile, 2024. "Nonparametric Identification of Differentiated Products Demand Using Micro Data," Econometrica, Econometric Society, vol. 92(4), pages 1135-1162, July.
    3. Wang, Ao, 2023. "Sieve BLP: A semi-nonparametric model of demand for differentiated products," Journal of Econometrics, Elsevier, vol. 235(2), pages 325-351.
    4. Bokhari, Farasat A.S. & Mariuzzo, Franco, 2018. "Demand estimation and merger simulations for drugs: Logits v. AIDS," International Journal of Industrial Organization, Elsevier, vol. 61(C), pages 653-685.
    5. Martin O'Connell & Pierre Dubois & Rachel Griffith, 2022. "The Use of Scanner Data for Economics Research," Annual Review of Economics, Annual Reviews, vol. 14(1), pages 723-745, August.
    6. Aviv Nevo, 2000. "A Practitioner's Guide to Estimation of Random‐Coefficients Logit Models of Demand," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 9(4), pages 513-548, December.
    7. Greene, David & Hossain, Anushah & Hofmann, Julia & Helfand, Gloria & Beach, Robert, 2018. "Consumer willingness to pay for vehicle attributes: What do we Know?," Transportation Research Part A: Policy and Practice, Elsevier, vol. 118(C), pages 258-279.
    8. Pietro Tebaldi & Alexander Torgovitsky & Hanbin Yang, 2023. "Nonparametric Estimates of Demand in the California Health Insurance Exchange," Econometrica, Econometric Society, vol. 91(1), pages 107-146, January.
    9. Genakos, Christos D., 2004. "Differential merger effects: the case of the personal computer industry," LSE Research Online Documents on Economics 6726, London School of Economics and Political Science, LSE Library.
    10. Bonnet, Céline & Richards, Timothy J., 2016. "Models of Consumer Demand for Differentiated Products," TSE Working Papers 16-741, Toulouse School of Economics (TSE).
    11. Julia González & M. Victoria Lacaze, 2012. "Preferences, Market Structure, and Welfare Evaluations in the Argentinean FFP Industry: A Case in Buenos Aires Province," Agribusiness, John Wiley & Sons, Ltd., vol. 28(3), pages 341-360, June.
    12. Torshizi Mohammad & Fulton Murray E. & Gray Richard S., 2018. "Non-Linear Demand in a Linear Town," Journal of Agricultural & Food Industrial Organization, De Gruyter, vol. 16(2), pages 1-12, November.
    13. Czajkowski, Mikołaj & Zagórska, Katarzyna & Letki, Natalia & Tryjanowski, Piotr & Wąs, Adam, 2021. "Drivers of farmers’ willingness to adopt extensive farming practices in a globally important bird area," Land Use Policy, Elsevier, vol. 107(C).
    14. Pereira, Pedro & Ribeiro, Tiago, 2011. "The impact on broadband access to the Internet of the dual ownership of telephone and cable networks," International Journal of Industrial Organization, Elsevier, vol. 29(2), pages 283-293, March.
    15. Choi, Andy S., 2013. "Nonmarket values of major resources in the Korean DMZ areas: A test of distance decay," Ecological Economics, Elsevier, vol. 88(C), pages 97-107.
    16. Doherty, Edel & Campbell, Danny, 2011. "Demand for improved food safety and quality: a cross-regional comparison," 85th Annual Conference, April 18-20, 2011, Warwick University, Coventry, UK 108791, Agricultural Economics Society.
    17. Redding, Stephen J. & Weinstein, David E., 2016. "A unified approach to estimating demand and welfare," LSE Research Online Documents on Economics 67681, London School of Economics and Political Science, LSE Library.
    18. Paleti, Rajesh, 2018. "Generalized multinomial probit Model: Accommodating constrained random parameters," Transportation Research Part B: Methodological, Elsevier, vol. 118(C), pages 248-262.
    19. Veneziani, Mario & Sckokai, Paolo & Moro, Daniele, 2012. "Consumers’ willingness to pay for a functional food," 2012 First Congress, June 4-5, 2012, Trento, Italy 124101, Italian Association of Agricultural and Applied Economics (AIEAA).
    20. Khan, Mohammed Tajuddin & Kishore, Avinash & Joshi, Pramod K., 2016. "Gender dimensions on farmers’ preferences for direct-seeded rice with drum seeder in India," IFPRI discussion papers 1550, International Food Policy Research Institute (IFPRI).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2503.20711. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.