Printed from https://ideas.repec.org/a/inm/ormksc/v40y2021i5p844-870.html

Understanding Large-Scale Dynamic Purchase Behavior

Author

Listed:
  • Bruno Jacobs

    (Robert H. Smith School of Business, University of Maryland, College Park, Maryland 20742)

  • Dennis Fok

    (Erasmus School of Economics, Erasmus University Rotterdam, Rotterdam 3062 PA, Netherlands)

  • Bas Donkers

    (Erasmus School of Economics, Erasmus University Rotterdam, Rotterdam 3062 PA, Netherlands)

Abstract

In modern retail contexts, retailers sell products from vast product assortments to a large and heterogeneous customer base. Understanding purchase behavior in such a context is very important. Standard models cannot be used because of the high dimensionality of the data. We propose a new model that creates an efficient dimension reduction through the idea of purchase motivations. We only require customer-level purchase history data, which is ubiquitous in modern retailing. The model handles large-scale data and even works in settings with shopping trips consisting of few purchases. Essential features of our model are that it accounts for the product, customer, and time dimensions present in purchase history data; relates the relevance of motivations to customer- and shopping-trip characteristics; captures interdependencies between motivations; and achieves superior predictive performance. Estimation results from this comprehensive model provide deep insights into purchase behavior. Such insights can be used by managers to create more intuitive, better informed, and more effective marketing actions. As scalability of the model is essential for practical applicability, we develop a fast, custom-made inference algorithm based on variational inference. We illustrate the model using purchase history data from a Fortune 500 retailer involving more than 4,000 unique products.
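The dimension-reduction idea in the abstract can be illustrated with a generic mixture sketch. This is not the authors' model, which also covers customer and trip characteristics, time dynamics, and dependencies between motivations; it only shows how a small set of latent motivations can stand in for a full distribution over products. The function name, the two motivations, the four products, and all numbers are hypothetical.

```python
# Illustrative only: a generic mixture-of-motivations sketch, not the authors' model.
# All names and numbers below are hypothetical.

def purchase_probabilities(motivation_weights, product_dists):
    """Mix per-motivation product distributions into one distribution over products.

    motivation_weights: M non-negative weights summing to 1, one per motivation.
    product_dists: M lists, each a probability distribution over the same J products.
    """
    n_products = len(product_dists[0])
    probs = [0.0] * n_products
    for weight, dist in zip(motivation_weights, product_dists):
        for j, p in enumerate(dist):
            # Each motivation contributes its product probability, scaled by its weight.
            probs[j] += weight * p
    return probs


# Two hypothetical motivations over four hypothetical products.
product_dists = [
    [0.7, 0.2, 0.1, 0.0],  # motivation 0
    [0.0, 0.1, 0.3, 0.6],  # motivation 1
]

# Hypothetical relevance of each motivation for one shopping trip.
trip_weights = [0.25, 0.75]

probs = purchase_probabilities(trip_weights, product_dists)
```

In a sketch like this, a shopping trip is described by M motivation weights rather than by one parameter per product, which is where the reduction would come from when the assortment holds thousands of products and M is small.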

Suggested Citation

  • Bruno Jacobs & Dennis Fok & Bas Donkers, 2021. "Understanding Large-Scale Dynamic Purchase Behavior," Marketing Science, INFORMS, vol. 40(5), pages 844-870, September.
  • Handle: RePEc:inm:ormksc:v:40:y:2021:i:5:p:844-870
    DOI: 10.1287/mksc.2020.1279

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/mksc.2020.1279
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ma, Liye & Sun, Baohong, 2020. "Machine learning and AI in marketing – Connecting computing power to human insights," International Journal of Research in Marketing, Elsevier, vol. 37(3), pages 481-504.
    2. Gael M. Martin & David T. Frazier & Ruben Loaiza-Maya & Florian Huber & Gary Koop & John Maheu & Didier Nibbering & Anastasios Panagiotelis, 2023. "Bayesian Forecasting in the 21st Century: A Modern Review," Monash Econometrics and Business Statistics Working Papers 1/23, Monash University, Department of Econometrics and Business Statistics.
    3. Gael M. Martin & David T. Frazier & Worapree Maneesoonthorn & Ruben Loaiza-Maya & Florian Huber & Gary Koop & John Maheu & Didier Nibbering & Anastasios Panagiotelis, 2022. "Bayesian Forecasting in Economics and Finance: A Modern Review," Papers 2212.03471, arXiv.org, revised Jul 2023.
    4. Loaiza-Maya, Rubén & Smith, Michael Stanley & Nott, David J. & Danaher, Peter J., 2022. "Fast and accurate variational inference for models with many latent variables," Journal of Econometrics, Elsevier, vol. 230(2), pages 339-362.
    5. Gael M. Martin & David T. Frazier & Christian P. Robert, 2021. "Approximating Bayes in the 21st Century," Monash Econometrics and Business Statistics Working Papers 24/21, Monash University, Department of Econometrics and Business Statistics.
    6. Bansal, Prateek & Krueger, Rico & Bierlaire, Michel & Daziano, Ricardo A. & Rashidi, Taha H., 2020. "Bayesian estimation of mixed multinomial logit models: Advances and simulation-based evaluations," Transportation Research Part B: Methodological, Elsevier, vol. 131(C), pages 124-142.
    7. Wang, Xin (Shane) & Ryoo, Jun Hyun (Joseph) & Bendle, Neil & Kopalle, Praveen K., 2021. "The role of machine learning analytics and metrics in retailing research," Journal of Retailing, Elsevier, vol. 97(4), pages 658-675.
    8. Asim Ansari & Yang Li & Jonathan Z. Zhang, 2018. "Probabilistic Topic Model for Hybrid Recommender Systems: A Stochastic Variational Bayesian Approach," Marketing Science, INFORMS, vol. 37(6), pages 987-1008, November.
    9. Gael M. Martin & David T. Frazier & Christian P. Robert, 2020. "Computing Bayes: Bayesian Computation from 1763 to the 21st Century," Monash Econometrics and Business Statistics Working Papers 14/20, Monash University, Department of Econometrics and Business Statistics.
    10. Marc R. Dotson & Joachim Büschken & Greg M. Allenby, 2020. "Explaining Preference Heterogeneity with Mixed Membership Modeling," Marketing Science, INFORMS, vol. 39(2), pages 407-426, March.
    11. Robert Donnelly & Francisco J.R. Ruiz & David Blei & Susan Athey, 2021. "Counterfactual inference for consumer choice across many product categories," Quantitative Marketing and Economics (QME), Springer, vol. 19(3), pages 369-407, December.
    12. Venkatesh Shankar & Sohil Parsana, 2022. "An overview and empirical comparison of natural language processing (NLP) models and an introduction to and empirical application of autoencoder models in marketing," Journal of the Academy of Marketing Science, Springer, vol. 50(6), pages 1324-1350, November.
    13. Schröder, Nadine & Falke, Andreas & Hruschka, Harald & Reutterer, Thomas, 2019. "Analyzing the Browsing Basket: A Latent Interests-Based Segmentation Tool," Journal of Interactive Marketing, Elsevier, vol. 47(C), pages 181-197.
    14. Sebastian Gabel & Artem Timoshenko, 2022. "Product Choice with Large Assortments: A Scalable Deep-Learning Model," Management Science, INFORMS, vol. 68(3), pages 1808-1827, March.
    15. Rico Krueger & Prateek Bansal & Michel Bierlaire & Ricardo A. Daziano & Taha H. Rashidi, 2019. "Variational Bayesian Inference for Mixed Logit Models with Unobserved Inter- and Intra-Individual Heterogeneity," Papers 1905.00419, arXiv.org, revised Jan 2020.
    16. Prateek Bansal & Rico Krueger & Michel Bierlaire & Ricardo A. Daziano & Taha H. Rashidi, 2019. "Bayesian Estimation of Mixed Multinomial Logit Models: Advances and Simulation-Based Evaluations," Papers 1904.03647, arXiv.org, revised Dec 2019.
    17. Bansal, Prateek & Krueger, Rico & Graham, Daniel J., 2021. "Fast Bayesian estimation of spatial count data models," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    18. Hyowon Kim & Greg M. Allenby, 2022. "Integrating Textual Information into Models of Choice and Scaled Response Data," Marketing Science, INFORMS, vol. 41(4), pages 815-830, July.
    19. Alex Burnap & John R. Hauser & Artem Timoshenko, 2019. "Product Aesthetic Design: A Machine Learning Augmentation," Papers 1907.07786, arXiv.org, revised Nov 2022.
    20. Daziano, Ricardo A., 2022. "Willingness to delay charging of electric vehicles," Research in Transportation Economics, Elsevier, vol. 94(C).
