IDEAS home Printed from https://ideas.repec.org/a/taf/jnlbes/v32y2014i1p14-29.html
   My bibliography  Save this article

From Amazon to Apple: Modeling Online Retail Sales, Purchase Incidence, and Visit Behavior

Author

Listed:
  • Anastasios Panagiotelis
  • Michael S. Smith
  • Peter J. Danaher

Abstract

In this study, we propose a multivariate stochastic model for Web site visit duration, page views, purchase incidence, and the sale amount for online retailers. The model is constructed by composition from carefully selected distributions and involves copula components. It allows for the strong nonlinear relationships between the sales and visit variables to be explored in detail, and can be used to construct sales predictions. The model is readily estimated using maximum likelihood, making it an attractive choice in practice given the large sample sizes that are commonplace in online retail studies. We examine a number of top-ranked U.S. online retailers, and find that the visit duration and the number of pages viewed are both related to sales, but in very different ways for different products. Using Bayesian methodology, we show how the model can be extended to a finite mixture model to account for consumer heterogeneity via latent household segmentation. The model can also be adjusted to accommodate a more accurate analysis of online retailers like apple.com that sell products at a very limited number of price points. In a validation study across a range of different Web sites, we find that the purchase incidence and sales amount are both forecast more accurately using our model, when compared to regression, probit regression, a popular data-mining method, and a survival model employed previously in an online retail study. Supplementary materials for this article are available online.

Suggested Citation

  • Anastasios Panagiotelis & Michael S. Smith & Peter J. Danaher, 2014. "From Amazon to Apple: Modeling Online Retail Sales, Purchase Incidence, and Visit Behavior," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 32(1), pages 14-29, January.
  • Handle: RePEc:taf:jnlbes:v:32:y:2014:i:1:p:14-29
    DOI: 10.1080/07350015.2013.835729
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/07350015.2013.835729
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/07350015.2013.835729?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Richard Blundell & John Ham & Costas Meghir, 1989. "Unemployment and Female Labour Supply," Palgrave Macmillan Books, in: Joan Muysken & Chris Neubourg (ed.), Unemployment in Europe, chapter 1, pages 9-36, Palgrave Macmillan.
    2. Li Mingliang & Tobias Justin L, 2006. "Bayesian Analysis of Structural Effects in an Ordered Equation System," Studies in Nonlinear Dynamics & Econometrics, De Gruyter, vol. 10(4), pages 1-24, December.
    3. McCulloch, Robert E. & Polson, Nicholas G. & Rossi, Peter E., 2000. "A Bayesian analysis of the multinomial probit model with fully identified parameters," Journal of Econometrics, Elsevier, vol. 99(1), pages 173-193, November.
    4. Murray D. Smith, 2003. "Modelling sample selection using Archimedean copulas," Econometrics Journal, Royal Economic Society, vol. 6(1), pages 99-123, June.
    5. Zhang, Xibin & King, Maxwell L. & Hyndman, Rob J., 2006. "A Bayesian approach to bandwidth selection for multivariate kernel density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 50(11), pages 3009-3031, July.
    6. Francis Vella, 1998. "Estimating Models with Sample Selection Bias: A Survey," Journal of Human Resources, University of Wisconsin Press, vol. 33(1), pages 127-169.
    7. Zhang, Xibin & Brooks, Robert D. & King, Maxwell L., 2009. "A Bayesian approach to bandwidth selection for multivariate kernel regression with an application to state-price density estimation," Journal of Econometrics, Elsevier, vol. 153(1), pages 21-32, November.
    8. Sangjoon Kim & Neil Shephard & Siddhartha Chib, 1998. "Stochastic Volatility: Likelihood Inference and Comparison with ARCH Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 65(3), pages 361-393.
    9. Cameron,A. Colin & Trivedi,Pravin K., 2005. "Microeconometrics," Cambridge Books, Cambridge University Press, number 9780521848053, September.
    10. van Hasselt, Martijn, 2011. "Bayesian inference in a sample selection model," Journal of Econometrics, Elsevier, vol. 165(2), pages 221-232.
    11. Madhu Mohanty, 2001. "Testing for the specification of the wage equation: double selection approach or single selection approach," Applied Economics Letters, Taylor & Francis Journals, vol. 8(8), pages 525-529.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Junming Liu & Weiwei Chen & Jingyuan Yang & Hui Xiong & Can Chen, 2022. "Iterative Prediction-and-Optimization for E-Logistics Distribution Network Design," INFORMS Journal on Computing, INFORMS, vol. 34(2), pages 769-789, March.
    2. Panagiotelis, Anastasios & Czado, Claudia & Joe, Harry & Stöber, Jakob, 2017. "Model selection for discrete regular vine copulas," Computational Statistics & Data Analysis, Elsevier, vol. 106(C), pages 138-152.
    3. Jian Huang & Qinyu Chen & Chengqing Yu, 2022. "A New Feature Based Deep Attention Sales Forecasting Model for Enterprise Sustainable Development," Sustainability, MDPI, vol. 14(19), pages 1-18, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rong Zhang & Brett A. Inder & Xibin Zhang, 2012. "Parameter estimation for a discrete-response model with double rules of sample selection: A Bayesian approach," Monash Econometrics and Business Statistics Working Papers 5/12, Monash University, Department of Econometrics and Business Statistics.
    2. Zhang, Rong & Inder, Brett A. & Zhang, Xibin, 2015. "Bayesian estimation of a discrete response model with double rules of sample selection," Computational Statistics & Data Analysis, Elsevier, vol. 86(C), pages 81-96.
    3. Rong Zhang & Brett A. Inder & Xibin Zhang, 2013. "Bayesian estimation of a discrete response model with double rules of sample selection," Monash Econometrics and Business Statistics Working Papers 24/13, Monash University, Department of Econometrics and Business Statistics.
    4. Guohua Feng & Chuan Wang & Xibin Zhang, 2019. "Estimation of inefficiency in stochastic frontier models: a Bayesian kernel approach," Journal of Productivity Analysis, Springer, vol. 51(1), pages 1-19, February.
    5. Hasebe, Takuya & Vijverberg, Wim P., 2012. "A Flexible Sample Selection Model: A GTL-Copula Approach," IZA Discussion Papers 7003, Institute of Labor Economics (IZA).
    6. Xibin Zhang & Maxwell L. King & Han Lin Shang, 2011. "Bayesian estimation of bandwidths for a nonparametric regression model with a flexible error density," Monash Econometrics and Business Statistics Working Papers 10/11, Monash University, Department of Econometrics and Business Statistics.
    7. Wiemann, Paul F.V. & Klein, Nadja & Kneib, Thomas, 2022. "Correcting for sample selection bias in Bayesian distributional regression models," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    8. Xibin Zhang & Maxwell L. King & Han Lin Shang, 2016. "Bayesian Bandwidth Selection for a Nonparametric Regression Model with Mixed Types of Regressors," Econometrics, MDPI, vol. 4(2), pages 1-27, April.
    9. Marra, Giampiero & Radice, Rosalba, 2013. "Estimation of a regression spline sample selection model," Computational Statistics & Data Analysis, Elsevier, vol. 61(C), pages 158-173.
    10. Didier Nibbering, 2019. "A High-dimensional Multinomial Choice Model," Monash Econometrics and Business Statistics Working Papers 19/19, Monash University, Department of Econometrics and Business Statistics.
    11. Manuel Arellano & Stéphane Bonhomme, 2017. "Quantile Selection Models With an Application to Understanding Changes in Wage Inequality," Econometrica, Econometric Society, vol. 85, pages 1-28, January.
    12. Hu, Shuowen & Poskitt, D.S. & Zhang, Xibin, 2012. "Bayesian adaptive bandwidth kernel density estimation of irregular multivariate distributions," Computational Statistics & Data Analysis, Elsevier, vol. 56(3), pages 732-740.
    13. William Griffiths & Xiaohui Zhang & Xueyan Zhao, 2010. "A Stochastic Frontier Model for Discrete Ordinal Outcomes: A Health Production Function," Department of Economics - Working Papers Series 1092, The University of Melbourne.
    14. Terence C. Cheng & Pravin K. Trivedi, 2015. "Attrition Bias in Panel Data: A Sheep in Wolf's Clothing? A Case Study Based on the Mabel Survey," Health Economics, John Wiley & Sons, Ltd., vol. 24(9), pages 1101-1117, September.
    15. Geweke, J. & Joel Horowitz & Pesaran, M.H., 2006. "Econometrics: A Bird’s Eye View," Cambridge Working Papers in Economics 0655, Faculty of Economics, University of Cambridge.
    16. Song Li & Mervyn J. Silvapulle & Param Silvapulle & Xibin Zhang, 2015. "Bayesian Approaches to Nonparametric Estimation of Densities on the Unit Interval," Econometric Reviews, Taylor & Francis Journals, vol. 34(3), pages 394-412, March.
    17. Gael M. Martin & David T. Frazier & Ruben Loaiza-Maya & Florian Huber & Gary Koop & John Maheu & Didier Nibbering & Anastasios Panagiotelis, 2023. "Bayesian Forecasting in the 21st Century: A Modern Review," Monash Econometrics and Business Statistics Working Papers 1/23, Monash University, Department of Econometrics and Business Statistics.
    18. Murray D. Smith, 2005. "Using Copulas to Model Switching Regimes with an Application to Child Labour," The Economic Record, The Economic Society of Australia, vol. 81(s1), pages 47-57, August.
    19. Zhang, Xibin & King, Maxwell L. & Shang, Han Lin, 2014. "A sampling algorithm for bandwidth estimation in a nonparametric regression model with a flexible error density," Computational Statistics & Data Analysis, Elsevier, vol. 78(C), pages 218-234.
    20. Zhang, Xibin & King, Maxwell L., 2008. "Box-Cox stochastic volatility models with heavy-tails and correlated errors," Journal of Empirical Finance, Elsevier, vol. 15(3), pages 549-566, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:jnlbes:v:32:y:2014:i:1:p:14-29. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/UBES20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.