IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v281y2020i3p612-627.html
   My bibliography  Save this article

Predicting product return volume using machine learning methods

Author

Listed:
  • Cui, Hailong
  • Rajagopalan, Sampath
  • Ward, Amy R.

Abstract

In 2015, U.S. consumers returned goods worth $261 billion and the return rates for online sales sometimes exceeded 30%. Manufacturers and retailers have an interest in predicting return volume to address operational challenges in managing product returns. In this paper, we develop and test data-driven models for predicting return volume at the retailer, product type and period levels using a rich data set comprised of detailed operations on each product, and retailer information. The goal is to achieve a good prediction accuracy out of sample. We consider main effects and detailed interaction effects models using various machine learning methods. We find that Least Absolute Shrinkage and Selection Operator (LASSO) yields a predictive model achieving the best prediction accuracy for future return volume due to its ability to select informative interaction terms out of more than one thousand possible combinations. The LASSO model also turns in consistent performance based on several robustness tests and is easy to implement in practice. Our work provides a general predictive model framework for manufacturers to track product returns.

Suggested Citation

  • Cui, Hailong & Rajagopalan, Sampath & Ward, Amy R., 2020. "Predicting product return volume using machine learning methods," European Journal of Operational Research, Elsevier, vol. 281(3), pages 612-627.
  • Handle: RePEc:eee:ejores:v:281:y:2020:i:3:p:612-627
    DOI: 10.1016/j.ejor.2019.05.046
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S037722171930517X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2019.05.046?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ilya O. Ryzhov & Bin Han & Jelena Bradić, 2016. "Cultivating Disaster Donors Using Data Analytics," Management Science, INFORMS, vol. 62(3), pages 849-866, March.
    2. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    3. Gregory A. DeCroix & Jing-Sheng Song & Paul H. Zipkin, 2009. "Managing an Assemble-to-Order System with Returns," Manufacturing & Service Operations Management, INFORMS, vol. 11(1), pages 144-159, October.
    4. Janakiraman, Narayan & Syrdal, Holly A. & Freling, Ryan, 2016. "The Effect of Return Policy Leniency on Consumer Purchase and Return Decisions: A Meta-analytic Review," Journal of Retailing, Elsevier, vol. 92(2), pages 226-235.
    5. Eric T. Anderson & Karsten Hansen & Duncan Simester, 2009. "The Option Value of Returns: Theory and Empirical Evidence," Marketing Science, INFORMS, vol. 28(3), pages 405-423, 05-06.
    6. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    7. Mehmet Sekip Altug & Tolga Aydinliyim, 2016. "Counteracting Strategic Purchase Deferrals: The Impact of Online Retailers’ Return Policy Decisions," Manufacturing & Service Operations Management, INFORMS, vol. 18(3), pages 376-392, July.
    8. Ma, Shaohui & Fildes, Robert & Huang, Tao, 2016. "Demand forecasting with high dimensional data: The case of SKU retail sales forecasting with intra- and inter-category promotional information," European Journal of Operational Research, Elsevier, vol. 249(1), pages 245-257.
    9. Tsiliyannis, Christos Aristeides, 2018. "Markov chain modeling and forecasting of product returns in remanufacturing based on stock mean-age," European Journal of Operational Research, Elsevier, vol. 271(2), pages 474-489.
    10. Guangzhi Shang & Bikram P. Ghosh & Michael R. Galbreth, 2017. "Optimal Retail Return Policies with Wardrobing," Production and Operations Management, Production and Operations Management Society, vol. 26(7), pages 1315-1332, July.
    11. Xuanming Su, 2009. "Consumer Returns Policies and Supply Chain Performance," Manufacturing & Service Operations Management, INFORMS, vol. 11(4), pages 595-612, March.
    12. Çerağ Pinçe & Mark Ferguson & Beril Toktay, 2016. "Extracting Maximum Value from Consumer Returns: Allocating Between Remarketing and Refurbishing for Warranty Claims," Manufacturing & Service Operations Management, INFORMS, vol. 18(4), pages 475-492, October.
    13. Jianqing Fan & Jinchi Lv & Lei Qi, 2011. "Sparse High-Dimensional Models in Economics," Annual Review of Economics, Annual Reviews, vol. 3(1), pages 291-317, September.
    14. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    15. Hal R. Varian, 2014. "Big Data: New Tricks for Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 3-28, Spring.
    16. Sevim, Cuneyt & Oztekin, Asil & Bali, Ozkan & Gumus, Serkan & Guresen, Erkam, 2014. "Developing an early warning system to predict currency crises," European Journal of Operational Research, Elsevier, vol. 237(3), pages 1095-1104.
    17. Oztekin, Asil & Al-Ebbini, Lina & Sevkli, Zulal & Delen, Dursun, 2018. "A decision analytic approach to predicting quality of life for lung transplant recipients: A hybrid genetic algorithms-based methodology," European Journal of Operational Research, Elsevier, vol. 266(2), pages 639-651.
    18. Mortenson, Michael J. & Doherty, Neil F. & Robinson, Stewart, 2015. "Operational research from Taylorism to Terabytes: A research agenda for the analytics age," European Journal of Operational Research, Elsevier, vol. 241(3), pages 583-595.
    19. Jeffrey D. Shulman & Anne T. Coughlan & R. Canan Savaskan, 2011. "Managing Consumer Returns in a Competitive Environment," Management Science, INFORMS, vol. 57(2), pages 347-362, February.
    20. Dimitris Bertsimas & Allison O’Hair & Stephen Relyea & John Silberholz, 2016. "An Analytics Approach to Designing Combination Chemotherapy Regimens for Cancer," Management Science, INFORMS, vol. 62(5), pages 1511-1531, May.
    21. Andre P. Calmon & Stephen C. Graves, 2017. "Inventory Management in a Consumer Electronics Closed-Loop Supply Chain," Manufacturing & Service Operations Management, INFORMS, vol. 19(4), pages 568-585, October.
    22. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Bottmer, Lea & Croux, Christophe & Wilms, Ines, 2022. "Sparse regression for large data sets with outliers," European Journal of Operational Research, Elsevier, vol. 297(2), pages 782-794.
    2. Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.
    3. He Jiang, 2023. "Robust forecasting in spatial autoregressive model with total variation regularization," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(2), pages 195-211, March.
    4. Bonaccolto, Giovanni & Caporin, Massimiliano & Maillet, Bertrand B., 2022. "Dynamic large financial networks via conditional expected shortfalls," European Journal of Operational Research, Elsevier, vol. 298(1), pages 322-336.
    5. Lin, Yizhong & Leung, Janny M.Y. & Zhang, Lianmin & Gu, Jia-Wen, 2020. "Single-item repairable inventory system with stochastic new and warranty demands," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 142(C).
    6. Fan, Huirong & Khouja, Moutaz & Zhou, Jing, 2022. "Design of win-win return policies for online retailers," European Journal of Operational Research, Elsevier, vol. 301(2), pages 675-693.
    7. El Kihal, Siham & Nurullayev, Namig & Schulze, Christian & Skiera, Bernd, 2021. "A Comparison of Return Rate Calculation Methods: Evidence from 16 Retailers," Journal of Retailing, Elsevier, vol. 97(4), pages 676-696.
    8. Ilkka Ritola & Harold Krikke & Marjolein C.J. Caniëls, 2020. "Learning from Returned Products in a Closed Loop Supply Chain: A Systematic Literature Review," Logistics, MDPI, vol. 4(2), pages 1-13, April.
    9. Wenting Pan & Candice H. Huynh, 2023. "Optimal operational strategies for online retailers with demand and return uncertainty," Operations Management Research, Springer, vol. 16(2), pages 755-767, June.
    10. Collins, Alan & Fan, Jingwen & Mahabir, Aruneema, 2022. "Actual versus ‘natural’ rates of suicide: Evidence from the USA," Economic Modelling, Elsevier, vol. 106(C).
    11. Serravalle, Francesca & Vannucci, Virginia & Pantano, Eleonora, 2022. "“Take it or leave it?†: Evidence on cultural differences affecting return behaviour for Gen Z," Journal of Retailing and Consumer Services, Elsevier, vol. 66(C).
    12. Duong, Quang Huy & Zhou, Li & Meng, Meng & Nguyen, Truong Van & Ieromonachou, Petros & Nguyen, Duy Tiep, 2022. "Understanding product returns: A systematic literature review using machine learning and bibliometric analysis," International Journal of Production Economics, Elsevier, vol. 243(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ratchford, Brian & Soysal, Gonca & Zentner, Alejandro & Gauri, Dinesh K., 2022. "Online and offline retailing: What we know and directions for future research," Journal of Retailing, Elsevier, vol. 98(1), pages 152-177.
    2. Elena Ivona DUMITRESCU & Sullivan HUE & Christophe HURLIN & Sessi TOKPAVI, 2020. "Machine Learning or Econometrics for Credit Scoring: Let’s Get the Best of Both Worlds," LEO Working Papers / DR LEO 2839, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.
    3. Necati Ertekin & Michael E. Ketzenberg & Gregory R. Heim, 2020. "Assessing Impacts of Store and Salesperson Dimensions of Retail Service Quality on Consumer Returns," Production and Operations Management, Production and Operations Management Society, vol. 29(5), pages 1232-1255, May.
    4. Hailong Cui & Sampath Rajagopalan & Amy R. Ward, 2021. "Impact of Task-Level Worker Specialization, Workload, and Product Personalization on Consumer Returns," Manufacturing & Service Operations Management, INFORMS, vol. 23(2), pages 346-366, March.
    5. Difrancesco, Rita Maria & Huchzermeier, Arnd & Schröder, David, 2018. "Optimizing the return window for online fashion retailers with closed-loop refurbishment," Omega, Elsevier, vol. 78(C), pages 205-221.
    6. Chen, Hao-Wei, 2023. "Improving supply quality through the store-initiated returns in wholesale supply chains," International Journal of Production Economics, Elsevier, vol. 261(C).
    7. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    8. Zemin Zheng & Jie Zhang & Yang Li, 2022. "L 0 -Regularized Learning for High-Dimensional Additive Hazards Regression," INFORMS Journal on Computing, INFORMS, vol. 34(5), pages 2762-2775, September.
    9. Khouja, Moutaz & Hammami, Ramzi, 2023. "Optimizing price, order quantity, and return policy in the presence of consumer opportunistic behavior for online retailers," European Journal of Operational Research, Elsevier, vol. 309(2), pages 683-703.
    10. Zichen Zhang & Ye Eun Bae & Jonathan R. Bradley & Lang Wu & Chong Wu, 2022. "SUMMIT: An integrative approach for better transcriptomic data imputation improves causal gene identification," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    11. Peter Bühlmann & Jacopo Mandozzi, 2014. "High-dimensional variable screening and bias in subsequent inference, with an empirical comparison," Computational Statistics, Springer, vol. 29(3), pages 407-430, June.
    12. Capanu, Marinela & Giurcanu, Mihai & Begg, Colin B. & Gönen, Mithat, 2023. "Subsampling based variable selection for generalized linear models," Computational Statistics & Data Analysis, Elsevier, vol. 184(C).
    13. Loann David Denis Desboulets, 2018. "A Review on Variable Selection in Regression Analysis," Econometrics, MDPI, vol. 6(4), pages 1-27, November.
    14. Zeyu Bian & Erica E. M. Moodie & Susan M. Shortreed & Sahir Bhatnagar, 2023. "Variable selection in regression‐based estimation of dynamic treatment regimes," Biometrics, The International Biometric Society, vol. 79(2), pages 988-999, June.
    15. Zanhua Yin, 2020. "Variable selection for sparse logistic regression," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 83(7), pages 821-836, October.
    16. Blazquez, Desamparados & Domenech, Josep, 2018. "Big Data sources and methods for social and economic analyses," Technological Forecasting and Social Change, Elsevier, vol. 130(C), pages 99-113.
    17. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    18. Vineet Kaushik & Ashwani Kumar & Himanshu Gupta & Gaurav Dixit, 2022. "Modelling and prioritizing the factors for online apparel return using BWM approach," Electronic Commerce Research, Springer, vol. 22(3), pages 843-873, September.
    19. Pei Wang & Shunjie Chen & Sijia Yang, 2022. "Recent Advances on Penalized Regression Models for Biological Data," Mathematics, MDPI, vol. 10(19), pages 1-24, October.
    20. Zakariya Yahya Algamal & Muhammad Hisyam Lee, 2019. "A two-stage sparse logistic regression for optimal gene selection in high-dimensional microarray data classification," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(3), pages 753-771, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:281:y:2020:i:3:p:612-627. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.