IDEAS home Printed from https://ideas.repec.org/a/bfy/oajsas/v6y2025i1p35-67id2679.html
   My bibliography  Save this article

Forecasting Retail Sales using Machine Learning Models

Author

Listed:
  • Oluwasola Oluwaseun Mustapha
  • Dr. Terry Sithole

Abstract

Purpose: This paper’s main objective is to examine common machine learning techniques and also time series analysis for sales forecasting in a bid to get the best fitted technique and give more logical hypotheses for raising future profit margins while obtaining historical in-depth understanding of prior demand utilising business intelligence software’s like Tableau or Microsoft Power BI. The outcomes are laid forth with regards to dependability as well as precision of the various forecasting models that were employed. Materials and Methods: In this project, a sales prediction is carried out on a 5 year store-item sales data for 50 different items in 10 different stores with a dataset obtained from Kaggle. This study focuses on using Machine Learning Methods including the Random Forest, Gradient Boosting Regression (XGBoost), Linear Regression and also the standard time series Autoregressive Integrated Moving Average (ARIMA) method were analysed and contrasted to measure the methods’ effectiveness for prediction of Sales. Findings: This study demonstrates the potential of machine learning algorithms in accurately forecasting sales, which can be extremely valuable for businesses in optimizing their operations, inventory management, and financial planning. By leveraging these predictive models, companies can make data-driven decisions to improve efficiency, reduce costs, and increase profitability. The findings also highlight the importance of selecting the most appropriate algorithm for a given dataset and problem, as well as the need for proper model tuning and validation to ensure reliable results. Furthermore, the study underscores the significance of understanding and interpreting error metrics like RMSE and MAE to effectively evaluate and compare model performance. Unique Contribution to Theory, Practice and Policy: Factors such as Seasonality, Trend, Promotional offers and Randomity have been known to be important factors that affect the outcome of Sales Forecasting which is why the performances of the Mean Absolute Error (MAE), the Absolute error (R2) and the Root Mean Square Error (RMSE) are all compared in the different algorithms used, to help identify the best preferred algorithm to be adopted which turned out to be the XGBoost method.

Suggested Citation

  • Oluwasola Oluwaseun Mustapha & Dr. Terry Sithole, 2025. "Forecasting Retail Sales using Machine Learning Models," American Journal of Statistics and Actuarial Sciences, AJPO Journals Limited, vol. 6(1), pages 35-67.
  • Handle: RePEc:bfy:oajsas:v:6:y:2025:i:1:p:35-67:id:2679
    as

    Download full text from publisher

    File URL: https://ajpojournals.org/journals/index.php/AJSAS/article/view/2679
    Download Restriction: Access to full texts is restricted to American Journal of Statistics and Actuarial Sciences
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Peter R. Winters, 1960. "Forecasting Sales by Exponentially Weighted Moving Averages," Management Science, INFORMS, vol. 6(3), pages 324-342, April.
    2. Bohdan M. Pavlyshenko, 2019. "Machine-Learning Models for Sales Time Series Forecasting," Data, MDPI, vol. 4(1), pages 1-11, January.
    3. Lo, Tammy, 1994. "An expert system for choosing demand forecasting techniques," International Journal of Production Economics, Elsevier, vol. 33(1-3), pages 5-15, January.
    4. Fildes, Robert & Petropoulos, Fotios, 2015. "Simple versus complex selection rules for forecasting many time series," Journal of Business Research, Elsevier, vol. 68(8), pages 1692-1701.
    5. Schaeffer, Satu Elisa & Rodriguez Sanchez, Sara Veronica, 2020. "Forecasting client retention — A machine-learning approach," Journal of Retailing and Consumer Services, Elsevier, vol. 52(C).
    6. Patrick Bajari & Denis Nekipelov & Stephen P. Ryan & Miaoyu Yang, 2015. "Machine Learning Methods for Demand Estimation," American Economic Review, American Economic Association, vol. 105(5), pages 481-485, May.
    7. Tsao, Yu-Chung & Chen, Yu-Kai & Chiu, Shih-Hao & Lu, Jye-Chyi & Vu, Thuy-Linh, 2022. "An innovative demand forecasting approach for the server industry," Technovation, Elsevier, vol. 110(C).
    8. Marshall Fisher & Ananth Raman, 2018. "Using Data and Big Data in Retailing," Production and Operations Management, Production and Operations Management Society, vol. 27(9), pages 1665-1669, September.
    9. Ruomeng Cui & Santiago Gallino & Antonio Moreno & Dennis J. Zhang, 2018. "The Operational Value of Social Media Information," Production and Operations Management, Production and Operations Management Society, vol. 27(10), pages 1749-1769, October.
    10. Ma, Shaohui & Fildes, Robert, 2021. "Retail sales forecasting with meta-learning," European Journal of Operational Research, Elsevier, vol. 288(1), pages 111-128.
    11. Sanders, Nada R. & Graman, Gregory A., 2009. "Quantifying costs of forecast errors: A case study of the warehouse environment," Omega, Elsevier, vol. 37(1), pages 116-125, February.
    12. Na Liu & Shuyun Ren & Tsan-Ming Choi & Chi-Leung Hui & Sau-Fun Ng, 2013. "Sales Forecasting for Fashion Retailing Service Industry: A Review," Mathematical Problems in Engineering, Hindawi, vol. 2013, pages 1-9, November.
    13. Wong, W.K. & Guo, Z.X., 2010. "A hybrid intelligent model for medium-term sales forecasting in fashion retail supply chains using extreme learning machine and harmony search algorithm," International Journal of Production Economics, Elsevier, vol. 128(2), pages 614-624, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Swaminathan, Kritika & Venkitasubramony, Rakesh, 2024. "Demand forecasting for fashion products: A systematic review," International Journal of Forecasting, Elsevier, vol. 40(1), pages 247-267.
    2. Fildes, Robert & Ma, Shaohui & Kolassa, Stephan, 2022. "Retail forecasting: Research and practice," International Journal of Forecasting, Elsevier, vol. 38(4), pages 1283-1318.
    3. Daniela Pencheva, 2020. "Use of Factors Related to the Consumption of Fast Moving Consumer Goods in Business Intelligence System for Managing Orders to Suppliers in Retail Chain," Izvestia Journal of the Union of Scientists - Varna. Economic Sciences Series, Union of Scientists - Varna, Economic Sciences Section, vol. 9(2), pages 124-135, August.
    4. Md. Iftekharul Alam Efat & Petr Hajek & Mohammad Zoynul Abedin & Rahat Uddin Azad & Md. Al Jaber & Shuvra Aditya & Mohammad Kabir Hassan, 2024. "Deep-learning model using hybrid adaptive trend estimated series for modelling and forecasting sales," Annals of Operations Research, Springer, vol. 339(1), pages 297-328, August.
    5. Tsao, Yu-Chung & Chen, Yu-Kai & Chiu, Shih-Hao & Lu, Jye-Chyi & Vu, Thuy-Linh, 2022. "An innovative demand forecasting approach for the server industry," Technovation, Elsevier, vol. 110(C).
    6. Xiaodan Zhu & Anh Ninh & Hui Zhao & Zhenming Liu, 2021. "Demand Forecasting with Supply‐Chain Information and Machine Learning: Evidence in the Pharmaceutical Industry," Production and Operations Management, Production and Operations Management Society, vol. 30(9), pages 3231-3252, September.
    7. Xuan Bi & Gediminas Adomavicius & William Li & Annie Qu, 2022. "Improving Sales Forecasting Accuracy: A Tensor Factorization Approach with Demand Awareness," INFORMS Journal on Computing, INFORMS, vol. 34(3), pages 1644-1660, May.
    8. Villegas, Marco A. & Pedregal, Diego J., 2019. "Automatic selection of unobserved components models for supply chain forecasting," International Journal of Forecasting, Elsevier, vol. 35(1), pages 157-169.
    9. Hartzel, Kathleen S. & Wood, Charles A., 2017. "Factors that affect the improvement of demand forecast accuracy through point-of-sale reporting," European Journal of Operational Research, Elsevier, vol. 260(1), pages 171-182.
    10. Emmanuel Sirimal Silva & Hossein Hassani & Dag Øivind Madsen & Liz Gee, 2019. "Googling Fashion: Forecasting Fashion Consumer Behaviour Using Google Trends," Social Sciences, MDPI, vol. 8(4), pages 1-23, April.
    11. Ma, Shaohui & Fildes, Robert, 2020. "Forecasting third-party mobile payments with implications for customer flow prediction," International Journal of Forecasting, Elsevier, vol. 36(3), pages 739-760.
    12. Jiayun Wang & Shanshan Wu & Qingwei Jin & Yijun Wang & Can Chen, 2024. "Identifying Popular Products at an Early Stage of Sales Season for Apparel Industry," Interfaces, INFORMS, vol. 54(3), pages 282-296, May.
    13. Dinis, Duarte & Barbosa-Póvoa, Ana & Teixeira, Ângelo Palos, 2022. "Enhancing capacity planning through forecasting: An integrated tool for maintenance of complex product systems," International Journal of Forecasting, Elsevier, vol. 38(1), pages 178-192.
    14. Ma, Shaohui & Fildes, Robert, 2021. "Retail sales forecasting with meta-learning," European Journal of Operational Research, Elsevier, vol. 288(1), pages 111-128.
    15. Sule Birim & Ipek Kazancoglu & Sachin Kumar Mangla & Aysun Kahraman & Yigit Kazancoglu, 2024. "The derived demand for advertising expenses and implications on sustainability: a comparative study using deep learning and traditional machine learning methods," Annals of Operations Research, Springer, vol. 339(1), pages 131-161, August.
    16. Messeni Petruzzelli, Antonio & Mora, Luca & Natalicchio, Angelo & Platania, Federico & Toscano Hernandez, Celina, 2024. "Consumers’ reaction to sci-fi as a source of information for technological development: An empirical analysis," Technovation, Elsevier, vol. 132(C).
    17. Robert P. Rooderkerk & Nicole DeHoratius & Andrés Musalem, 2022. "The past, present, and future of retail analytics: Insights from a survey of academic research and interviews with practitioners," Production and Operations Management, Production and Operations Management Society, vol. 31(10), pages 3727-3748, October.
    18. Petropoulos, Fotios & Goodwin, Paul & Fildes, Robert, 2017. "Using a rolling training approach to improve judgmental extrapolations elicited from forecasters with technical knowledge," International Journal of Forecasting, Elsevier, vol. 33(1), pages 314-324.
    19. Marshall Fisher & Ananth Raman, 2022. "Innovations in retail operations: Thirty years of lessons from Production and Operations Management," Production and Operations Management, Production and Operations Management Society, vol. 31(12), pages 4452-4461, December.
    20. Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bfy:oajsas:v:6:y:2025:i:1:p:35-67:id:2679. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chief Editor (email available below). General contact details of provider: https://ajpojournals.org/journals/index.php/AJSAS/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.