IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0283452.html

A tree based eXtreme Gradient Boosting (XGBoost) machine learning model to forecast the annual rice production in Bangladesh

Author

Listed:
  • Mst Noorunnahar
  • Arman Hossain Chowdhury
  • Farhana Arefeen Mila

Abstract

In this study, we attempt to anticipate annual rice production in Bangladesh (1961–2020) using both the Autoregressive Integrated Moving Average (ARIMA) and the eXtreme Gradient Boosting (XGBoost) methods and compare their respective performances. On the basis of the lowest Corrected Akaike Information Criteria (AICc) values, a significant ARIMA (0, 1, 1) model with drift was chosen based on the findings. The drift parameter value shows that the production of rice positively trends upward. Thus, the ARIMA (0, 1, 1) model with drift was found to be significant. On the other hand, the XGBoost model for time series data was developed by changing the tunning parameters frequently with the greatest result. The four prominent error measures, such as mean absolute error (MAE), mean percentage error (MPE), root mean square error (RMSE), and mean absolute percentage error (MAPE), were used to assess the predictive performance of each model. We found that the error measures of the XGBoost model in the test set were comparatively lower than those of the ARIMA model. Comparatively, the MAPE value of the test set of the XGBoost model (5.38%) was lower than that of the ARIMA model (7.23%), indicating that XGBoost performs better than ARIMA at predicting the annual rice production in Bangladesh. Hence, the XGBoost model performs better than the ARIMA model in predicting the annual rice production in Bangladesh. Therefore, based on the better performance, the study forecasted the annual rice production for the next 10 years using the XGBoost model. According to our predictions, the annual rice production in Bangladesh will vary from 57,850,318 tons in 2021 to 82,256,944 tons in 2030. The forecast indicated that the amount of rice produced annually in Bangladesh will increase in the years to come.

Suggested Citation

  • Mst Noorunnahar & Arman Hossain Chowdhury & Farhana Arefeen Mila, 2023. "A tree based eXtreme Gradient Boosting (XGBoost) machine learning model to forecast the annual rice production in Bangladesh," PLOS ONE, Public Library of Science, vol. 18(3), pages 1-15, March.
  • Handle: RePEc:plo:pone00:0283452
    DOI: 10.1371/journal.pone.0283452
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0283452
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0283452&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0283452?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Rahman, Mohammad Chhiddikur & Islam, Mohammad Ariful & Rahaman, Md Shajedur & Sarkar, Md Abdur Rouf & Ahmed, Rokib & Kabir, Md Shahjahan, 2021. "Identifying the Threshold Level of Flooding for Rice Production in Bangladesh: An Empirical Analysis," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 19(2), pages 243-250.
    2. Md Siddikur Rahman & Arman Hossain Chowdhury, 2022. "A data-driven eXtreme gradient boosting machine learning model to predict COVID-19 transmission with meteorological drivers," PLOS ONE, Public Library of Science, vol. 17(9), pages 1-14, September.
    3. Md Siddikur Rahman & Arman Hossain Chowdhury & Miftahuzzannat Amrin, 2022. "Accuracy comparison of ARIMA and XGBoost forecasting models in predicting the incidence of COVID-19 in Bangladesh," PLOS Global Public Health, Public Library of Science, vol. 2(5), pages 1-13, May.
    4. Pai, Ping-Feng & Lin, Chih-Sheng, 2005. "A hybrid ARIMA and support vector machines model in stock price forecasting," Omega, Elsevier, vol. 33(6), pages 497-505, December.
    5. Hyndman, Rob J. & Khandakar, Yeasmin, 2008. "Automatic Time Series Forecasting: The forecast Package for R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 27(i03).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Adel S. Aldosary & Baqer Al-Ramadan & Abdulla Al Kafy & Hamad Ahmed Altuwaijri & Zullyadini A. Rahaman, 2025. "Forecasting climate risk and heat stress hazards in arid ecosystems: Machine learning and ensemble models for specific humidity prediction in Dammam, Saudi Arabia," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 121(8), pages 9281-9309, May.
    2. Jihong Sun & Chen Sun & Zhaowen Li & Ye Qian & Tong Li, 2024. "Prediction method of sugarcane important phenotype data based on multi-model and multi-task," PLOS ONE, Public Library of Science, vol. 19(12), pages 1-25, December.
    3. Xiaoqing Luo, 2026. "When simplicity fails: forecasting Mainland Chinese tourist arrivals in Macao during structural breaks with a hybrid economic-search model," Asia-Pacific Journal of Regional Science, Springer, vol. 10(1), pages 1-31, March.
    4. Ghosh, Soham & Mukhoti, Sujay & Sharma, Pritee, 2025. "Quantifying rainfall-induced climate risk in rainfed agriculture: A volatility-based time series study from semi-arid India," Agricultural Water Management, Elsevier, vol. 319(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jindřich Pokora, 2017. "Hybrid ARIMA and Support Vector Regression in Short-term Electricity Price Forecasting," Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis, Mendel University Press, vol. 65(2), pages 699-708.
    2. Xiong, Tao & Li, Chongguang & Bao, Yukun, 2017. "Interval-valued time series forecasting using a novel hybrid HoltI and MSVR model," Economic Modelling, Elsevier, vol. 60(C), pages 11-23.
    3. Mirna Patricia Ponce-Flores & Jesús David Terán-Villanueva & Salvador Ibarra-Martínez & José Antonio Castán-Rocha, 2023. "Generalized Pandemic Model with COVID-19 for Early-Stage Infection Forecasting," Mathematics, MDPI, vol. 11(18), pages 1-18, September.
    4. Md Siddikur Rahman & Arman Hossain Chowdhury, 2022. "A data-driven eXtreme gradient boosting machine learning model to predict COVID-19 transmission with meteorological drivers," PLOS ONE, Public Library of Science, vol. 17(9), pages 1-14, September.
    5. Huijian Dong & Xiaomin Guo & Han Reichgelt & Ruizhi Hu, 2020. "Predictive power of ARIMA models in forecasting equity returns: a sliding window method," Journal of Asset Management, Palgrave Macmillan, vol. 21(6), pages 549-566, October.
    6. Salvatore Carta & Andrea Medda & Alessio Pili & Diego Reforgiato Recupero & Roberto Saia, 2018. "Forecasting E-Commerce Products Prices by Combining an Autoregressive Integrated Moving Average (ARIMA) Model and Google Trends Data," Future Internet, MDPI, vol. 11(1), pages 1-19, December.
    7. Rob Hyndman & Heather Booth & Farah Yasmeen, 2013. "Coherent Mortality Forecasting: The Product-Ratio Method With Functional Time Series Models," Demography, Springer;Population Association of America (PAA), vol. 50(1), pages 261-283, February.
    8. Nahapetyan Yervand, 2019. "The benefits of the Velvet Revolution in Armenia: Estimation of the short-term economic gains using deep neural networks," Central European Economic Journal, Sciendo, vol. 6(53), pages 286-303, January.
    9. Hasan Engin Duran, 2025. "The future of urban hierearchy and Zipf law: ARIMA and BATS forecasting," Letters in Spatial and Resource Sciences, Springer, vol. 18(1), pages 1-14, December.
    10. Barrow, Devon & Kourentzes, Nikolaos, 2018. "The impact of special days in call arrivals forecasting: A neural network approach to modelling special days," European Journal of Operational Research, Elsevier, vol. 264(3), pages 967-977.
    11. Felipe Leite Coelho da Silva & Josiane da Silva Cordeiro & Kleyton da Costa & Nemias Saboya & Paulo Canas Rodrigues & Javier Linkolk López-Gonzales, 2025. "Time series forecasting via integrating a filtering method: an application to electricity consumption," Computational Statistics, Springer, vol. 40(9), pages 5023-5042, December.
    12. Kamaladdin Fataliyev & Aneesh Chivukula & Mukesh Prasad & Wei Liu, 2021. "Stock Market Analysis with Text Data: A Review," Papers 2106.12985, arXiv.org, revised Jul 2021.
    13. Dombi, József & Jónás, Tamás & Tóth, Zsuzsanna Eszter, 2018. "Modeling and long-term forecasting demand in spare parts logistics businesses," International Journal of Production Economics, Elsevier, vol. 201(C), pages 1-17.
    14. Amita Gajewar & Gagan Bansal, 2016. "Revenue Forecasting for Enterprise Products," Papers 1701.06624, arXiv.org.
    15. Tao XIONG & LI Chongguang & Yukun BAO, 2017. "An improved EEMD-based hybrid approach for the short-term forecasting of hog price in China," Agricultural Economics, Czech Academy of Agricultural Sciences, vol. 63(3), pages 136-148.
    16. Pieter van der Spek & Chris Verhoef, 2014. "Balancing Time‐to‐Market and Quality in Embedded Systems," Systems Engineering, John Wiley & Sons, vol. 17(2), pages 166-192, June.
    17. Hewamalage, Hansika & Bergmeir, Christoph & Bandara, Kasun, 2021. "Recurrent Neural Networks for Time Series Forecasting: Current status and future directions," International Journal of Forecasting, Elsevier, vol. 37(1), pages 388-427.
    18. Hyndman, Rob J. & Ahmed, Roman A. & Athanasopoulos, George & Shang, Han Lin, 2011. "Optimal combination forecasts for hierarchical time series," Computational Statistics & Data Analysis, Elsevier, vol. 55(9), pages 2579-2589, September.
    19. Md Siddikur Rahman & Arman Hossain Chowdhury & Miftahuzzannat Amrin, 2022. "Accuracy comparison of ARIMA and XGBoost forecasting models in predicting the incidence of COVID-19 in Bangladesh," PLOS Global Public Health, Public Library of Science, vol. 2(5), pages 1-13, May.
    20. Kourentzes, Nikolaos & Petropoulos, Fotios & Trapero, Juan R., 2014. "Improving forecasting by estimating time series structural components across multiple frequencies," International Journal of Forecasting, Elsevier, vol. 30(2), pages 291-302.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0283452. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.