IDEAS home Printed from https://ideas.repec.org/a/eee/agisys/v187y2021ics0308521x20308775.html
   My bibliography  Save this article

Machine learning for large-scale crop yield forecasting

Author

Listed:
  • Paudel, Dilli
  • Boogaard, Hendrik
  • de Wit, Allard
  • Janssen, Sander
  • Osinga, Sjoukje
  • Pylianidis, Christos
  • Athanasiadis, Ioannis N.

Abstract

Many studies have applied machine learning to crop yield prediction with a focus on specific case studies. The data and methods they used may not be transferable to other crops and locations. On the other hand, operational large-scale systems, such as the European Commission's MARS Crop Yield Forecasting System (MCYFS), do not use machine learning. Machine learning is a promising method especially when large amounts of data are being collected and published. We combined agronomic principles of crop modeling with machine learning to build a machine learning baseline for large-scale crop yield forecasting. The baseline is a workflow emphasizing correctness, modularity and reusability. For correctness, we focused on designing explainable predictors or features (in relation to crop growth and development) and applying machine learning without information leakage. We created features using crop simulation outputs and weather, remote sensing and soil data from the MCYFS database. We emphasized a modular and reusable workflow to support different crops and countries with small configuration changes. The workflow can be used to run repeatable experiments (e.g. early season or end of season predictions) using standard input data to obtain reproducible results. The results serve as a starting point for further optimizations. In our case studies, we predicted yield at regional level for five crops (soft wheat, spring barley, sunflower, sugar beet, potatoes) and three countries (the Netherlands (NL), Germany (DE), France (FR)). We compared the performance with a simple method with no prediction skill, which either predicted a linear yield trend or the average of the training set. We also aggregated the predictions to the national level and compared with past MCYFS forecasts. The normalized RMSE (NRMSE) for early season predictions (30 days after planting) were comparable for NL (all crops), DE (all except soft wheat) and FR (soft wheat, spring barley, sunflower). For example, NRMSE was 7.87 for soft wheat (NL) (6.32 for MCYFS) and 8.21 for sugar beet (DE) (8.79 for MCYFS). In contrast, NRMSEs for soft wheat (DE), sugar beet (FR) and potatoes (FR) were twice as much compared to MCYFS. NRMSEs for end of season were still comparable to MCYFS for NL, but worse for DE and FR. The baseline can be improved by adding new data sources, designing more predictive features and evaluating different machine learning algorithms. The baseline will motivate the use of machine learning in large-scale crop yield forecasting.

Suggested Citation

  • Paudel, Dilli & Boogaard, Hendrik & de Wit, Allard & Janssen, Sander & Osinga, Sjoukje & Pylianidis, Christos & Athanasiadis, Ioannis N., 2021. "Machine learning for large-scale crop yield forecasting," Agricultural Systems, Elsevier, vol. 187(C).
  • Handle: RePEc:eee:agisys:v:187:y:2021:i:c:s0308521x20308775
    DOI: 10.1016/j.agsy.2020.103016
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0308521X20308775
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.agsy.2020.103016?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lecerf, Rémi & Ceglar, Andrej & López-Lozano, Raúl & Van Der Velde, Marijn & Baruth, Bettina, 2019. "Assessing the information in crop model and meteorological indicators to forecast crop yield over Europe," Agricultural Systems, Elsevier, vol. 168(C), pages 191-202.
    2. Jig Han Jeong & Jonathan P Resop & Nathaniel D Mueller & David H Fleisher & Kyungdahm Yun & Ethan E Butler & Dennis J Timlin & Kyo-Moon Shim & James S Gerber & Vangimalla R Reddy & Soo-Hyung Kim, 2016. "Random Forests for Global and Regional Crop Yield Predictions," PLOS ONE, Public Library of Science, vol. 11(6), pages 1-15, June.
    3. de Wit, Allard & Boogaard, Hendrik & Fumagalli, Davide & Janssen, Sander & Knapen, Rob & van Kraalingen, Daniel & Supit, Iwan & van der Wijngaart, Raymond & van Diepen, Kees, 2019. "25 years of the WOFOST cropping systems model," Agricultural Systems, Elsevier, vol. 168(C), pages 154-167.
    4. Bussay, Attila & van der Velde, Marijn & Fumagalli, Davide & Seguini, Lorenzo, 2015. "Improving operational maize yield forecasting in Hungary," Agricultural Systems, Elsevier, vol. 141(C), pages 94-106.
    5. van der Velde, M. & Nisini, L., 2019. "Performance of the MARS-crop yield forecasting system for the European Union: Assessing accuracy, in-season, and year-to-year improvements from 1993 to 2015," Agricultural Systems, Elsevier, vol. 168(C), pages 203-212.
    6. López-Lozano, Raúl & Baruth, Bettina, 2019. "An evaluation framework to build a cost-efficient crop monitoring system. Experiences from the extension of the European crop monitoring system," Agricultural Systems, Elsevier, vol. 168(C), pages 231-246.
    7. Willcock, Simon & Martínez-López, Javier & Hooftman, Danny A.P. & Bagstad, Kenneth J. & Balbi, Stefano & Marzo, Alessia & Prato, Carlo & Sciandrello, Saverio & Signorello, Giovanni & Voigt, Brian & , 2018. "Machine learning for ecosystem services," Ecosystem Services, Elsevier, vol. 33(PB), pages 165-174.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Schmidt, Lorenz & Odening, Martin & Schlanstein, Johann & Ritter, Matthias, 2021. "Estimation of the Farm-Level Yield-Weather-Relation Using Machine Learning," 61st Annual Conference, Berlin, Germany, September 22-24, 2021 317075, German Association of Agricultural Economists (GEWISOLA).
    2. Sebastian C. Ibañez & Christopher P. Monterola, 2023. "A Global Forecasting Approach to Large-Scale Crop Production Prediction with Time Series Transformers," Agriculture, MDPI, vol. 13(9), pages 1-27, September.
    3. Kalpana Jain & Naveen Choudhary, 2022. "Comparative analysis of machine learning techniques for predicting production capability of crop yield," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 13(1), pages 583-593, March.
    4. Timsina, Jagadish & Dutta, Sudarshan & Devkota, Krishna Prasad & Chakraborty, Somsubhra & Neupane, Ram Krishna & Bishta, Sudarshan & Amgain, Lal Prasad & Singh, Vinod K. & Islam, Saiful & Majumdar, Ka, 2021. "Improved nutrient management in cereals using Nutrient Expert and machine learning tools: Productivity, profitability and nutrient use efficiency," Agricultural Systems, Elsevier, vol. 192(C).
    5. Gaona, Jaime & Benito-Verdugo, Pilar & Martínez-Fernández, José & González-Zamora, Ángel & Almendra-Martín, Laura & Herrero-Jiménez, Carlos Miguel, 2023. "Predictive value of soil moisture and concurrent variables in the multivariate modelling of cereal yields in water-limited environments," Agricultural Water Management, Elsevier, vol. 282(C).
    6. Kouame, Anselme K.K. & Bindraban, Prem S. & Kissiedu, Isaac N. & Atakora, Williams K. & El Mejahed, Khalil, 2023. "Identifying drivers for variability in maize (Zea mays L.) yield in Ghana: A meta-regression approach," Agricultural Systems, Elsevier, vol. 209(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. van der Velde, M. & Nisini, L., 2019. "Performance of the MARS-crop yield forecasting system for the European Union: Assessing accuracy, in-season, and year-to-year improvements from 1993 to 2015," Agricultural Systems, Elsevier, vol. 168(C), pages 203-212.
    2. Tiecheng Bai & Nannan Zhang & Youqi Chen & Benoit Mercatoris, 2019. "Assessing the Performance of the WOFOST Model in Simulating Jujube Fruit Tree Growth under Different Irrigation Regimes," Sustainability, MDPI, vol. 11(5), pages 1-16, March.
    3. Mahboobe Ghobadi & Mahdi Gheysari & Mohammad Shayannejad & Hamze Dokoohaki, 2023. "Analyzing the Effects of Planting Date on the Uncertainty of CERES-Maize and Its Potential to Reduce Yield Gap in Arid and Mediterranean Climates," Agriculture, MDPI, vol. 13(8), pages 1-17, July.
    4. Ahmed, Moiz Uddin & Hussain, Iqbal, 2022. "Prediction of Wheat Production Using Machine Learning Algorithms in northern areas of Pakistan," Telecommunications Policy, Elsevier, vol. 46(6).
    5. Agudelo, César Augusto Ruiz & Bustos, Sandra Liliana Hurtado & Moreno, Carmen Alicia Parrado, 2020. "Modeling interactions among multiple ecosystem services. A critical review," Ecological Modelling, Elsevier, vol. 429(C).
    6. Bohan, David & Schmucki, Reto & Abay, Abrha & Termansen, Mette & Bane, Miranda & Charalabiis, Alice & Cong, Rong-Gang & Derocles, Stephane & Dorner, Zita & Forster, Matthieu & Gibert, Caroline & Harro, 2020. "Designing farmer-acceptable rotations that assure ecosystem service provision inthe face of climate change," MPRA Paper 112313, University Library of Munich, Germany.
    7. Lorilla, Roxanne Suzette & Poirazidis, Konstantinos & Detsis, Vassilis & Kalogirou, Stamatis & Chalkias, Christos, 2020. "Socio-ecological determinants of multiple ecosystem services on the Mediterranean landscapes of the Ionian Islands (Greece)," Ecological Modelling, Elsevier, vol. 422(C).
    8. Schmidt, Lorenz & Odening, Martin & Schlanstein, Johann & Ritter, Matthias, 2022. "Exploring the weather-yield nexus with artificial neural networks," Agricultural Systems, Elsevier, vol. 196(C).
    9. Gniewko Niedbała, 2019. "Application of Artificial Neural Networks for Multi-Criteria Yield Prediction of Winter Rapeseed," Sustainability, MDPI, vol. 11(2), pages 1-13, January.
    10. Cai, Liping & Wang, Hui & Liu, Yanxu & Fan, Donglin & Li, Xiaoxiao, 2022. "Is potential cultivated land expanding or shrinking in the dryland of China? Spatiotemporal evaluation based on remote sensing and SVM," Land Use Policy, Elsevier, vol. 112(C).
    11. Serra, J. & Paredes, P. & Cordovil, CMdS & Cruz, S. & Hutchings, NJ & Cameira, MR, 2023. "Is irrigation water an overlooked source of nitrogen in agriculture?," Agricultural Water Management, Elsevier, vol. 278(C).
    12. Indy Man Kit Ho & Anthony Weldon & Jason Tze Ho Yong & Candy Tze Tim Lam & Jaime Sampaio, 2023. "Using Machine Learning Algorithms to Pool Data from Meta-Analysis for the Prediction of Countermovement Jump Improvement," IJERPH, MDPI, vol. 20(10), pages 1-15, May.
    13. Signorello, Giovanni & Prato, Carlo & Marzo, Alessia & Ientile, Renzo & Cucuzza, Giuseppe & Sciandrello, Saverio & Martínez-López, Javier & Balbi, Stefano & Villa, Ferdinando, 2018. "Are protected areas covering important biodiversity sites? An assessment of the nature protection network in Sicily (Italy)," Land Use Policy, Elsevier, vol. 78(C), pages 593-602.
    14. Richards, Daniel Rex & Lavorel, Sandra, 2022. "Integrating social media data and machine learning to analyse scenarios of landscape appreciation," Ecosystem Services, Elsevier, vol. 55(C).
    15. Florian Schierhorn & Max Hofmann & Taras Gagalyuk & Igor Ostapchuk & Daniel Müller, 2021. "Machine learning reveals complex effects of climatic means and weather extremes on wheat yields during different plant developmental stages," Climatic Change, Springer, vol. 169(3), pages 1-19, December.
    16. Devkota, Mina & Yigezu, Yigezu Atnafe, 2020. "Explaining yield and gross margin gaps for sustainable intensification of the wheat-based systems in a Mediterranean climate," Agricultural Systems, Elsevier, vol. 185(C).
    17. Yan, Ling & Jin, Jiming & Wu, Pute, 2020. "Impact of parameter uncertainty and water stress parameterization on wheat growth simulations using CERES-Wheat with GLUE," Agricultural Systems, Elsevier, vol. 181(C).
    18. repec:eur:ejfejr:25 is not listed on IDEAS
    19. Puyu Feng & Bin Wang & De Li Liu & Hongtao Xing & Fei Ji & Ian Macadam & Hongyan Ruan & Qiang Yu, 2018. "Impacts of rainfall extremes on wheat yield in semi-arid cropping systems in eastern Australia," Climatic Change, Springer, vol. 147(3), pages 555-569, April.
    20. Shine, P. & Scully, T. & Upton, J. & Murphy, M.D., 2019. "Annual electricity consumption prediction and future expansion analysis on dairy farms using a support vector machine," Applied Energy, Elsevier, vol. 250(C), pages 1110-1119.
    21. Brombacher, Joost & Silva, Isadora Rezende de Oliveira & Degen, Jelle & Pelgrum, Henk, 2022. "A novel evapotranspiration based irrigation quantification method using the hydrological similar pixels algorithm," Agricultural Water Management, Elsevier, vol. 267(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:agisys:v:187:y:2021:i:c:s0308521x20308775. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/agsy .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.