IDEAS home Printed from https://ideas.repec.org/a/eee/chsofr/v139y2020ics0960077920304525.html
   My bibliography  Save this article

An empirical overview of nonlinearity and overfitting in machine learning using COVID-19 data

Author

Listed:
  • Peng, Yaohao
  • Nagata, Mateus Hiro

Abstract

In this paper, we applied support vector regression to predict the number of COVID-19 cases for the 12 most-affected countries, testing for different structures of nonlinearity using Kernel functions and analyzing the sensitivity of the models’ predictive performance to different hyperparameters settings using 3-D interpolated surfaces. In our experiment, the model that incorporates the highest degree of nonlinearity (Gaussian Kernel) had the best in-sample performance, but also yielded the worst out-of-sample predictions, a typical example of overfitting in a machine learning model. On the other hand, the linear Kernel function performed badly in-sample but generated the best out-of-sample forecasts. The findings of this paper provide an empirical assessment of fundamental concepts in data analysis and evidence the need for caution when applying machine learning models to support real-world decision making, notably with respect to the challenges arising from the COVID-19 pandemics.

Suggested Citation

  • Peng, Yaohao & Nagata, Mateus Hiro, 2020. "An empirical overview of nonlinearity and overfitting in machine learning using COVID-19 data," Chaos, Solitons & Fractals, Elsevier, vol. 139(C).
  • Handle: RePEc:eee:chsofr:v:139:y:2020:i:c:s0960077920304525
    DOI: 10.1016/j.chaos.2020.110055
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0960077920304525
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.chaos.2020.110055?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Wang, Shaojie & He, Shaobo & Yousefpour, Amin & Jahanshahi, Hadi & Repnik, Robert & Perc, Matjaž, 2020. "Chaos and complexity in a fractional-order financial system with time delays," Chaos, Solitons & Fractals, Elsevier, vol. 131(C).
    2. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    3. Chimmula, Vinay Kumar Reddy & Zhang, Lei, 2020. "Time series forecasting of COVID-19 transmission in Canada using LSTM networks," Chaos, Solitons & Fractals, Elsevier, vol. 135(C).
    4. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. João Gabriel Moraes Souza & Daniel Tavares Castro & Yaohao Peng & Ivan Ricardo Gartner, 2024. "A Machine Learning-Based Analysis on the Causality of Financial Stress in Banking Institutions," Computational Economics, Springer;Society for Computational Economics, vol. 64(3), pages 1857-1890, September.
    2. James Ming Chen & Mira Zovko & Nika Šimurina & Vatroslav Zovko, 2021. "Fear in a Handful of Dust: The Epidemiological, Environmental, and Economic Drivers of Death by PM 2.5 Pollution," IJERPH, MDPI, vol. 18(16), pages 1-59, August.
    3. Gerardo Alfonso Perez & Raquel Castillo, 2023. "Categorical Variable Mapping Considerations in Classification Problems: Protein Application," Mathematics, MDPI, vol. 11(2), pages 1-26, January.
    4. Alireza Tavakolian & Alireza Rezaee & Farshid Hajati & Shahadat Uddin, 2023. "Hospital Readmission and Length-of-Stay Prediction Using an Optimized Hybrid Deep Model," Future Internet, MDPI, vol. 15(9), pages 1-21, September.
    5. Peng, Yaohao & de Moraes Souza, João Gabriel, 2024. "Chaos, overfitting and equilibrium: To what extent can machine learning beat the financial market?," International Review of Financial Analysis, Elsevier, vol. 95(PB).
    6. Wenhui Ke & Yimin Lu, 2024. "Ensemble Prediction Method Based on Decomposition–Reconstitution–Integration for COVID-19 Outbreak Prediction," Mathematics, MDPI, vol. 12(3), pages 1-20, February.
    7. Tayarani N., Mohammad-H., 2021. "Applications of artificial intelligence in battling against covid-19: A literature review," Chaos, Solitons & Fractals, Elsevier, vol. 142(C).
    8. Szczygielski, Jan Jakub & Charteris, Ailie & Bwanya, Princess Rutendo & Brzeszczyński, Janusz, 2023. "Which COVID-19 information really impacts stock markets?," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 84(C).
    9. Xiaojin Xie & Kangyang Luo & Zhixiang Yin & Guoqiang Wang, 2021. "Nonlinear Combinational Dynamic Transmission Rate Model and Its Application in Global COVID-19 Epidemic Prediction and Analysis," Mathematics, MDPI, vol. 9(18), pages 1-17, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shuangshuang Fan & Yichao Li & William Mbanyele & Xiufeng Lai, 2025. "Determinants and Pathways for Inclusive Growth in China: Investigation Based on Artificial Intelligence (AI) Algorithm," Computational Economics, Springer;Society for Computational Economics, vol. 65(3), pages 1231-1264, March.
    2. Andrii Babii & Ryan T. Ball & Eric Ghysels & Jonas Striaukas, 2024. "Panel data nowcasting: The case of price–earnings ratios," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(2), pages 292-307, March.
    3. Bakalli, Gaetan & Guerrier, Stéphane & Scaillet, Olivier, 2023. "A penalized two-pass regression to predict stock returns with time-varying risk premia," Journal of Econometrics, Elsevier, vol. 237(2).
    4. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    5. Tobias Götze & Marc Gürtler & Eileen Witowski, 2020. "Improving CAT bond pricing models via machine learning," Journal of Asset Management, Palgrave Macmillan, vol. 21(5), pages 428-446, September.
    6. Wen, Danyan & Liu, Li & Wang, Yudong & Zhang, Yaojie, 2022. "Forecasting crude oil market returns: Enhanced moving average technical indicators," Resources Policy, Elsevier, vol. 76(C).
    7. Daníelsson, Jón & Macrae, Robert & Uthemann, Andreas, 2022. "Artificial intelligence and systemic risk," Journal of Banking & Finance, Elsevier, vol. 140(C).
    8. Cong Wang, 2024. "Stock return prediction with multiple measures using neural network models," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 10(1), pages 1-34, December.
    9. Liu, Yunting & Zhu, Yandi, 2025. "Good idiosyncratic volatility, bad idiosyncratic volatility, and the cross-section of stock returns," Journal of Banking & Finance, Elsevier, vol. 170(C).
    10. Guo, Li & Sang, Bo & Tu, Jun & Wang, Yu, 2024. "Cross-cryptocurrency return predictability," Journal of Economic Dynamics and Control, Elsevier, vol. 163(C).
    11. Cao, Sean & Jiang, Wei & Wang, Junbo & Yang, Baozhong, 2024. "From Man vs. Machine to Man + Machine: The art and AI of stock analyses," Journal of Financial Economics, Elsevier, vol. 160(C).
    12. Rad, Hossein & Low, Rand Kwong Yew & Miffre, Joëlle & Faff, Robert, 2023. "The commodity risk premium and neural networks," Journal of Empirical Finance, Elsevier, vol. 74(C).
    13. Chen, Andrew Y. & McCoy, Jack, 2024. "Missing values handling for machine learning portfolios," Journal of Financial Economics, Elsevier, vol. 155(C).
    14. Avramov, D. & Ge, S. & Li, S. & Linton, O. B., 2025. "Dual Industry Effects and Cross-Stock Predictability," Janeway Institute Working Papers 2506, Faculty of Economics, University of Cambridge.
    15. Shunyao Wang & Ming Cheng & Christina Dan Wang, 2025. "NewsNet-SDF: Stochastic Discount Factor Estimation with Pretrained Language Model News Embeddings via Adversarial Networks," Papers 2505.06864, arXiv.org.
    16. Tse, Tiffany Tsz Kwan & Hanaki, Nobuyuki & Mao, Bolin, 2024. "Beware the performance of an algorithm before relying on it: Evidence from a stock price forecasting experiment," Journal of Economic Psychology, Elsevier, vol. 102(C).
    17. Christian Fieberg & Daniel Metko & Thorsten Poddig & Thomas Loy, 2023. "Machine learning techniques for cross-sectional equity returns’ prediction," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 45(1), pages 289-323, March.
    18. Doğan, Murat & Sayılır, Özlem & Komath, Muhammed Aslam Chelery & Çimen, Emre, 2025. "Prediction of market value of firms with corporate sustainability performance data using machine learning models," Finance Research Letters, Elsevier, vol. 77(C).
    19. Bryan Kelly & Semyon Malamud & Kangying Zhou, 2024. "The Virtue of Complexity in Return Prediction," Journal of Finance, American Finance Association, vol. 79(1), pages 459-503, February.
    20. Jiajun Gu & Zichen Yang & Xintong Lin & Sixun Chen & YuTing Lu, 2024. "AI-Enhanced Factor Analysis for Predicting S&P 500 Stock Dynamics," Papers 2412.12438, arXiv.org.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:chsofr:v:139:y:2020:i:c:s0960077920304525. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Thayer, Thomas R. (email available below). General contact details of provider: https://www.journals.elsevier.com/chaos-solitons-and-fractals .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.