IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0299811.html

Analysis of learning curves in predictive modeling using exponential curve fitting with an asymptotic approach

Author

Listed:
  • Leonardo Silva Vianna
  • Alexandre Leopoldo Gonçalves
  • João Artur Souza

Abstract

The existence of large volumes of data has considerably alleviated concerns regarding the availability of sufficient data instances for machine learning experiments. Nevertheless, in certain contexts, addressing limited data availability may demand distinct strategies and efforts. Analyzing COVID-19 predictions at the beginning of the pandemic raised a question: how much data is needed to make reliable predictions? When does the volume of data provide a better understanding of the disease’s evolution and, in turn, offer reliable forecasts? Given these questions, the objective of this study is to analyze learning curves obtained from predicting the incidence of COVID-19 in Brazilian States using ARIMA models with limited available data. To fulfill this objective, a retrospective exploration of COVID-19 incidence across the Brazilian States was performed. After data acquisition and modeling, the model errors were assessed through a learning curve analysis. Asymptotic exponential curve fitting enabled the evaluation of the errors at different points, reflecting the increase in available data over time. For a comprehensive understanding of the results at distinct stages of the time evolution, the average derivative of the curves and the equilibrium points were calculated, with the aim of identifying the convergence of the ARIMA models to a stable pattern. We observed differences in average derivatives and equilibrium values among the multiple samples. While both metrics ultimately confirmed the convergence to stability, the equilibrium points were more sensitive to changes in the models’ accuracy and provided a better indication of the learning progress. The proposed method for constructing learning curves enabled consistent monitoring of prediction results, providing the evidence-based understanding required for informed decision-making.
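
The abstract does not state the functional form or the exact definitions used, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes the error curve has the form e(t) = a + b*exp(-c*t), takes the "equilibrium point" to be the asymptote a, and takes the "average derivative" to be the mean slope of the fitted curve over an interval. The function names, the search grid for c, and the synthetic error values are all hypothetical.

```python
import math


def fit_asymptotic_exponential(t, y, rates=None):
    """Fit y ~ a + b*exp(-c*t).

    For each candidate decay rate c, the model is linear in (a, b), so a and b
    come from ordinary least squares; the c with the smallest squared error wins.
    """
    if rates is None:
        # Hypothetical search grid for c (an assumption, not from the paper).
        rates = [0.001 * k for k in range(1, 2001)]
    n = len(t)
    mean_y = sum(y) / n
    best = None
    for c in rates:
        x = [math.exp(-c * ti) for ti in t]
        mean_x = sum(x) / n
        sxx = sum((xi - mean_x) ** 2 for xi in x)
        if sxx == 0.0:
            continue  # degenerate regressor, skip this rate
        b = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)) / sxx
        a = mean_y - b * mean_x
        sse = sum((a + b * xi - yi) ** 2 for xi, yi in zip(x, y))
        if best is None or sse < best[0]:
            best = (sse, a, b, c)
    _, a, b, c = best
    return a, b, c


def average_derivative(a, b, c, t0, t1):
    """Mean slope of the fitted curve on [t0, t1] (assumed definition)."""
    def curve(s):
        return a + b * math.exp(-c * s)
    return (curve(t1) - curve(t0)) / (t1 - t0)


# Synthetic forecast errors standing in for a learning curve (not real data).
t = list(range(1, 31))
y = [2.0 + 5.0 * math.exp(-0.2 * ti) for ti in t]

a, b, c = fit_asymptotic_exponential(t, y)
print("assumed equilibrium (asymptote a):", a)
print("average derivative, early window:", average_derivative(a, b, c, 1, 10))
print("average derivative, late window:", average_derivative(a, b, c, 20, 30))
```

Under these assumptions, an average derivative that approaches zero in later windows and a fitted asymptote that stops changing as more observations are added would both indicate that the errors are settling toward a stable level.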

Suggested Citation

  • Leonardo Silva Vianna & Alexandre Leopoldo Gonçalves & João Artur Souza, 2024. "Analysis of learning curves in predictive modeling using exponential curve fitting with an asymptotic approach," PLOS ONE, Public Library of Science, vol. 19(4), pages 1-23, April.
  • Handle: RePEc:plo:pone00:0299811
    DOI: 10.1371/journal.pone.0299811

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0299811
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0299811&type=printable
    Download Restriction: no

