IDEAS home Printed from https://ideas.repec.org/p/cte/wsrepe/28630.html
   My bibliography  Save this paper

Out-of-sample prediction in multidimensional P-spline models

Author

Listed:
  • Carballo González, Alba
  • Durbán Reguera, María Luz
  • Lee, Dae-Jin

Abstract

Prediction of out-of-sample values is a problem of interest in any regression model. In the context of penalized smooth mixed model regression Carballo et al. (2017) have proposed a general framework for prediction in additive models without interaction terms. The aim of this paper is to extend this work, based on the methodology proposed in Currie et al. (2004), to models that include interaction terms, i.e. prediction is needed in multidimensional setting. Our approach fits the data and predicts the new observations simultaneously and uses constraints to ensure a coherent fit or to impose further restrictions on the predictions. We also develop this methodology for the so called smooth-ANOVA models which allow us to include interaction terms that can be decomposed as a sum of several smooth functions. To illustrate the methodology two real data sets are used, one to predict log mortality rates in the Spanish population and another to predict aboveground biomass in Populus trees as a smooth function of height and diameter. We examine the performance of the interaction models in comparison to the Smooth-ANOVA models (both models with and without the restriction the fit has to be maintained) through a simulation study.

Suggested Citation

  • Carballo González, Alba & Durbán Reguera, María Luz & Lee, Dae-Jin, 2019. "Out-of-sample prediction in multidimensional P-spline models," DES - Working Papers. Statistics and Econometrics. WS 28630, Universidad Carlos III de Madrid. Departamento de Estadística.
  • Handle: RePEc:cte:wsrepe:28630
    as

    Download full text from publisher

    File URL: https://e-archivo.uc3m.es/bitstream/handle/10016/28630/ws201910.pdf?sequence=1
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. I. D. Currie & M. Durban & P. H. C. Eilers, 2006. "Generalized linear array models with applications to multidimensional smoothing," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(2), pages 259-280, April.
    2. Simon N. Wood, 2006. "Low-Rank Scale-Invariant Tensor Product Smooths for Generalized Additive Mixed Models," Biometrics, The International Biometric Society, vol. 62(4), pages 1025-1036, December.
    3. Carballo González, Alba & Durbán Reguera, María Luz & Lee, Dae-Jin, 2017. "A general framework for prediction in penalized regression," DES - Working Papers. Statistics and Econometrics. WS 24607, Universidad Carlos III de Madrid. Departamento de Estadística.
    4. Greene, William H & Seaks, Terry G, 1991. "The Restricted Least Squares Estimator: A Pedagogical Note," The Review of Economics and Statistics, MIT Press, vol. 73(3), pages 563-567, August.
    5. Hyndman, Rob J. & Koehler, Anne B., 2006. "Another look at measures of forecast accuracy," International Journal of Forecasting, Elsevier, vol. 22(4), pages 679-688.
    6. Camarda, Carlo G., 2012. "MortalitySmooth: An R Package for Smoothing Poisson Counts with P-Splines," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 50(i01).
    7. M. P. Wand, 2003. "Smoothing and mixed models," Computational Statistics, Springer, vol. 18(2), pages 223-249, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alba Carballo & María Durbán & Dae-Jin Lee, 2021. "Out-of-Sample Prediction in Multidimensional P-Spline Models," Mathematics, MDPI, vol. 9(15), pages 1-23, July.
    2. Lee, Dae-Jin & Durbán, María, 2008. "Smooth-car mixed models for spatial count data," DES - Working Papers. Statistics and Econometrics. WS ws085820, Universidad Carlos III de Madrid. Departamento de Estadística.
    3. Lee, Dae-Jin & Durbán, María, 2009. "Smooth-CAR mixed models for spatial count data," Computational Statistics & Data Analysis, Elsevier, vol. 53(8), pages 2968-2979, June.
    4. Lee, Dae-Jin & Durbán, María, 2009. "P-spline anova-type interaction models for spatio-temporal smoothing," DES - Working Papers. Statistics and Econometrics. WS ws093312, Universidad Carlos III de Madrid. Departamento de Estadística.
    5. Ahbab Mohammad Fazle Rabbi & Stefano Mazzuco, 2021. "Mortality Forecasting with the Lee–Carter Method: Adjusting for Smoothing and Lifespan Disparity," European Journal of Population, Springer;European Association for Population Studies, vol. 37(1), pages 97-120, March.
    6. Lee, Wang-Sheng, 2014. "Is the BMI a Relic of the Past?," IZA Discussion Papers 8637, Institute of Labor Economics (IZA).
    7. Militino, A.F. & Goicoa, T. & Ugarte, M.D., 2012. "Estimating the percentage of food expenditure in small areas using bias-corrected P-spline based estimators," Computational Statistics & Data Analysis, Elsevier, vol. 56(10), pages 2934-2948.
    8. Basile, Roberto & Durbán, María & Mínguez, Román & María Montero, Jose & Mur, Jesús, 2014. "Modeling regional economic dynamics: Spatial dependence, spatial heterogeneity and nonlinearities," Journal of Economic Dynamics and Control, Elsevier, vol. 48(C), pages 229-245.
    9. Gioldasis, Georgios & Musolesi, Antonio & Simioni, Michel, 2023. "Interactive R&D spillovers: An estimation strategy based on forecasting-driven model selection," International Journal of Forecasting, Elsevier, vol. 39(1), pages 144-169.
    10. María Xosé Rodríguez‐Álvarez & María Durbán & Paul H.C. Eilers & Dae‐Jin Lee & Francisco Gonzalez, 2023. "Multidimensional adaptive P‐splines with application to neurons' activity studies," Biometrics, The International Biometric Society, vol. 79(3), pages 1972-1985, September.
    11. Lee, Dae-Jin & Durbán, María, 2012. "Seasonal modulation mixed models for time series forecasting," DES - Working Papers. Statistics and Econometrics. WS ws122519, Universidad Carlos III de Madrid. Departamento de Estadística.
    12. Simon N. Wood & Zheyuan Li & Gavin Shaddick & Nicole H. Augustin, 2017. "Generalized Additive Models for Gigadata: Modeling the U.K. Black Smoke Network Daily Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1199-1210, July.
    13. Lee, Wang-Sheng & McKinnish, Terra, 2019. "Locus of control and marital satisfaction: Couple perspectives using Australian data," Journal of Economic Psychology, Elsevier, vol. 74(C).
    14. Mariola Sánchez-González & María Durbán & Dae-Jin Lee & Isabel Cañellas & Hortensia Sixto, 2017. "Smooth additive mixed models for predicting aboveground biomass," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 22(1), pages 23-41, March.
    15. Lingbing Feng & Yanlin Shi, 2018. "Forecasting mortality rates: multivariate or univariate models?," Journal of Population Research, Springer, vol. 35(3), pages 289-318, September.
    16. Lee, Dae-Jin & Durbán, María & Eilers, Paul, 2013. "Efficient two-dimensional smoothing with P-spline ANOVA mixed models and nested bases," Computational Statistics & Data Analysis, Elsevier, vol. 61(C), pages 22-37.
    17. Georgios Gioldasis & Antonio Musolesi & Michel Simioni, 2021. "Interactive R&D Spillovers: An estimation strategy based on forecasting-driven model selection," SEEDS Working Papers 0621, SEEDS, Sustainability Environmental Economics and Dynamics Studies, revised Jun 2021.
    18. Carlo Giovanni Camarda, 2019. "Smooth constrained mortality forecasting," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 41(38), pages 1091-1130.
    19. Georgios Gioldasis & Antonio Musolesi & Michel Simioni, 2021. "Interactive R&D Spillovers: an estimation strategy based on forecasting-driven model selection," Working Papers hal-03224910, HAL.
    20. Lee, Wang-Sheng, 2014. "Big and Tall: Is there a Height Premium or Obesity Penalty in the Labor Market?," IZA Discussion Papers 8606, Institute of Labor Economics (IZA).

    More about this item

    Keywords

    Prediction;

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cte:wsrepe:28630. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Ana Poveda (email available below). General contact details of provider: http://portal.uc3m.es/portal/page/portal/dpto_estadistica .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.