Printed from https://ideas.repec.org/a/wly/jnljam/v2025y2025i1n3904251.html

Comparison of Semiparametric Models in the Presence of Noise and Outliers

Authors
  • Daniel Edinam Wormenor
  • Sampson Twumasi-Ankrah
  • Accam Burnett Tetteh

Abstract

Various studies have examined generalized additive models (GAMs), comparing thin plate splines (tp), P‐splines (ps), cubic regression splines (cr), and Gaussian processes (gp) for discrete choice data, for function approximation, and in the presence of multicollinearity and outliers. Some studies have applied ps to models with correlated and heteroscedastic errors, while others have reviewed multiple smoothing-term packages for fitting GAMs. This study uses simulation to examine the performance of semiparametric models under different types of noise and in the presence of outliers, within the GAM framework. Four GAM smoothers (cr, ps, tp, and gp) were fitted to simulated data with different noise types, with outliers, and with varying sample sizes. In our simulations, the cr model performs well in terms of deviance for the majority of sample sizes and all types of noise. With larger sample sizes, the ps model frequently performs well, particularly in terms of AIC and GCV under heteroscedastic and Gaussian noise. The gp model excels with the smallest sample size under Gaussian and lognormal noise in terms of GCV, and the tp model frequently performs best under exponential and lognormal noise for larger samples in terms of AIC and GCV. For data containing outliers, the cr and tp models are effective with smaller sample sizes, while the gp model excels with larger sample sizes based on AIC and GCV. Regarding deviance, the cr model consistently performs best across all sample sizes. Our results show that the sample size and the type of noise in the data have a significant impact on the smoothing model’s performance. No single model consistently outperforms the others for all noise types and sample sizes, suggesting that the choice of model should be based on the specific goal of a study.
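
The kind of simulation the abstract describes can be illustrated with a minimal Python sketch. It is not the authors' code: the test signal, the noise parameters, and the helper names `simulate` and `penalized_spline_scores` are all hypothetical, and a single truncated-linear penalized spline stands in for the cr, ps, tp, and gp smoothers. Deviance is taken as the residual sum of squares, AIC as the Gaussian form up to an additive constant, and GCV as n·RSS/(n − edf)², which may differ from the definitions used in the paper.

```python
import numpy as np


def simulate(n, noise, rng):
    """Draw (x, y): a smooth signal plus one noise type (all settings hypothetical)."""
    x = np.sort(rng.uniform(0.0, 1.0, n))
    signal = np.sin(2.0 * np.pi * x)
    if noise == "gaussian":
        eps = rng.normal(0.0, 0.3, n)
    elif noise == "heteroscedastic":
        eps = rng.normal(0.0, 0.1 + 0.5 * x, n)  # spread grows with x
    elif noise == "exponential":
        eps = rng.exponential(0.3, n) - 0.3  # centred to mean zero
    elif noise == "lognormal":
        # mean of lognormal(0, 0.5) is exp(0.5**2 / 2) = exp(0.125)
        eps = rng.lognormal(0.0, 0.5, n) - np.exp(0.125)
    else:
        raise ValueError(f"unknown noise type: {noise}")
    return x, signal + eps


def penalized_spline_scores(x, y, n_knots=10, lam=1.0):
    """Fit a truncated-linear penalized spline; return (deviance, AIC, GCV, edf)."""
    # Interior knots at equally spaced quantiles of x.
    knots = np.quantile(x, np.linspace(0.0, 1.0, n_knots + 2)[1:-1])
    X = np.column_stack(
        [np.ones_like(x), x, np.maximum(x[:, None] - knots[None, :], 0.0)]
    )
    # Ridge penalty on the knot coefficients only; intercept and slope are free.
    D = np.diag([0.0, 0.0] + [1.0] * n_knots)
    A = X.T @ X + lam * D
    beta = np.linalg.solve(A, X.T @ y)
    edf = float(np.trace(np.linalg.solve(A, X.T @ X)))  # trace of the hat matrix
    rss = float(np.sum((y - X @ beta) ** 2))  # Gaussian deviance
    n = len(y)
    aic = n * np.log(rss / n) + 2.0 * edf  # up to an additive constant
    gcv = n * rss / (n - edf) ** 2
    return rss, aic, gcv, edf


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    for noise in ["gaussian", "heteroscedastic", "exponential", "lognormal"]:
        for n in [50, 200, 1000]:
            x, y = simulate(n, noise, rng)
            dev, aic, gcv, edf = penalized_spline_scores(x, y)
            print(f"{noise:>16} n={n:<5} dev={dev:.2f} AIC={aic:.1f} GCV={gcv:.4f}")
```

A fuller comparison would repeat each noise type and sample size over many replicates, fit each of the four smoothers, and tabulate which one attains the lowest deviance, AIC, or GCV most often.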

Suggested Citation

  • Daniel Edinam Wormenor & Sampson Twumasi-Ankrah & Accam Burnett Tetteh, 2025. "Comparison of Semiparametric Models in the Presence of Noise and Outliers," Journal of Applied Mathematics, John Wiley & Sons, vol. 2025(1).
  • Handle: RePEc:wly:jnljam:v:2025:y:2025:i:1:n:3904251
    DOI: 10.1155/jama/3904251

    Download full text from publisher

    File URL: https://doi.org/10.1155/jama/3904251
    Download Restriction: no


    References listed on IDEAS

    1. Ruppert,David & Wand,M. P. & Carroll,R. J., 2003. "Semiparametric Regression," Cambridge Books, Cambridge University Press, number 9780521780506, August.
    2. Ruppert,David & Wand,M. P. & Carroll,R. J., 2003. "Semiparametric Regression," Cambridge Books, Cambridge University Press, number 9780521785167, August.
    3. Inyoung Kim & Noah D. Cohen & Raymond J. Carroll, 2003. "Semiparametric Regression Splines in Matched Case-Control Studies," Biometrics, The International Biometric Society, vol. 59(4), pages 1158-1169, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Otto-Sobotka, Fabian & Salvati, Nicola & Ranalli, Maria Giovanna & Kneib, Thomas, 2019. "Adaptive semiparametric M-quantile regression," Econometrics and Statistics, Elsevier, vol. 11(C), pages 116-129.
    2. Arthur Charpentier & Emmanuel Flachaire & Antoine Ly, 2017. "Econométrie et Machine Learning," Papers 1708.06992, arXiv.org, revised Mar 2018.
    3. Hyunju Son & Youyi Fong, 2021. "Fast grid search and bootstrap‐based inference for continuous two‐phase polynomial regression models," Environmetrics, John Wiley & Sons, Ltd., vol. 32(3), May.
    4. Zi Ye & Giles Hooker & Stephen P. Ellner, 2021. "Generalized Single Index Models and Jensen Effects on Reproduction and Survival," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 26(3), pages 492-512, September.
    5. Ferraccioli, Federico & Sangalli, Laura M. & Finos, Livio, 2022. "Some first inferential tools for spatial regression with differential regularization," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    6. Vahid Goodarzi Vanani & Davood Shahsavani & Mohammad Kazemi, 2025. "A robust partial linear model combining modified Huber loss function and variable selection," Statistical Papers, Springer, vol. 66(6), pages 1-28, October.
    7. Akdeniz Duran, Esra & Härdle, Wolfgang Karl & Osipenko, Maria, 2012. "Difference based ridge and Liu type estimators in semiparametric regression models," Journal of Multivariate Analysis, Elsevier, vol. 105(1), pages 164-175.
    8. Nagler Thomas & Schellhase Christian & Czado Claudia, 2017. "Nonparametric estimation of simplified vine copula models: comparison of methods," Dependence Modeling, De Gruyter, vol. 5(1), pages 99-120, January.
    9. Wei Huang & Oliver Linton & Zheng Zhang, 2022. "A Unified Framework for Specification Tests of Continuous Treatment Effect Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(4), pages 1817-1830, October.
    10. Basile, Roberto & Durbán, María & Mínguez, Román & María Montero, Jose & Mur, Jesús, 2014. "Modeling regional economic dynamics: Spatial dependence, spatial heterogeneity and nonlinearities," Journal of Economic Dynamics and Control, Elsevier, vol. 48(C), pages 229-245.
    11. Morteza Amini & Mahdi Roozbeh & Nur Anisah Mohamed, 2024. "Separation of the Linear and Nonlinear Covariates in the Sparse Semi-Parametric Regression Model in the Presence of Outliers," Mathematics, MDPI, vol. 12(2), pages 1-17, January.
    12. Wahba, Jackline & Schluter, Christian, 2009. "Illegal migration, wages and remittances- semi-parametric estimation of illegality effects," Discussion Paper Series In Economics And Econometrics 913, Economics Division, School of Social Sciences, University of Southampton.
    13. Feng, Yuanhua & Härdle, Wolfgang Karl, 2020. "A data-driven P-spline smoother and the P-Spline-GARCH models," IRTG 1792 Discussion Papers 2020-016, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    14. Schmidt, Rouven & Kneib, Thomas, 2023. "Multivariate distributional stochastic frontier models," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    15. Clark, Andrew E. & Etilé, Fabrice, 2011. "Happy house: Spousal weight and individual well-being," Journal of Health Economics, Elsevier, vol. 30(5), pages 1124-1136.
    16. Hannes Matuschek & Reinhold Kliegl & Matthias Holschneider, 2015. "Smoothing Spline ANOVA Decomposition of Arbitrary Splines: An Application to Eye Movements in Reading," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-15, March.
    17. Shirun Shen & Huiya Zhou & Kejun He & Lan Zhou, 2024. "Principal Component Analysis of Two-dimensional Functional Data with Serial Correlation," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 29(3), pages 601-620, September.
    18. Michaelides, Michael & Spanos, Aris, 2020. "On modeling heterogeneity in linear models using trend polynomials," Economic Modelling, Elsevier, vol. 85(C), pages 74-86.
    19. Lu, Qiang (Steven) & Yang, Yupin & Yuksel, Ulku, 2015. "The impact of a new online channel: An empirical study," Annals of Tourism Research, Elsevier, vol. 54(C), pages 136-155.
    20. Afonso, António & Alves, José & Beck, Krzysztof & Jackson, Karen, 2024. "Financial, institutional, and macroeconomic determinants of cross-country portfolio equity flows: The case of developed countries," Economic Modelling, Elsevier, vol. 141(C).
