IDEAS home Printed from https://ideas.repec.org/a/spr/stpapr/v66y2025i6d10.1007_s00362-025-01745-3.html
   My bibliography  Save this article

A robust partial linear model combining modified Huber loss function and variable selection

Author

Listed:
  • Vahid Goodarzi Vanani

    (Shahrood University of Technology)

  • Davood Shahsavani

    (Shahrood University of Technology)

  • Mohammad Kazemi

    (University of Guilan)

Abstract

Partial linear models (PLMs), which contain both linear and nonlinear components, are flexible and more efficient than general nonparametric regression models. Typically, the estimation of PLMs is based on least squares or likelihood-based methods, which are very sensitive to outliers and may not be effective for heavy tail error distributions. This paper proposes a new, efficient, and robust estimation procedure based on a modified Huber loss function, in which the redundant covariates are also identified by a feature selection method simultaneously. Several simulation studies were conducted to assess the performance of the proposed model in the presence of outliers. Our simulation results demonstrate that the proposed model is efficient and robust against outliers and heavy-tailed distributions. Moreover, it demonstrates better performance in both prediction accuracy and variable selection compared to competing methods. Finally, the real Boston housing dataset is used to assess the proposed model.

Suggested Citation

  • Vahid Goodarzi Vanani & Davood Shahsavani & Mohammad Kazemi, 2025. "A robust partial linear model combining modified Huber loss function and variable selection," Statistical Papers, Springer, vol. 66(6), pages 1-28, October.
  • Handle: RePEc:spr:stpapr:v:66:y:2025:i:6:d:10.1007_s00362-025-01745-3
    DOI: 10.1007/s00362-025-01745-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00362-025-01745-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00362-025-01745-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Wang, Jia & Cai, Xizhen & Li, Runze, 2021. "Variable selection for partially linear models via Bayesian subset modeling with diffusing prior," Journal of Multivariate Analysis, Elsevier, vol. 183(C).
    2. Gertraud Malsiner‐Walli & Paul Hofmarcher & Bettina Grün, 2019. "Semi‐parametric Regression under Model Uncertainty: Economic Applications," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 81(5), pages 1117-1143, October.
    3. Liu, Jingyuan & Lou, Lejia & Li, Runze, 2018. "Variable selection for partially linear models via partial correlation," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 418-434.
    4. Yang, Yiping & Luo, Chuanqin & Yang, Weiming, 2024. "Double penalized variable selection for high-dimensional partial linear mixed effects models," Journal of Multivariate Analysis, Elsevier, vol. 204(C).
    5. Xinyu Fu & Mian Huang & Weixin Yao, 2024. "Semiparametric efficient estimation in high‐dimensional partial linear regression models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 51(3), pages 1259-1287, September.
    6. Zhuoran Yang & Liya Fu & You-Gan Wang & Zhixiong Dong & Yunlu Jiang, 2022. "A robust and efficient variable selection method for linear regression," Journal of Applied Statistics, Taylor & Francis Journals, vol. 49(14), pages 3677-3692, October.
    7. Brice M. Nguelifack & Isabelle Kemajou-Brown, 2020. "Robust signed-rank estimation and variable selection for semi-parametric additive partial linear models," Journal of Applied Statistics, Taylor & Francis Journals, vol. 47(10), pages 1794-1819, July.
    8. Ruppert,David & Wand,M. P. & Carroll,R. J., 2003. "Semiparametric Regression," Cambridge Books, Cambridge University Press, number 9780521780506, November.
    9. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    10. Wang Q. & Linton O. & Hardle W., 2004. "Semiparametric Regression Analysis With Missing Response at Random," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 334-345, January.
    11. Xiuli Wang & Jingchang Shao & Jingjing Wu & Qiang Zhao, 2023. "Robust variable selection with exponential squared loss for partially linear spatial autoregressive models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(6), pages 949-977, December.
    12. Hu Yang & Ning Li & Jing Yang, 2020. "A robust and efficient estimation and variable selection method for partially linear models with large-dimensional covariates," Statistical Papers, Springer, vol. 61(5), pages 1911-1937, October.
    13. Ruppert,David & Wand,M. P. & Carroll,R. J., 2003. "Semiparametric Regression," Cambridge Books, Cambridge University Press, number 9780521785167, November.
    14. Jia Chen & Jiti Gao & Degui Li, 2013. "Estimation in Partially Linear Single-Index Panel Data Models With Fixed Effects," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 31(3), pages 315-330, July.
    15. Harrison, David Jr. & Rubinfeld, Daniel L., 1978. "Hedonic housing prices and the demand for clean air," Journal of Environmental Economics and Management, Elsevier, vol. 5(1), pages 81-102, March.
    16. Gong, Siliang & Zhang, Kai & Liu, Yufeng, 2018. "Efficient test-based variable selection for high-dimensional linear models," Journal of Multivariate Analysis, Elsevier, vol. 166(C), pages 17-31.
    17. Inyoung Kim & Noah D. Cohen & Raymond J. Carroll, 2003. "Semiparametric Regression Splines in Matched Case-Control Studies," Biometrics, The International Biometric Society, vol. 59(4), pages 1158-1169, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dlugosz, Stephan & Mammen, Enno & Wilke, Ralf A., 2017. "Generalized partially linear regression with misclassified data and an application to labour market transitions," Computational Statistics & Data Analysis, Elsevier, vol. 110(C), pages 145-159.
    2. Akdeniz Duran, Esra & Härdle, Wolfgang Karl & Osipenko, Maria, 2012. "Difference based ridge and Liu type estimators in semiparametric regression models," Journal of Multivariate Analysis, Elsevier, vol. 105(1), pages 164-175.
    3. Morteza Amini & Mahdi Roozbeh & Nur Anisah Mohamed, 2024. "Separation of the Linear and Nonlinear Covariates in the Sparse Semi-Parametric Regression Model in the Presence of Outliers," Mathematics, MDPI, vol. 12(2), pages 1-17, January.
    4. Shirun Shen & Huiya Zhou & Kejun He & Lan Zhou, 2024. "Principal Component Analysis of Two-dimensional Functional Data with Serial Correlation," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 29(3), pages 601-620, September.
    5. Afonso, António & Alves, José & Beck, Krzysztof & Jackson, Karen, 2024. "Financial, institutional, and macroeconomic determinants of cross-country portfolio equity flows: The case of developed countries," Economic Modelling, Elsevier, vol. 141(C).
    6. Nolan, Tui H. & Richardson, Sylvia & Ruffieux, Hélène, 2025. "Efficient Bayesian functional principal component analysis of irregularly-observed multivariate curves," Computational Statistics & Data Analysis, Elsevier, vol. 203(C).
    7. Zanin, Luca, 2023. "A flexible estimation of sectoral portfolio exposure to climate transition risks in the European stock market," Journal of Behavioral and Experimental Finance, Elsevier, vol. 39(C).
    8. Gao, Lisa & Shi, Peng, 2022. "Leveraging high-resolution weather information to predict hail damage claims: A spatial point process for replicated point patterns," Insurance: Mathematics and Economics, Elsevier, vol. 107(C), pages 161-179.
    9. Rachid Muleia & Shelsea Luís Damião & Áuria Ribeiro Banze & Cynthia Semá Baltazar & Isaac Akpor Adjei, 2025. "Factors associated with early sexual debut among adolescents and youth in Mozambique: A geo-additive survival analysis of the Mozambique 2021 AIDS indicator survey," PLOS ONE, Public Library of Science, vol. 20(6), pages 1-17, June.
    10. Qi Qian & Danh V. Nguyen & Esra Kürüm & Connie M. Rhee & Sudipto Banerjee & Yihao Li & Damla Şentürk, 2024. "Multivariate Varying Coefficient Spatiotemporal Model," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 16(3), pages 761-786, December.
    11. Yu Liu & Chin-Shang Li, 2023. "A linear spline Cox cure model with its applications," Computational Statistics, Springer, vol. 38(2), pages 935-954, June.
    12. Elizabeth Goult & Laura Andrea Barrero Guevara & Michael Briga & Matthieu Domenech de Cellès, 2024. "Estimating the optimal age for infant measles vaccination," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    13. Caldeira, João F. & Santos, André A.P. & Torrent, Hudson S., 2023. "Semiparametric portfolios: Improving portfolio performance by exploiting non-linearities in firm characteristics," Economic Modelling, Elsevier, vol. 122(C).
    14. Huang, Lei & Jiang, Hui & Wang, Huixia, 2019. "A novel partial-linear single-index model for time series data," Computational Statistics & Data Analysis, Elsevier, vol. 134(C), pages 110-122.
    15. Benjamin Owusu & Bettina Bökemeier & Alfred Greiner, 2023. "Assessing nonlinearities and heterogeneity in debt sustainability analysis: a panel spline approach," Empirical Economics, Springer, vol. 64(3), pages 1315-1346, March.
    16. Li, Yehua & Qiu, Yumou & Xu, Yuhang, 2022. "From multivariate to functional data analysis: Fundamentals, recent developments, and emerging areas," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    17. Maximilian Osterhaus, 2024. "A Sparse Grid Approach for the Nonparametric Estimation of High-Dimensional Random Coefficient Models," Papers 2408.07185, arXiv.org.
    18. Kalogridis, Ioannis & Van Aelst, Stefan, 2023. "Robust penalized estimators for functional linear regression," Journal of Multivariate Analysis, Elsevier, vol. 194(C).
    19. Goepp, Vivien & Bouaziz, Olivier & Nuel, Grégory, 2025. "Spline regression with automatic knot selection," Computational Statistics & Data Analysis, Elsevier, vol. 202(C).
    20. Hamid Mraoui & Ahmed El-Alaoui & Souad Bechrouri & Nezha Mohaoui & Abdelilah Monir, 2025. "Two-stage regression spline modeling based on local polynomial kernel regression," Computational Statistics, Springer, vol. 40(1), pages 383-403, January.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stpapr:v:66:y:2025:i:6:d:10.1007_s00362-025-01745-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.