IDEAS home Printed from https://ideas.repec.org/a/spr/stpapr/v60y2019i6d10.1007_s00362-017-0896-6.html
   My bibliography  Save this article

Two stage smoothing in additive models with missing covariates

Author

Listed:
  • Takuma Yoshida

    (Kagoshima University)

Abstract

This paper considers sparse additive models with missing covariates. The missing mechanism is assumed to be missing at random. The additive components are estimated via a two stage method. First, the penalized weighted least squares method is used. The weight is the inverse of the selection probability, which is the probability of observing covariates. As the penalty, we utilize the adaptive group lasso to distinguish between the zero and the nonzero components. Thus, the penalty is used to investigate the sparse structure and the weight reflects the missing structure. The estimator obtained from the penalized weighted least squares method is denoted by the first stage estimator (FSE). We show the sparsity and consistency properties of the FSE. However, the asymptotic distribution of the FSE of the nonzero components is not derived as it is difficult. Therefore for each nonzero component, we apply the penalized spline methods for univariate regression with the residual of the FSE of other component. The asymptotic normality of the second stage estimator is shown. To confirm the performance of the proposed estimator, simulation studies and a real data application are implemented.

Suggested Citation

  • Takuma Yoshida, 2019. "Two stage smoothing in additive models with missing covariates," Statistical Papers, Springer, vol. 60(6), pages 1803-1826, December.
  • Handle: RePEc:spr:stpapr:v:60:y:2019:i:6:d:10.1007_s00362-017-0896-6
    DOI: 10.1007/s00362-017-0896-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00362-017-0896-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00362-017-0896-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Marx, Brian D. & Eilers, Paul H. C., 1998. "Direct generalized additive modeling with penalized likelihood," Computational Statistics & Data Analysis, Elsevier, vol. 28(2), pages 193-209, August.
    2. Liang H. & Wang S. & Robins J.M. & Carroll R.J., 2004. "Estimation in Partially Linear Models With Missing Covariates," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 357-367, January.
    3. Grace Y. Yi & Wenqing He, 2009. "Median Regression Models for Longitudinal Data with Dropouts," Biometrics, The International Biometric Society, vol. 65(2), pages 618-625, June.
    4. Hu Yang & Huilan Liu, 2016. "Penalized weighted composite quantile estimators with missing covariates," Statistical Papers, Springer, vol. 57(1), pages 69-88, March.
    5. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    6. Tingting Li & Hu Yang, 2016. "Inverse probability weighted estimators for single-index models with missing covariates," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 45(5), pages 1199-1214, March.
    7. Xuerong Chen & Alan T. K. Wan & Yong Zhou, 2015. "Efficient Quantile Regression Analysis With Missing Observations," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 723-741, June.
    8. Ruppert,David & Wand,M. P. & Carroll,R. J., 2003. "Semiparametric Regression," Cambridge Books, Cambridge University Press, number 9780521780506.
    9. Sherwood, Ben, 2016. "Variable selection for additive partial linear quantile regression with missing covariates," Journal of Multivariate Analysis, Elsevier, vol. 152(C), pages 206-223.
    10. Li, Tong & Hsiao, Cheng, 2004. "Robust estimation of generalized linear models with measurement errors," Journal of Econometrics, Elsevier, vol. 118(1-2), pages 51-65.
    11. Horowitz, Joel L. & Lee, Sokbae, 2005. "Nonparametric Estimation of an Additive Quantile Regression Model," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1238-1249, December.
    12. Pradeep Ravikumar & John Lafferty & Han Liu & Larry Wasserman, 2009. "Sparse additive models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(5), pages 1009-1030, November.
    13. Ruppert,David & Wand,M. P. & Carroll,R. J., 2003. "Semiparametric Regression," Cambridge Books, Cambridge University Press, number 9780521785167.
    14. Gerda Claeskens & Tatyana Krivobokova & Jean D. Opsomer, 2009. "Asymptotic properties of penalized spline estimators," Biometrika, Biometrika Trust, vol. 96(3), pages 529-544.
    15. Cui, Hengjian & He, Xuming & Ng, Kai W., 2004. "M-estimation for linear models with spatially-correlated errors," Statistics & Probability Letters, Elsevier, vol. 66(4), pages 383-393, March.
    16. Takuma Yoshida & Kanta Naito, 2014. "Asymptotics for penalised splines in generalised additive models," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 26(2), pages 269-289, June.
    17. Inyoung Kim & Noah D. Cohen & Raymond J. Carroll, 2003. "Semiparametric Regression Splines in Matched Case-Control Studies," Biometrics, The International Biometric Society, vol. 59(4), pages 1158-1169, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Takuma Yoshida & Kanta Naito, 2014. "Asymptotics for penalised splines in generalised additive models," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 26(2), pages 269-289, June.
    2. Gao, Yan & Zhang, Xinyu & Wang, Shouyang & Zou, Guohua, 2016. "Model averaging based on leave-subject-out cross-validation," Journal of Econometrics, Elsevier, vol. 192(1), pages 139-151.
    3. Takuma Yoshida, 2016. "Asymptotics and smoothing parameter selection for penalized spline regression with various loss functions," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 70(4), pages 278-303, November.
    4. Cui, Xia & Zhao, Weihua & Lian, Heng & Liang, Hua, 2019. "Pursuit of dynamic structure in quantile additive models with longitudinal data," Computational Statistics & Data Analysis, Elsevier, vol. 130(C), pages 42-60.
    5. Sherwood, Ben, 2016. "Variable selection for additive partial linear quantile regression with missing covariates," Journal of Multivariate Analysis, Elsevier, vol. 152(C), pages 206-223.
    6. Dlugosz, Stephan & Mammen, Enno & Wilke, Ralf A., 2017. "Generalized partially linear regression with misclassified data and an application to labour market transitions," Computational Statistics & Data Analysis, Elsevier, vol. 110(C), pages 145-159.
    7. Morteza Amini & Mahdi Roozbeh & Nur Anisah Mohamed, 2024. "Separation of the Linear and Nonlinear Covariates in the Sparse Semi-Parametric Regression Model in the Presence of Outliers," Mathematics, MDPI, vol. 12(2), pages 1-17, January.
    8. Feng, Yuanhua & Härdle, Wolfgang Karl, 2020. "A data-driven P-spline smoother and the P-Spline-GARCH models," IRTG 1792 Discussion Papers 2020-016, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    9. I. Gijbels & I. Prosdocimi & G. Claeskens, 2010. "Nonparametric estimation of mean and dispersion functions in extended generalized linear models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 19(3), pages 580-608, November.
    10. Liu, Jingyuan & Lou, Lejia & Li, Runze, 2018. "Variable selection for partially linear models via partial correlation," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 418-434.
    11. Peter Pütz & Thomas Kneib, 2018. "A penalized spline estimator for fixed effects panel data models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 102(2), pages 145-166, April.
    12. Hübler, Olaf, 2017. "Health and Body Mass Index: No Simple Relationship," IZA Discussion Papers 10620, Institute of Labor Economics (IZA).
    13. Gerhard Tutz & Jan Gertheiss, 2014. "Rating Scales as Predictors—The Old Question of Scale Level and Some Answers," Psychometrika, Springer;The Psychometric Society, vol. 79(3), pages 357-376, July.
    14. Yoshida, Takuma, 2018. "Semiparametric method for model structure discovery in additive regression models," Econometrics and Statistics, Elsevier, vol. 5(C), pages 124-136.
    15. Simon N. Wood & Zheyuan Li & Gavin Shaddick & Nicole H. Augustin, 2017. "Generalized Additive Models for Gigadata: Modeling the U.K. Black Smoke Network Daily Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 1199-1210, July.
    16. Lee, Wang-Sheng & McKinnish, Terra, 2019. "Locus of control and marital satisfaction: Couple perspectives using Australian data," Journal of Economic Psychology, Elsevier, vol. 74(C).
    17. Chen, Xiaohong, 2007. "Large Sample Sieve Estimation of Semi-Nonparametric Models," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 76, Elsevier.
    18. Wu, Chaojiang & Yu, Yan, 2014. "Partially linear modeling of conditional quantiles using penalized splines," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 170-187.
    19. Wu, Ximing & Sickles, Robin, 2018. "Semiparametric estimation under shape constraints," Econometrics and Statistics, Elsevier, vol. 6(C), pages 74-89.
    20. Simon N. Wood, 2020. "Inference and computation with generalized additive models and their extensions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(2), pages 307-339, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stpapr:v:60:y:2019:i:6:d:10.1007_s00362-017-0896-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.