IDEAS home Printed from https://ideas.repec.org/a/eee/ecosta/v5y2018icp124-136.html
   My bibliography  Save this article

Semiparametric method for model structure discovery in additive regression models

Author

Listed:
  • Yoshida, Takuma

Abstract

In regression analysis, there are two typical approaches, parametric methods and nonparametric methods. If the prior information of the structure of the regression function is obtained, a parametric method is preferred since they are efficient and easily interpreted. When the model is misspecified, on the other hand, parametric estimators do not work well. Therefore, it is important to check whether the parametric model assumption is valid. To simultaneously discover the model structure and estimate the regression function in additive regression models, a new semiparametric method is proposed. First, a parametric model is prepared and its estimator is obtained for all additive components. Next, for the residual data associated with the parametric estimator, a nonparametric method is applied. The final estimator is constructed by summing the parametric estimator and the nonparametric estimator of the residual data. In the second-step estimation, the B-spline method with an adaptive group lasso penalty is utilized. For each additive component, if the nonparametric estimator becomes the zero function, the final estimator is reduced to a parametric estimator. In other words, the model structure can then be discovered. The asymptotic properties of the proposed estimator are shown. A numerical study via a Monte Carlo simulation and a real data application are presented.

Suggested Citation

  • Yoshida, Takuma, 2018. "Semiparametric method for model structure discovery in additive regression models," Econometrics and Statistics, Elsevier, vol. 5(C), pages 124-136.
  • Handle: RePEc:eee:ecosta:v:5:y:2018:i:c:p:124-136
    DOI: 10.1016/j.ecosta.2017.02.005
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S245230621730014X
    Download Restriction: Full text for ScienceDirect subscribers only. Contains open access articles

    File URL: https://libkey.io/10.1016/j.ecosta.2017.02.005?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Horowitz, Joel L. & Lee, Sokbae, 2005. "Nonparametric Estimation of an Additive Quantile Regression Model," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1238-1249, December.
    2. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    3. Opsomer, Jean D., 2000. "Asymptotic Properties of Backfitting Estimators," Journal of Multivariate Analysis, Elsevier, vol. 73(2), pages 166-179, May.
    4. Marx, Brian D. & Eilers, Paul H. C., 1998. "Direct generalized additive modeling with penalized likelihood," Computational Statistics & Data Analysis, Elsevier, vol. 28(2), pages 193-209, August.
    5. Jianqing Fan & Arnab Maity & Yihui Wang & Yichao Wu, 2013. "Parametrically guided generalised additive models with application to mergers and acquisitions data," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 25(1), pages 109-128, March.
    6. Fan, Jianqing & Feng, Yang & Song, Rui, 2011. "Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Additive Models," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 544-557.
    7. Carlos Martins-Filho & Santosh Mishra & Aman Ullah, 2008. "A Class of Improved Parametrically Guided Nonparametric Regression Estimators," Econometric Reviews, Taylor & Francis Journals, vol. 27(4-6), pages 542-573.
    8. Pradeep Ravikumar & John Lafferty & Han Liu & Larry Wasserman, 2009. "Sparse additive models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(5), pages 1009-1030, November.
    9. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    10. White, Halbert, 1982. "Maximum Likelihood Estimation of Misspecified Models," Econometrica, Econometric Society, vol. 50(1), pages 1-25, January.
    11. Liu, Rong & Yang, Lijian, 2010. "Spline-Backfitted Kernel Smoothing Of Additive Coefficient Model," Econometric Theory, Cambridge University Press, vol. 26(1), pages 29-59, February.
    12. El Ghouch, Anouar & Genton, Marc G., 2009. "Local Polynomial Quantile Regression With Parametric Features," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1416-1429.
    13. Zhang, Hao Helen & Cheng, Guang & Liu, Yufeng, 2011. "Linear or Nonlinear? Automatic Structure Discovery for Partially Linear Models," Journal of the American Statistical Association, American Statistical Association, vol. 106(495), pages 1099-1112.
    14. Yichao Wu & Leonard A. Stefanski, 2015. "Automatic structure recovery for additive models," Biometrika, Biometrika Trust, vol. 102(2), pages 381-395.
    15. Opsomer, Jean D. & Ruppert, D., 1998. "A Fully Automated Bandwidth Selection Method for Fitting Additive Models," Staff General Research Papers Archive 1176, Iowa State University, Department of Economics.
    16. Ming Yuan & Yi Lin, 2006. "Model selection and estimation in regression with grouped variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(1), pages 49-67, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Cai, Xizhen & Zhu, Yeying & Huang, Yuan & Ghosh, Debashis, 2022. "High-dimensional causal mediation analysis based on partial linear structural equation models," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Du, Pang & Cheng, Guang & Liang, Hua, 2012. "Semiparametric regression models with additive nonparametric components and high dimensional parametric components," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 2006-2017.
    2. Fabian Scheipl & Thomas Kneib & Ludwig Fahrmeir, 2013. "Penalized likelihood and Bayesian function selection in regression models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 97(4), pages 349-385, October.
    3. Lian, Heng & Feng, Sanying & Zhao, Kaifeng, 2015. "Parametric and semiparametric reduced-rank regression with flexible sparsity," Journal of Multivariate Analysis, Elsevier, vol. 136(C), pages 163-174.
    4. Lian, Heng & Meng, Jie & Zhao, Kaifeng, 2015. "Spline estimator for simultaneous variable selection and constant coefficient identification in high-dimensional generalized varying-coefficient models," Journal of Multivariate Analysis, Elsevier, vol. 141(C), pages 81-103.
    5. Loann David Denis Desboulets, 2018. "A Review on Variable Selection in Regression Analysis," Econometrics, MDPI, vol. 6(4), pages 1-27, November.
    6. Li, Xinyi & Wang, Li & Nettleton, Dan, 2019. "Sparse model identification and learning for ultra-high-dimensional additive partially linear models," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 204-228.
    7. Zhenghui Feng & Lu Lin & Ruoqing Zhu & Lixing Zhu, 2020. "Nonparametric variable selection and its application to additive models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 72(3), pages 827-854, June.
    8. He, Yong & Zhang, Liang & Ji, Jiadong & Zhang, Xinsheng, 2019. "Robust feature screening for elliptical copula regression model," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 568-582.
    9. Takuma Yoshida & Kanta Naito, 2014. "Asymptotics for penalised splines in generalised additive models," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 26(2), pages 269-289, June.
    10. Jiang, Liewen & Bondell, Howard D. & Wang, Huixia Judy, 2014. "Interquantile shrinkage and variable selection in quantile regression," Computational Statistics & Data Analysis, Elsevier, vol. 69(C), pages 208-219.
    11. Min Chen & Yimin Lian & Zhao Chen & Zhengjun Zhang, 2017. "Sure explained variability and independence screening," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 29(4), pages 849-883, October.
    12. Zhang, Shucong & Zhou, Yong, 2018. "Variable screening for ultrahigh dimensional heterogeneous data via conditional quantile correlations," Journal of Multivariate Analysis, Elsevier, vol. 165(C), pages 1-13.
    13. Takuma Yoshida, 2019. "Two stage smoothing in additive models with missing covariates," Statistical Papers, Springer, vol. 60(6), pages 1803-1826, December.
    14. Xia, Xiaochao & Yang, Hu & Li, Jialiang, 2016. "Feature screening for generalized varying coefficient models with application to dichotomous responses," Computational Statistics & Data Analysis, Elsevier, vol. 102(C), pages 85-97.
    15. Wenyan Zhong & Xuewen Lu & Jingjing Wu, 2021. "Bi-level variable selection in semiparametric transformation models with right-censored data," Computational Statistics, Springer, vol. 36(3), pages 1661-1692, September.
    16. Jiawei Hou & Yunquan Song, 2022. "Interquantile shrinkage in spatial additive autoregressive models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(4), pages 1030-1057, December.
    17. Diego Vidaurre & Concha Bielza & Pedro Larrañaga, 2013. "A Survey of L1 Regression," International Statistical Review, International Statistical Institute, vol. 81(3), pages 361-387, December.
    18. Noh, Hohsuk & Lee, Eun, 2012. "Component Selection in Additive Quantile Regression Models," LIDAM Discussion Papers ISBA 2012021, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    19. Heng Lian & Peng Lai & Hua Liang, 2013. "Partially Linear Structure Selection in Cox Models with Varying Coefficients," Biometrics, The International Biometric Society, vol. 69(2), pages 348-357, June.
    20. Lin, Hongmei & Lian, Heng & Liang, Hua, 2019. "Rank reduction for high-dimensional generalized additive models," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 672-684.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ecosta:v:5:y:2018:i:c:p:124-136. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/econometrics-and-statistics .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.