IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v155y2017icp133-150.html
   My bibliography  Save this article

Profile forward regression screening for ultra-high dimensional semiparametric varying coefficient partially linear models

Author

Listed:
  • Li, Yujie
  • Li, Gaorong
  • Lian, Heng
  • Tong, Tiejun

Abstract

In this paper, we consider semiparametric varying coefficient partially linear models when the predictor variables of the linear part are ultra-high dimensional where the dimensionality grows exponentially with the sample size. We propose a profile forward regression (PFR) method to perform variable screening for ultra-high dimensional linear predictor variables. The proposed PFR algorithm can not only identify all relevant predictors consistently even for ultra-high semiparametric models including both nonparametric and parametric parts, but also possesses the screening consistency property. To determine whether or not to include the candidate predictor in the model of selected ones, we adopt an extended Bayesian information criterion (EBIC) to select the “best” candidate model. Simulation studies and a real data example are also carried out to assess the performance of the proposed method and to compare it with existing screening methods.

Suggested Citation

  • Li, Yujie & Li, Gaorong & Lian, Heng & Tong, Tiejun, 2017. "Profile forward regression screening for ultra-high dimensional semiparametric varying coefficient partially linear models," Journal of Multivariate Analysis, Elsevier, vol. 155(C), pages 133-150.
  • Handle: RePEc:eee:jmvana:v:155:y:2017:i:c:p:133-150
    DOI: 10.1016/j.jmva.2016.12.006
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X16302615
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2016.12.006?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jingyuan Liu & Runze Li & Rongling Wu, 2014. "Feature Selection for Varying Coefficient Models With Ultrahigh-Dimensional Covariates," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 266-274, March.
    2. Jiahua Chen & Zehua Chen, 2008. "Extended Bayesian information criteria for model selection with large model spaces," Biometrika, Biometrika Trust, vol. 95(3), pages 759-771.
    3. Wenxuan Zhong & Tingting Zhang & Yu Zhu & Jun S. Liu, 2012. "Correlation pursuit: forward stepwise variable selection for index models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 74(5), pages 849-870, November.
    4. Wang, Lifeng & Li, Hongzhe & Huang, Jianhua Z., 2008. "Variable Selection in Nonparametric Varying-Coefficient Models for Analysis of Repeated Measurements," Journal of the American Statistical Association, American Statistical Association, vol. 103(484), pages 1556-1569.
    5. Fan, Jianqing & Feng, Yang & Song, Rui, 2011. "Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Additive Models," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 544-557.
    6. Li, Gaorong & Xue, Liugen & Lian, Heng, 2011. "Semi-varying coefficient models with a diverging number of components," Journal of Multivariate Analysis, Elsevier, vol. 102(7), pages 1166-1174, August.
    7. Zhao, Peixin & Xue, Liugen, 2009. "Variable selection for semiparametric varying coefficient partially linear models," Statistics & Probability Letters, Elsevier, vol. 79(20), pages 2148-2157, October.
    8. Hengjian Cui & Runze Li & Wei Zhong, 2015. "Model-Free Feature Screening for Ultrahigh Dimensional Discriminant Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 630-641, June.
    9. Runze Li & Wei Zhong & Liping Zhu, 2012. "Feature Screening via Distance Correlation Learning," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(499), pages 1129-1139, September.
    10. Jianqing Fan & Yunbei Ma & Wei Dai, 2014. "Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Varying Coefficient Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 1270-1284, September.
    11. Weihua Zhao & Riquan Zhang & Jicai Liu & Yazhao Lv, 2014. "Robust and efficient variable selection for semiparametric partially linear varying coefficient model based on modal regression," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 66(1), pages 165-191, February.
    12. Li, Qi, et al, 2002. "Semiparametric Smooth Coefficient Models," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(3), pages 412-422, July.
    13. Xue, Liugen & Zhu, Lixing, 2007. "Empirical Likelihood for a Varying Coefficient Model With Longitudinal Data," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 642-654, June.
    14. Zhaoping Hong & Yuao Hu & Heng Lian, 2013. "Variable selection for high-dimensional varying coefficient partially linear models via nonconcave penalty," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(7), pages 887-908, October.
    15. Lam, Clifford & Fan, Jianqing, 2008. "Profile-kernel likelihood inference with diverging number of parameters," LSE Research Online Documents on Economics 31548, London School of Economics and Political Science, LSE Library.
    16. Yingcun Xia, 2004. "Efficient estimation for semivarying-coefficient models," Biometrika, Biometrika Trust, vol. 91(3), pages 661-681, September.
    17. Zhang, Weiwei & Li, Gaorong & Xue, Liugen, 2011. "Profile inference on partially linear varying-coefficient errors-in-variables models under restricted condition," Computational Statistics & Data Analysis, Elsevier, vol. 55(11), pages 3027-3040, November.
    18. You, Jinhong & Chen, Gemai, 2006. "Estimation of a semiparametric varying-coefficient partially linear errors-in-variables model," Journal of Multivariate Analysis, Elsevier, vol. 97(2), pages 324-341, February.
    19. Wang, Hansheng, 2009. "Forward Regression for Ultra-High Dimensional Variable Screening," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1512-1524.
    20. You, Jinhong & Zhou, Yong, 2006. "Empirical likelihood for semiparametric varying-coefficient partially linear regression models," Statistics & Probability Letters, Elsevier, vol. 76(4), pages 412-422, February.
    21. Li, Gaorong & Feng, Sanying & Peng, Heng, 2011. "A profile-type smoothed score function for a varying coefficient partially linear model," Journal of Multivariate Analysis, Elsevier, vol. 102(2), pages 372-385, February.
    22. Li, Gaorong & Lin, Lu & Zhu, Lixing, 2012. "Empirical likelihood for a varying coefficient partially linear model with diverging number of parameters," Journal of Multivariate Analysis, Elsevier, vol. 105(1), pages 85-111.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhaoliang Wang & Liugen Xue & Gaorong Li & Fei Lu, 2019. "Spline estimator for ultra-high dimensional partially linear varying coefficient models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(3), pages 657-677, June.
    2. Zhang, Shen & Zhao, Peixin & Li, Gaorong & Xu, Wangli, 2019. "Nonparametric independence screening for ultra-high dimensional generalized varying coefficient models with longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 37-52.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhaoliang Wang & Liugen Xue & Gaorong Li & Fei Lu, 2019. "Spline estimator for ultra-high dimensional partially linear varying coefficient models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(3), pages 657-677, June.
    2. Akira Shinkyu, 2023. "Forward Selection for Feature Screening and Structure Identification in Varying Coefficient Models," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 85(1), pages 485-511, February.
    3. Zhang, Shucong & Zhou, Yong, 2018. "Variable screening for ultrahigh dimensional heterogeneous data via conditional quantile correlations," Journal of Multivariate Analysis, Elsevier, vol. 165(C), pages 1-13.
    4. Zhao, Peixin & Xue, Liugen, 2010. "Variable selection for semiparametric varying coefficient partially linear errors-in-variables models," Journal of Multivariate Analysis, Elsevier, vol. 101(8), pages 1872-1883, September.
    5. Zhang, Shen & Zhao, Peixin & Li, Gaorong & Xu, Wangli, 2019. "Nonparametric independence screening for ultra-high dimensional generalized varying coefficient models with longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 171(C), pages 37-52.
    6. Fan, Guo-Liang & Liang, Han-Ying & Shen, Yu, 2016. "Penalized empirical likelihood for high-dimensional partially linear varying coefficient model with measurement errors," Journal of Multivariate Analysis, Elsevier, vol. 147(C), pages 183-201.
    7. Sanying Feng & Liugen Xue, 2014. "Bias-corrected statistical inference for partially linear varying coefficient errors-in-variables models with restricted condition," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 66(1), pages 121-140, February.
    8. Lu, Jun & Lin, Lu, 2018. "Feature screening for multi-response varying coefficient models with ultrahigh dimensional predictors," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 242-254.
    9. Yang, Baoying & Yin, Xiangrong & Zhang, Nan, 2019. "Sufficient variable selection using independence measures for continuous response," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 480-493.
    10. Peixin Zhao & Liugen Xue, 2011. "Variable selection for varying coefficient models with measurement errors," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 74(2), pages 231-245, September.
    11. Li, Gaorong & Lin, Lu & Zhu, Lixing, 2012. "Empirical likelihood for a varying coefficient partially linear model with diverging number of parameters," Journal of Multivariate Analysis, Elsevier, vol. 105(1), pages 85-111.
    12. Weihua Zhao & Riquan Zhang & Jicai Liu & Yazhao Lv, 2014. "Robust and efficient variable selection for semiparametric partially linear varying coefficient model based on modal regression," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 66(1), pages 165-191, February.
    13. Dong, Yuexiao & Yu, Zhou & Zhu, Liping, 2020. "Model-free variable selection for conditional mean in regression," Computational Statistics & Data Analysis, Elsevier, vol. 152(C).
    14. Shuaishuai Chen & Jun Lu, 2023. "Quantile-Composited Feature Screening for Ultrahigh-Dimensional Data," Mathematics, MDPI, vol. 11(10), pages 1-21, May.
    15. Mingqiu Wang & Peixin Zhao & Xiaoning Kang, 2020. "Structure identification for varying coefficient models with measurement errors based on kernel smoothing," Statistical Papers, Springer, vol. 61(5), pages 1841-1857, October.
    16. Shan Luo & Zehua Chen, 2014. "Sequential Lasso Cum EBIC for Feature Selection With Ultra-High Dimensional Feature Space," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 1229-1240, September.
    17. Wang, Christina Dan & Chen, Zhao & Lian, Yimin & Chen, Min, 2022. "Asset selection based on high frequency Sharpe ratio," Journal of Econometrics, Elsevier, vol. 227(1), pages 168-188.
    18. Yang, Yiping & Li, Gaorong & Peng, Heng, 2014. "Empirical likelihood of varying coefficient errors-in-variables models with longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 127(C), pages 1-18.
    19. Lai, Peng & Song, Fengli & Chen, Kaiwen & Liu, Zhi, 2017. "Model free feature screening with dependent variable in ultrahigh dimensional binary classification," Statistics & Probability Letters, Elsevier, vol. 125(C), pages 141-148.
    20. Zhao, Bangxin & Liu, Xin & He, Wenqing & Yi, Grace Y., 2021. "Dynamic tilted current correlation for high dimensional variable screening," Journal of Multivariate Analysis, Elsevier, vol. 182(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:155:y:2017:i:c:p:133-150. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.