IDEAS home Printed from https://ideas.repec.org/p/zbw/irtgdp/2018002.html
   My bibliography  Save this paper

Nonparametric Variable Selection and Its Application to Additive Models

Author

Listed:
  • Feng, Zheng-Hui
  • Lin, Lu
  • Zhu, Ruo-Qing
  • Zhu, Li-Xing

Abstract

For multivariate nonparametric regression models, existing variable selection methods with penalization require high-dimensional nonparametric approximations in objective functions. When the dimension is high, none of methods with penalization in the literature are readily available. Also, ranking and screening approaches cannot have selection consistency when iterative algorithms cannot be used due to inefficient nonparametric approximation. In this paper, a novel and easily implemented approach is proposed to make existing methods feasible for selection with no need of nonparametric approximation. Selection consistency can be achieved. As an application to additive regression models, we then suggest a two-stage procedure that separates selection and estimation steps. An adaptive estimation to the smoothness of underlying components can be constructed such that the consistency can be even at parametric rate if the underlying model is really parametric. Simulations are carried out to examine the performance of our method, and a real data example is analyzed for illustration.

Suggested Citation

  • Feng, Zheng-Hui & Lin, Lu & Zhu, Ruo-Qing & Zhu, Li-Xing, 2018. "Nonparametric Variable Selection and Its Application to Additive Models," IRTG 1792 Discussion Papers 2018-002, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
  • Handle: RePEc:zbw:irtgdp:2018002
    as

    Download full text from publisher

    File URL: https://www.econstor.eu/bitstream/10419/230713/1/irtg1792dp2018-002.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Xia Cui & Heng Peng & Songqiao Wen & Lixing Zhu, 2013. "Component Selection in the Additive Regression Model," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 40(3), pages 491-510, September.
    2. Runze Li & Wei Zhong & Liping Zhu, 2012. "Feature Screening via Distance Correlation Learning," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(499), pages 1129-1139, September.
    3. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    4. Zhu, Lixing & Miao, Baiqi & Peng, Heng, 2006. "On Sliced Inverse Regression With High-Dimensional Covariates," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 630-643, June.
    5. Buhlmann P. & Yu B., 2003. "Boosting With the L2 Loss: Regression and Classification," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 324-339, January.
    6. Pradeep Ravikumar & John Lafferty & Han Liu & Larry Wasserman, 2009. "Sparse additive models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(5), pages 1009-1030, November.
    7. Lin, Lu & Sun, Jing & Zhu, Lixing, 2013. "Nonparametric feature screening," Computational Statistics & Data Analysis, Elsevier, vol. 67(C), pages 162-174.
    8. Liping Zhu & Tao Wang & Lixing Zhu & Louis Ferré, 2010. "Sufficient dimension reduction through discretization-expectation estimation," Biometrika, Biometrika Trust, vol. 97(2), pages 295-304.
    9. Lu Lin & Xia Cui & Lixing Zhu, 2009. "An Adaptive Two‐stage Estimation Method for Additive Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 36(2), pages 248-269, June.
    10. Jianqing Fan & Jinchi Lv, 2008. "Sure independence screening for ultrahigh dimensional feature space," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(5), pages 849-911, November.
    11. Ming Yuan & Yi Lin, 2006. "Model selection and estimation in regression with grouped variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(1), pages 49-67, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Qingliang Fan & Wei Zhong, 2018. "Nonparametric Additive Instrumental Variable Estimator: A Group Shrinkage Estimation Perspective," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 36(3), pages 388-399, July.
    2. Victor Chernozhukov & Wolfgang Härdle & Chen Huang & Weining Wang, 2018. "LASSO-driven inference in time and space," CeMMAP working papers CWP36/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    3. Zhong, Wei & Liu, Xi & Ma, Shuangge, 2018. "Variable selection and direction estimation for single-index models via DC-TGDR method," IRTG 1792 Discussion Papers 2018-050, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    4. Guo, Li & Tao, Yubo & Härdle, Wolfgang Karl, 2018. "Understanding Latent Group Structure of Cryptocurrencies Market: A Dynamic Network Perspective," IRTG 1792 Discussion Papers 2018-032, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    5. Packham, Natalie & Woebbeking, Fabian, 2018. "A factor-model approach for correlation scenarios and correlation stress-testing," IRTG 1792 Discussion Papers 2018-034, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    6. Xiaojia Bao & Qingliang Fan, 2020. "The impact of temperature on gaming productivity: evidence from online games," Empirical Economics, Springer, vol. 58(2), pages 835-867, February.
    7. Packham, Natalie & Kalkbrener, Michael & Overbeck, Ludger, 2018. "Default probabilities and default correlations under stress," IRTG 1792 Discussion Papers 2018-037, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    8. Kuczmaszewska, Anna & Yan, Ji Gao, 2018. "On complete convergence in Marcinkiewicz-Zygmund type SLLN for random variables," IRTG 1792 Discussion Papers 2018-041, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    9. Chen, Haiqiang & Li, Yingxing & Lin, Ming & Zhu, Yanli, 2018. "A Regime Shift Model with Nonparametric Switching Mechanism," IRTG 1792 Discussion Papers 2018-048, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    10. Yatracos, Yannis G., 2018. "Residual'S Influence Index (Rinfin), Bad Leverage And Unmasking In High Dimensional L2-Regression," IRTG 1792 Discussion Papers 2018-060, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    11. Zbonakova, Lenka & Li, Xinjue & Härdle, Wolfgang Karl, 2018. "Penalized Adaptive Forecasting with Large Information Sets and Structural Changes," IRTG 1792 Discussion Papers 2018-039, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    12. Packham, Natalie, 2018. "Optimal contracts under competition when uncertainty from adverse selection and moral hazard are present," IRTG 1792 Discussion Papers 2018-033, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    13. Cai, Zongwu & Fang, Ying & Lin, Ming & Su, Jia, 2018. "Inferences for a Partially Varying Coefficient Model With Endogenous Regressors," IRTG 1792 Discussion Papers 2018-047, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    14. Wang, Honglin & Yu, Fan & Zhou, Yinggang, 2018. "Property Investment and Rental Rate under Housing Price Uncertainty: A Real Options Approach," IRTG 1792 Discussion Papers 2018-051, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    15. Yan, Ji Gao, 2018. "Complete Convergence and Complete Moment Convergence for Maximal Weighted Sums of Extended Negatively Dependent Random Variables," IRTG 1792 Discussion Papers 2018-040, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    16. Kalkbrener, Michael & Packham, Natalie, 2018. "Correlation Under Stress In Normal Variance Mixture Models," IRTG 1792 Discussion Papers 2018-035, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    17. Chiu, Hsin-Yu & Chiang, Mi-Hsiu & Kuo, Wei-Yu, 2018. "Predicative Ability of Similarity-based Futures Trading Strategies," IRTG 1792 Discussion Papers 2018-045, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    18. Guo, Shaojun & Li, Dong & Li, Muyi, 2018. "Strict Stationarity Testing and GLAD Estimation of Double Autoregressive Models," IRTG 1792 Discussion Papers 2018-049, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    19. Koziuk, Andzhey & Spokoiny, Vladimir, 2018. "Toolbox: Gaussian comparison on Eucledian balls," IRTG 1792 Discussion Papers 2018-028, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhenghui Feng & Lu Lin & Ruoqing Zhu & Lixing Zhu, 2020. "Nonparametric variable selection and its application to additive models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 72(3), pages 827-854, June.
    2. Zhou Yu & Yuexiao Dong & Li-Xing Zhu, 2016. "Trace Pursuit: A General Framework for Model-Free Variable Selection," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 813-821, April.
    3. Loann David Denis Desboulets, 2018. "A Review on Variable Selection in Regression Analysis," Econometrics, MDPI, vol. 6(4), pages 1-27, November.
    4. Feng, Zhenghui & Zhu, Lixing, 2012. "An alternating determination–optimization approach for an additive multi-index model," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1981-1993.
    5. He, Yong & Zhang, Liang & Ji, Jiadong & Zhang, Xinsheng, 2019. "Robust feature screening for elliptical copula regression model," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 568-582.
    6. Min Chen & Yimin Lian & Zhao Chen & Zhengjun Zhang, 2017. "Sure explained variability and independence screening," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 29(4), pages 849-883, October.
    7. Zhang, Shucong & Zhou, Yong, 2018. "Variable screening for ultrahigh dimensional heterogeneous data via conditional quantile correlations," Journal of Multivariate Analysis, Elsevier, vol. 165(C), pages 1-13.
    8. Peirong Xu & Lixing Zhu & Yi Li, 2014. "Ultrahigh dimensional time course feature selection," Biometrics, The International Biometric Society, vol. 70(2), pages 356-365, June.
    9. Qiu, Debin & Ahn, Jeongyoun, 2020. "Grouped variable screening for ultra-high dimensional data for linear model," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    10. Aneiros, Germán & Novo, Silvia & Vieu, Philippe, 2022. "Variable selection in functional regression models: A review," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    11. Liming Wang & Xingxiang Li & Xiaoqing Wang & Peng Lai, 2022. "Unified mean-variance feature screening for ultrahigh-dimensional regression," Computational Statistics, Springer, vol. 37(4), pages 1887-1918, September.
    12. Akira Shinkyu, 2023. "Forward Selection for Feature Screening and Structure Identification in Varying Coefficient Models," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 85(1), pages 485-511, February.
    13. Yi Chu & Lu Lin, 2020. "Conditional SIRS for nonparametric and semiparametric models by marginal empirical likelihood," Statistical Papers, Springer, vol. 61(4), pages 1589-1606, August.
    14. Umberto Amato & Anestis Antoniadis & Italia De Feis, 2016. "Additive model selection," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 25(4), pages 519-564, November.
    15. Du, Pang & Cheng, Guang & Liang, Hua, 2012. "Semiparametric regression models with additive nonparametric components and high dimensional parametric components," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 2006-2017.
    16. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    17. Shuichi Kawano, 2014. "Selection of tuning parameters in bridge regression models via Bayesian information criterion," Statistical Papers, Springer, vol. 55(4), pages 1207-1223, November.
    18. Wang, Christina Dan & Chen, Zhao & Lian, Yimin & Chen, Min, 2022. "Asset selection based on high frequency Sharpe ratio," Journal of Econometrics, Elsevier, vol. 227(1), pages 168-188.
    19. Hung Hung & Su‐Yun Huang, 2019. "Sufficient dimension reduction via random‐partitions for the large‐p‐small‐n problem," Biometrics, The International Biometric Society, vol. 75(1), pages 245-255, March.
    20. Jingyuan Liu & Runze Li & Rongling Wu, 2014. "Feature Selection for Varying Coefficient Models With Ultrahigh-Dimensional Covariates," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 266-274, March.

    More about this item

    Keywords

    Adaptive estimation; non-parametric additive model; purely nonparametric regression; variable selection;
    All these keywords.

    JEL classification:

    • C00 - Mathematical and Quantitative Methods - - General - - - General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:zbw:irtgdp:2018002. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ZBW - Leibniz Information Centre for Economics (email available below). General contact details of provider: https://edirc.repec.org/data/wfhubde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.