IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v55y2011i12p3304-3310.html
   My bibliography  Save this article

Variable selection for functional regression models via the L1 regularization

Author

Listed:
  • Matsui, Hidetoshi
  • Konishi, Sadanori

Abstract

In regression analysis, L1 regularizations such as the lasso or the SCAD provide sparse solutions, which leads to variable selection. We consider the variable selection problem where variables are given as functional forms, using L1 regularization. In order to select functional variables each of which is controlled by multiple parameters, we treat parameters as grouped parameters and then apply the group SCAD. A crucial issue in the regularization method is the choice of regularization parameters. We derive a model selection criterion for evaluating the model estimated by the regularization method via the group SCAD penalty. Results of simulation and real data analysis show the effectiveness of the proposed modeling strategy.

Suggested Citation

  • Matsui, Hidetoshi & Konishi, Sadanori, 2011. "Variable selection for functional regression models via the L1 regularization," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3304-3310, December.
  • Handle: RePEc:eee:csdana:v:55:y:2011:i:12:p:3304-3310
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947311002192
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Sadanori Konishi, 2004. "Bayesian information criteria and smoothing parameter selection in radial basis function networks," Biometrika, Biometrika Trust, vol. 91(1), pages 27-43, March.
    2. Ferraty, Frédéric & Vieu, Philippe, 2009. "Additive prediction and boosting for functional data," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1400-1413, February.
    3. Jianqing Fan & Runze Li, 2004. "New Estimation and Model Selection Procedures for Semiparametric Modeling in Longitudinal Data Analysis," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 710-723, January.
    4. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    5. F. Ferraty & P. Hall & P. Vieu, 2010. "Most-predictive design points for functional data predictors," Biometrika, Biometrika Trust, vol. 97(4), pages 807-824.
    6. Yuko Araki & Sadanori Konishi & Shuichi Kawano & Hidetoshi Matsui, 2009. "Functional regression modeling via regularized Gaussian basis expansions," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 61(4), pages 811-833, December.
    7. Jianwen Cai & Jianqing Fan & Runze Li & Haibo Zhou, 2005. "Variable selection for multivariate failure time data," Biometrika, Biometrika Trust, vol. 92(2), pages 303-316, June.
    8. Hansheng Wang & Runze Li & Chih-Ling Tsai, 2007. "Tuning parameter selectors for the smoothly clipped absolute deviation method," Biometrika, Biometrika Trust, vol. 94(3), pages 553-568.
    9. Ming Yuan & Yi Lin, 2006. "Model selection and estimation in regression with grouped variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(1), pages 49-67, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Crawford, F. & Watling, D.P. & Connors, R.D., 2017. "A statistical method for estimating predictable differences between daily traffic flow profiles," Transportation Research Part B: Methodological, Elsevier, vol. 95(C), pages 196-213.
    2. Gregorutti, Baptiste & Michel, Bertrand & Saint-Pierre, Philippe, 2015. "Grouped variable importance with random forests and application to multiple functional data analysis," Computational Statistics & Data Analysis, Elsevier, vol. 90(C), pages 15-35.
    3. Belli, Edoardo, 2022. "Smoothly adaptively centered ridge estimator," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    4. Mirosław Krzyśko & Łukasz Smaga, 2017. "An Application Of Functional Multivariate Regression Model To Multiclass Classification," Statistics in Transition New Series, Polish Statistical Association, vol. 18(3), pages 433-442, September.
    5. Aneiros, Germán & Novo, Silvia & Vieu, Philippe, 2022. "Variable selection in functional regression models: A review," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    6. Mingotti, Nicola & Lillo Rodríguez, Rosa Elvira & Romo, Juan, 2013. "Lasso variable selection in functional regression," DES - Working Papers. Statistics and Econometrics. WS ws131413, Universidad Carlos III de Madrid. Departamento de Estadística.
    7. Łukasz Smaga & Hidetoshi Matsui, 2018. "A note on variable selection in functional regression via random subspace method," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(3), pages 455-477, August.
    8. Andreas Groll & Trevor Hastie & Gerhard Tutz, 2017. "Selection of effects in Cox frailty models by regularization methods," Biometrics, The International Biometric Society, vol. 73(3), pages 846-856, September.
    9. Matsui, Hidetoshi, 2014. "Variable and boundary selection for functional data via multiclass logistic regression modeling," Computational Statistics & Data Analysis, Elsevier, vol. 78(C), pages 176-185.
    10. Collazos, Julian A.A. & Dias, Ronaldo & Zambom, Adriano Z., 2016. "Consistent variable selection for functional regression models," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 63-71.
    11. Fan, Zhaohu & Reimherr, Matthew, 2017. "High-dimensional adaptive function-on-scalar regression," Econometrics and Statistics, Elsevier, vol. 1(C), pages 167-183.
    12. Mirshani, Ardalan & Reimherr, Matthew, 2021. "Adaptive function-on-scalar regression with a smoothing elastic net," Journal of Multivariate Analysis, Elsevier, vol. 185(C).
    13. Krzyśko Mirosław & Smaga Łukasz, 2017. "An Application of Functional Multivariate Regression Model to Multiclass Classification," Statistics in Transition New Series, Polish Statistical Association, vol. 18(3), pages 433-442, September.
    14. Huang, Lele & Zhao, Junlong & Wang, Huiwen & Wang, Siyang, 2016. "Robust shrinkage estimation and selection for functional multiple linear model through LAD loss," Computational Statistics & Data Analysis, Elsevier, vol. 103(C), pages 384-400.
    15. Sanying Feng & Menghan Zhang & Tiejun Tong, 2022. "Variable selection for functional linear models with strong heredity constraint," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(2), pages 321-339, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matsui, Hidetoshi, 2014. "Variable and boundary selection for functional data via multiclass logistic regression modeling," Computational Statistics & Data Analysis, Elsevier, vol. 78(C), pages 176-185.
    2. Joseph G. Ibrahim & Hongtu Zhu & Ramon I. Garcia & Ruixin Guo, 2011. "Fixed and Random Effects Selection in Mixed Effects Models," Biometrics, The International Biometric Society, vol. 67(2), pages 495-503, June.
    3. Peng, Heng & Lu, Ying, 2012. "Model selection in linear mixed effect models," Journal of Multivariate Analysis, Elsevier, vol. 109(C), pages 109-129.
    4. Xin Cheng & Wenbin Lu & Mengling Liu, 2015. "Identification of homogeneous and heterogeneous variables in pooled cohort studies," Biometrics, The International Biometric Society, vol. 71(2), pages 397-403, June.
    5. Xingwei Tong & Xin He & Liuquan Sun & Jianguo Sun, 2009. "Variable Selection for Panel Count Data via Non‐Concave Penalized Estimating Function," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 36(4), pages 620-635, December.
    6. Tang, Yanlin & Wang, Huixia Judy & Zhu, Zhongyi, 2013. "Variable selection in quantile varying coefficient models with longitudinal data," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 435-449.
    7. Hu, Jianhua & Xin, Xin & You, Jinhong, 2014. "Model determination and estimation for the growth curve model via group SCAD penalty," Journal of Multivariate Analysis, Elsevier, vol. 124(C), pages 199-213.
    8. Shuichi Kawano, 2014. "Selection of tuning parameters in bridge regression models via Bayesian information criterion," Statistical Papers, Springer, vol. 55(4), pages 1207-1223, November.
    9. G. Aneiros & P. Vieu, 2016. "Sparse nonparametric model for regression with functional covariate," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 28(4), pages 839-859, October.
    10. Zhang, Tao & Zhang, Qingzhao & Wang, Qihua, 2014. "Model detection for functional polynomial regression," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 183-197.
    11. Xiao Ni & Daowen Zhang & Hao Helen Zhang, 2010. "Variable Selection for Semiparametric Mixed Models in Longitudinal Studies," Biometrics, The International Biometric Society, vol. 66(1), pages 79-88, March.
    12. Feng, Sanying & Lian, Heng & Xue, Liugen, 2016. "A new nested Cholesky decomposition and estimation for the covariance matrix of bivariate longitudinal data," Computational Statistics & Data Analysis, Elsevier, vol. 102(C), pages 98-109.
    13. Fei Jin & Lung-fei Lee, 2018. "Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices," Econometrics, MDPI, vol. 6(1), pages 1-24, February.
    14. Feng, Zhenghui & Zhu, Lixing, 2012. "An alternating determination–optimization approach for an additive multi-index model," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1981-1993.
    15. Guang Cheng & Hao Zhang & Zuofeng Shang, 2015. "Sparse and efficient estimation for partial spline models with increasing dimension," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 67(1), pages 93-127, February.
    16. Zhangong Zhou & Rong Jiang & Weimin Qian, 2013. "LAD variable selection for linear models with randomly censored data," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(2), pages 287-300, February.
    17. Ngai Hang Chan & Linhao Gao & Wilfredo Palma, 2022. "Simultaneous variable selection and structural identification for time‐varying coefficient models," Journal of Time Series Analysis, Wiley Blackwell, vol. 43(4), pages 511-531, July.
    18. Qingliang Fan & Yaqian Wu, 2020. "Endogenous Treatment Effect Estimation with some Invalid and Irrelevant Instruments," Papers 2006.14998, arXiv.org.
    19. Jin, Fei & Lee, Lung-fei, 2018. "Irregular N2SLS and LASSO estimation of the matrix exponential spatial specification model," Journal of Econometrics, Elsevier, vol. 206(2), pages 336-358.
    20. Alan T. K. Wan & Jinhong You & Riquan Zhang, 2016. "A Seemingly Unrelated Nonparametric Additive Model with Autoregressive Errors," Econometric Reviews, Taylor & Francis Journals, vol. 35(5), pages 894-928, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:55:y:2011:i:12:p:3304-3310. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.