IDEAS home Printed from https://ideas.repec.org/a/spr/stmapp/v27y2018i3d10.1007_s10260-018-0421-7.html
   My bibliography  Save this article

A note on variable selection in functional regression via random subspace method

Author

Listed:
  • Łukasz Smaga

    (Adam Mickiewicz University)

  • Hidetoshi Matsui

    (Shiga University)

Abstract

Variable selection problem is one of the most important tasks in regression analysis, especially in a high-dimensional setting. In this paper, we study this problem in the context of scalar response functional regression model, which is a linear model with scalar response and functional regressors. The functional model can be represented by certain multiple linear regression model via basis expansions of functional variables. Based on this model and random subspace method of Mielniczuk and Teisseyre (Comput Stat Data Anal 71:725–742, 2014), two simple variable selection procedures for scalar response functional regression model are proposed. The final functional model is selected by using generalized information criteria. Monte Carlo simulation studies conducted and a real data example show very satisfactory performance of new variable selection methods under finite samples. Moreover, they suggest that considered procedures outperform solutions found in the literature in terms of correctly selected model, false discovery rate control and prediction error.

Suggested Citation

  • Łukasz Smaga & Hidetoshi Matsui, 2018. "A note on variable selection in functional regression via random subspace method," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(3), pages 455-477, August.
  • Handle: RePEc:spr:stmapp:v:27:y:2018:i:3:d:10.1007_s10260-018-0421-7
    DOI: 10.1007/s10260-018-0421-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10260-018-0421-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10260-018-0421-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Mielniczuk, Jan & Teisseyre, Paweł, 2014. "Using random subspace method for prediction and variable importance assessment in linear regression," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 725-742.
    2. Mia Hubert & Peter Rousseeuw & Pieter Segaert, 2015. "Multivariate functional outlier detection," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 177-202, July.
    3. Paweł Teisseyre & Robert A. Kłopotek & Jan Mielniczuk, 2016. "Random Subspace Method for high-dimensional regression with the R package regRSM," Computational Statistics, Springer, vol. 31(3), pages 943-972, September.
    4. Manuel Febrero-Bande & Wenceslao González-Manteiga, 2013. "Generalized additive models for functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 22(2), pages 278-292, June.
    5. Gareth M. James, 2002. "Generalized linear models with functional predictors," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(3), pages 411-432, August.
    6. Matsui, Hidetoshi, 2014. "Variable and boundary selection for functional data via multiclass logistic regression modeling," Computational Statistics & Data Analysis, Elsevier, vol. 78(C), pages 176-185.
    7. Collazos, Julian A.A. & Dias, Ronaldo & Zambom, Adriano Z., 2016. "Consistent variable selection for functional regression models," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 63-71.
    8. Aldo Goia & Philippe Vieu, 2015. "A partitioned Single Functional Index Model," Computational Statistics, Springer, vol. 30(3), pages 673-692, September.
    9. Aneiros, Germán & Vieu, Philippe, 2014. "Variable selection in infinite-dimensional problems," Statistics & Probability Letters, Elsevier, vol. 94(C), pages 12-20.
    10. Li, Yehua & Hsing, Tailen, 2007. "On rates of convergence in functional linear regression," Journal of Multivariate Analysis, Elsevier, vol. 98(9), pages 1782-1804, October.
    11. Mia Hubert & Peter Rousseeuw & Pieter Segaert, 2015. "Rejoinder to ‘multivariate functional outlier detection’," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 269-277, July.
    12. Matsui, Hidetoshi & Konishi, Sadanori, 2011. "Variable selection for functional regression models via the L1 regularization," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3304-3310, December.
    13. Peter Radchenko & Xinghao Qiao & Gareth M. James, 2015. "Index Models for Sparsely Sampled Functional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 824-836, June.
    14. Tomasz Górecki & Łukasz Smaga, 2015. "A comparison of tests for the one-way ANOVA problem for functional data," Computational Statistics, Springer, vol. 30(4), pages 987-1010, December.
    15. Kokoszka, Piotr & Oja, Hanny & Park, Byeong & Sangalli, Laura, 2017. "Special issue on functional data analysis," Econometrics and Statistics, Elsevier, vol. 1(C), pages 99-100.
    16. Chiou, Jeng-Min & Yang, Ya-Fang & Chen, Yu-Ting, 2016. "Multivariate functional linear regression and prediction," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 301-312.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Aneiros, Germán & Novo, Silvia & Vieu, Philippe, 2022. "Variable selection in functional regression models: A review," Journal of Multivariate Analysis, Elsevier, vol. 188(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chiou, Jeng-Min & Yang, Ya-Fang & Chen, Yu-Ting, 2016. "Multivariate functional linear regression and prediction," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 301-312.
    2. Aneiros, Germán & Novo, Silvia & Vieu, Philippe, 2022. "Variable selection in functional regression models: A review," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    3. Vieu, Philippe, 2018. "On dimension reduction models for functional data," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 134-138.
    4. Qiu, Zhiping & Chen, Jianwei & Zhang, Jin-Ting, 2021. "Two-sample tests for multivariate functional data with applications," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    5. Mirosław Krzyśko & Łukasz Smaga, 2017. "An Application Of Functional Multivariate Regression Model To Multiclass Classification," Statistics in Transition New Series, Polish Statistical Association, vol. 18(3), pages 433-442, September.
    6. Manuel Febrero-Bande & Pedro Galeano & Wenceslao González-Manteiga, 2017. "Functional Principal Component Regression and Functional Partial Least-squares Regression: An Overview and a Comparative Study," International Statistical Review, International Statistical Institute, vol. 85(1), pages 61-83, April.
    7. Krzyśko Mirosław & Smaga Łukasz, 2017. "An Application of Functional Multivariate Regression Model to Multiclass Classification," Statistics in Transition New Series, Polish Statistical Association, vol. 18(3), pages 433-442, September.
    8. Fraiman, Ricardo & Gimenez, Yanina & Svarc, Marcela, 2016. "Feature selection for functional data," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 191-208.
    9. Kalogridis, Ioannis & Van Aelst, Stefan, 2019. "Robust functional regression based on principal components," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 393-415.
    10. Gregorutti, Baptiste & Michel, Bertrand & Saint-Pierre, Philippe, 2015. "Grouped variable importance with random forests and application to multiple functional data analysis," Computational Statistics & Data Analysis, Elsevier, vol. 90(C), pages 15-35.
    11. Collazos, Julian A.A. & Dias, Ronaldo & Zambom, Adriano Z., 2016. "Consistent variable selection for functional regression models," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 63-71.
    12. Zhu, Hanbing & Zhang, Riquan & Yu, Zhou & Lian, Heng & Liu, Yanghui, 2019. "Estimation and testing for partially functional linear errors-in-variables models," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 296-314.
    13. Francesca Ieva & Anna Paganoni, 2015. "Discussion of “multivariate functional outlier detection” by M. Hubert, P. Rousseeuw and P. Segaert," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 217-221, July.
    14. Febrero-Bande, Manuel & González-Manteiga, Wenceslao & Prallon, Brenda & Saporito, Yuri F., 2023. "Functional classification of bitcoin addresses," Computational Statistics & Data Analysis, Elsevier, vol. 181(C).
    15. Huang, Lele & Zhao, Junlong & Wang, Huiwen & Wang, Siyang, 2016. "Robust shrinkage estimation and selection for functional multiple linear model through LAD loss," Computational Statistics & Data Analysis, Elsevier, vol. 103(C), pages 384-400.
    16. Febrero-Bande, Manuel & Galeano, Pedro & González-Manteiga, Wenceslao, 2019. "Estimation, imputation and prediction for the functional linear model with scalar response with responses missing at random," Computational Statistics & Data Analysis, Elsevier, vol. 131(C), pages 91-103.
    17. Boente, Graciela & Parada, Daniela, 2023. "Robust estimation for functional quadratic regression models," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    18. Alvarez, Agustín & Boente, Graciela & Kudraszow, Nadia, 2019. "Robust sieve estimators for functional canonical correlation analysis," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 46-62.
    19. Francesca Ieva & Anna Maria Paganoni, 2020. "Component-wise outlier detection methods for robustifying multivariate functional samples," Statistical Papers, Springer, vol. 61(2), pages 595-614, April.
    20. Aneiros, Germán & Cao, Ricardo & Fraiman, Ricardo & Genest, Christian & Vieu, Philippe, 2019. "Recent advances in functional data analysis and high-dimensional statistics," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 3-9.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stmapp:v:27:y:2018:i:3:d:10.1007_s10260-018-0421-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.