IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v39y2024i4d10.1007_s00180-024-01489-y.html
   My bibliography  Save this article

Variable selection for multivariate functional data via conditional correlation learning

Author

Listed:
  • Keyao Wang

    (Beihang University
    Beijing Key Laboratory of Emergency Support Simulation Technologies of City Operation)

  • Huiwen Wang

    (Beihang University
    Management and Decision (Beihang University))

  • Shanshan Wang

    (Beihang University
    Beijing Key Laboratory of Emergency Support Simulation Technologies of City Operation)

  • Lihong Wang

    (National Computer Network Emergency Response Technical Team/Coordination Center of China)

Abstract

Variable selection involves selecting truly important predictors from p-dimensional multivariate functional predictors in functional predictive models. In this paper, a variable selection method is designed for scalar-on-function predictions entangled with nonlinear joint associations among scalar response and multiple functional predictors. First, a nonparametric functional nonlinear conditional correlation coefficient, namely, the FunNCC coefficient, is proposed to measure complex dependencies, including the nonmonotonic marginal dependence, along with the conditional associations of redundancy, complement, and interaction. Then, a model-free feature ordering and selection method is designed, where the FunNCC is utilized to rank relevance, enabling the selection of a subset of predictors with the strongest joint dependence. Since this method allows for quantitatively evaluating the contributions of predictors in explaining responses, it achieves moderate model interpretability. Finally, extensive simulation studies and two real-data cases involving air pollution regression and hand gesture recognition are conducted to evaluate the finite sample performance of the proposed method, and the results show that the proposed FunNCC and variable selection methods outperform state-of-the-art baselines.

Suggested Citation

  • Keyao Wang & Huiwen Wang & Shanshan Wang & Lihong Wang, 2024. "Variable selection for multivariate functional data via conditional correlation learning," Computational Statistics, Springer, vol. 39(4), pages 2375-2412, June.
  • Handle: RePEc:spr:compst:v:39:y:2024:i:4:d:10.1007_s00180-024-01489-y
    DOI: 10.1007/s00180-024-01489-y
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-024-01489-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-024-01489-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Aneiros, Germán & Novo, Silvia & Vieu, Philippe, 2022. "Variable selection in functional regression models: A review," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    2. Yating Wan & Minya Xu & Hui Huang & Song Xi Chen, 2021. "A spatio‐temporal model for the analysis and prediction of fine particulate matter concentration in Beijing," Environmetrics, John Wiley & Sons, Ltd., vol. 32(1), February.
    3. C E Lee & X Zhang & X Shao, 2020. "Testing conditional mean independence for functional data," Biometrika, Biometrika Trust, vol. 107(2), pages 331-346.
    4. Ping Yu & Zhongzhan Zhang & Jiang Du, 2016. "A test of linearity in partial functional linear regression," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 79(8), pages 953-969, November.
    5. Sarah Brockhaus & Andreas Fuest & Andreas Mayr & Sonja Greven, 2018. "Signal regression models for location, scale and shape with an application to stock returns," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 67(3), pages 665-686, April.
    6. Collazos, Julian A.A. & Dias, Ronaldo & Zambom, Adriano Z., 2016. "Consistent variable selection for functional regression models," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 63-71.
    7. Xiaofeng Shao & Jingsi Zhang, 2014. "Martingale Difference Correlation and Its Use in High-Dimensional Variable Screening," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 1302-1318, September.
    8. Li, Yehua & Qiu, Yumou & Xu, Yuhang, 2022. "From multivariate to functional data analysis: Fundamentals, recent developments, and emerging areas," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    9. Manuel Febrero-Bande & Wenceslao González-Manteiga & Manuel Oviedo de la Fuente, 2019. "Variable selection in functional additive regression models," Computational Statistics, Springer, vol. 34(2), pages 469-487, June.
    10. Zhu, Hanbing & Li, Rui & Zhang, Riquan & Lian, Heng, 2020. "Nonlinear functional canonical correlation analysis via distance covariance," Journal of Multivariate Analysis, Elsevier, vol. 180(C).
    11. Park, So Young & Xiao, Luo & Willbur, Jayson D. & Staicu, Ana-Maria & Jumbe, N. L’ntshotsholé, 2018. "A joint design for functional data with application to scheduling ultrasound scans," Computational Statistics & Data Analysis, Elsevier, vol. 122(C), pages 101-114.
    12. Lai, Tingyu & Zhang, Zhongzhan & Wang, Yafei, 2021. "A kernel-based measure for conditional mean dependence," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    13. Sanying Feng & Menghan Zhang & Tiejun Tong, 2022. "Variable selection for functional linear models with strong heredity constraint," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(2), pages 321-339, April.
    14. Sourav Chatterjee, 2021. "A New Coefficient of Correlation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(536), pages 2009-2022, October.
    15. Fang Yao & Hans-Georg Müller, 2010. "Functional quadratic regression," Biometrika, Biometrika Trust, vol. 97(1), pages 49-64.
    16. Matsui, Hidetoshi & Konishi, Sadanori, 2011. "Variable selection for functional regression models via the L1 regularization," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3304-3310, December.
    17. Dehan Kong & Kaijie Xue & Fang Yao & Hao H. Zhang, 2016. "Partially functional linear regression in high dimensions," Biometrika, Biometrika Trust, vol. 103(1), pages 147-159.
    18. Huang, Lele & Zhao, Junlong & Wang, Huiwen & Wang, Siyang, 2016. "Robust shrinkage estimation and selection for functional multiple linear model through LAD loss," Computational Statistics & Data Analysis, Elsevier, vol. 103(C), pages 384-400.
    19. Azadkia, Mona & Chatterjee, Sourav, 2021. "A simple measure of conditional dependence," LSE Research Online Documents on Economics 125584, London School of Economics and Political Science, LSE Library.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sanying Feng & Menghan Zhang & Tiejun Tong, 2022. "Variable selection for functional linear models with strong heredity constraint," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(2), pages 321-339, April.
    2. Aneiros, Germán & Novo, Silvia & Vieu, Philippe, 2022. "Variable selection in functional regression models: A review," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    3. Yu, Dengdeng & Zhang, Li & Mizera, Ivan & Jiang, Bei & Kong, Linglong, 2019. "Sparse wavelet estimation in quantile regression with multiple functional predictors," Computational Statistics & Data Analysis, Elsevier, vol. 136(C), pages 12-29.
    4. Liang, Weijuan & Zhang, Qingzhao & Ma, Shuangge, 2023. "Locally sparse quantile estimation for a partially functional interaction model," Computational Statistics & Data Analysis, Elsevier, vol. 186(C).
    5. Lai, Tingyu & Zhang, Zhongzhan & Wang, Yafei, 2021. "A kernel-based measure for conditional mean dependence," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    6. Chi-Kuang Yeh & Peijun Sang, 2025. "Variable Selection in Multivariate Functional Linear Regression," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 17(1), pages 17-34, April.
    7. Weice Sun & Jiaqi Xu & Tao Liu, 2025. "Partially Functional Linear Regression Based on Gaussian Process Prior and Ensemble Learning," Mathematics, MDPI, vol. 13(5), pages 1-25, March.
    8. Mirosław Krzyśko & Łukasz Smaga, 2017. "An Application Of Functional Multivariate Regression Model To Multiclass Classification," Statistics in Transition New Series, Polish Statistical Association, vol. 18(3), pages 433-442, September.
    9. Yang, Seong J. & Shin, Hyejin & Lee, Sang Han & Lee, Seokho, 2020. "Functional linear regression model with randomly censored data: Predicting conversion time to Alzheimer ’s disease," Computational Statistics & Data Analysis, Elsevier, vol. 150(C).
    10. Xiongtao Dai & Zhenhua Lin & Hans‐Georg Müller, 2021. "Modeling sparse longitudinal data on Riemannian manifolds," Biometrics, The International Biometric Society, vol. 77(4), pages 1328-1341, December.
    11. Yuping Hu & Siyu Wu & Sanying Feng & Junliang Jin, 2020. "Estimation in Partial Functional Linear Spatial Autoregressive Model," Mathematics, MDPI, vol. 8(10), pages 1-12, October.
    12. Łukasz Smaga & Hidetoshi Matsui, 2018. "A note on variable selection in functional regression via random subspace method," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(3), pages 455-477, August.
    13. Zhang, Xiaochen & Zhang, Qingzhao & Ma, Shuangge & Fang, Kuangnan, 2022. "Subgroup analysis for high-dimensional functional regression," Journal of Multivariate Analysis, Elsevier, vol. 192(C).
    14. Rongjie Jiang & Liming Wang & Yang Bai, 2021. "Optimal model averaging estimator for semi-functional partially linear models," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(2), pages 167-194, February.
    15. Tang, Qingguo & Tu, Wei & Kong, Linglong, 2023. "Estimation for partial functional partially linear additive model," Computational Statistics & Data Analysis, Elsevier, vol. 177(C).
    16. Fanrong Zhao & Baoxue Zhang, 2024. "A U-Statistic for Testing the Lack of Dependence in Functional Partially Linear Regression Model," Mathematics, MDPI, vol. 12(16), pages 1-23, August.
    17. Ma, Haiqiang & Li, Ting & Zhu, Hongtu & Zhu, Zhongyi, 2019. "Quantile regression for functional partially linear model in ultra-high dimensions," Computational Statistics & Data Analysis, Elsevier, vol. 129(C), pages 135-147.
    18. Chang, Jinyuan & Chen, Cheng & Qiao, Xinghao & Yao, Qiwei, 2023. "An autocovariance-based learning framework for high-dimensional functional time series," LSE Research Online Documents on Economics 117910, London School of Economics and Political Science, LSE Library.
    19. Liebl, Dominik & Walders, Fabian, 2019. "Parameter regimes in partial functional panel regression," Econometrics and Statistics, Elsevier, vol. 11(C), pages 105-115.
    20. Qi, Xin & Luo, Ruiyan, 2018. "Function-on-function regression with thousands of predictive curves," Journal of Multivariate Analysis, Elsevier, vol. 163(C), pages 51-66.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:39:y:2024:i:4:d:10.1007_s00180-024-01489-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.