IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v189y2022ics0047259x2100141x.html
   My bibliography  Save this article

Feature extraction for functional time series: Theory and application to NIR spectroscopy data

Author

Listed:
  • Yang, Yang
  • Yang, Yanrong
  • Shang, Han Lin

Abstract

We propose a novel method to extract global and local features of functional time series. The global features concerning the dominant modes of variation over the entire function domain, and local features of function variations over particular short intervals within function domain, are both important in functional data analysis. Functional principal component analysis (FPCA), though a key feature extraction tool, only focus on capturing the dominant global features, neglecting highly localized features. We introduce a FPCA-BTW method that initially extracts global features of functional data via FPCA, and then extracts local features by block thresholding of wavelet (BTW) coefficients. Using Monte Carlo simulations, along with an empirical application on near-infrared spectroscopy data of wood panels, we illustrate that the proposed method outperforms competing methods including FPCA and sparse FPCA in the estimation functional processes. Moreover, extracted local features inheriting serial dependence of the original functional time series contribute to more accurate forecasts. Finally, we develop asymptotic properties of FPCA-BTW estimators, discovering the interaction between convergence rates of global and local features.

Suggested Citation

  • Yang, Yang & Yang, Yanrong & Shang, Han Lin, 2022. "Feature extraction for functional time series: Theory and application to NIR spectroscopy data," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
  • Handle: RePEc:eee:jmvana:v:189:y:2022:i:c:s0047259x2100141x
    DOI: 10.1016/j.jmva.2021.104863
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X2100141X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2021.104863?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jonathan E. Gellar & Elizabeth Colantuoni & Dale M. Needham & Ciprian M. Crainiceanu, 2014. "Variable-Domain Functional Regression for Modeling ICU Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(508), pages 1425-1439, December.
    2. Johnstone, Iain M. & Lu, Arthur Yu, 2009. "On Consistency and Sparsity for Principal Components Analysis in High Dimensions," Journal of the American Statistical Association, American Statistical Association, vol. 104(486), pages 682-693.
    3. Gneiting, Tilmann, 2011. "Making and Evaluating Point Forecasts," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 746-762.
    4. Müller, Hans-Georg & Yao, Fang, 2008. "Functional Additive Models," Journal of the American Statistical Association, American Statistical Association, vol. 103(484), pages 1534-1544.
    5. Berrendero, José R. & Cuevas, Antonio & Pateiro-López, Beatriz, 2016. "Shape classification based on interpoint distance distributions," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 237-247.
    6. Jianqing Fan & Yuan Liao & Martina Mincheva, 2013. "Large covariance estimation by thresholding principal orthogonal complements," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(4), pages 603-680, September.
    7. Degui Li & Peter M. Robinson & Han Lin Shang, 2020. "Long-Range Dependent Curve Time Series," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(530), pages 957-971, April.
    8. Shang, Han Lin & Hyndman, Rob.J., 2011. "Nonparametric time series forecasting with dynamic updating," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 81(7), pages 1310-1324.
    9. Horváth, Lajos & Rice, Gregory & Whipple, Stephen, 2016. "Adaptive bandwidth selection in the long run covariance estimator of functional time series," Computational Statistics & Data Analysis, Elsevier, vol. 100(C), pages 676-693.
    10. Klepsch, J. & Klüppelberg, C. & Wei, T., 2017. "Prediction of functional ARMA processes with an application to traffic data," Econometrics and Statistics, Elsevier, vol. 1(C), pages 128-149.
    11. Andrews, Donald W K, 1991. "Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimation," Econometrica, Econometric Society, vol. 59(3), pages 817-858, May.
    12. Peter Hall & Céline Vial, 2006. "Assessing the finite dimensionality of functional data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(4), pages 689-705, September.
    13. Seung C. Ahn & Alex R. Horenstein, 2013. "Eigenvalue Ratio Test for the Number of Factors," Econometrica, Econometric Society, vol. 81(3), pages 1203-1227, May.
    14. Berrendero, José R. & Bueno-Larraz, Beatriz & Cuevas, Antonio, 2019. "An RKHS model for variable selection in functional linear regression," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 25-45.
    15. Antonio Cuevas & Manuel Febrero & Ricardo Fraiman, 2007. "Robust estimation and classification for functional data via projection-based depth notions," Computational Statistics, Springer, vol. 22(3), pages 481-496, September.
    16. Huang, Jianhua Z. & Shen, Haipeng & Buja, Andreas, 2009. "The Analysis of Two-Way Functional Data Using Two-Way Regularized Singular Value Decompositions," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1609-1620.
    17. Clifford Lam & Qiwei Yao & Neil Bathia, 2011. "Estimation of latent factors for high-dimensional time series," Biometrika, Biometrika Trust, vol. 98(4), pages 901-918.
    18. Lam, Clifford & Yao, Qiwei & Bathia, Neil, 2011. "Estimation of latent factors for high-dimensional time series," LSE Research Online Documents on Economics 31549, London School of Economics and Political Science, LSE Library.
    19. Hans-Georg Müller & Yichao Wu & Fang Yao, 2013. "Continuously additive models for nonlinear functional regression," Biometrika, Biometrika Trust, vol. 100(3), pages 607-622.
    20. Alexander Aue & Diogo Dubart Norinho & Siegfried Hörmann, 2015. "On the Prediction of Stationary Functional Time Series," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 378-392, March.
    21. Novo, Silvia & Aneiros, Germán & Vieu, Philippe, 2021. "A kNN procedure in semiparametric functional data analysis," Statistics & Probability Letters, Elsevier, vol. 171(C).
    22. Kuhnt, Sonja & Rehage, André, 2016. "An angle-based multivariate functional pseudo-depth for shape outlier detection," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 325-340.
    23. Siegfried Hörmann & Łukasz Kidziński & Marc Hallin, 2015. "Dynamic functional principal components," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 77(2), pages 319-348, March.
    24. Shang, Han Lin, 2019. "Dynamic Principal Component Regression: Application To Age-Specific Mortality Forecasting," ASTIN Bulletin, Cambridge University Press, vol. 49(3), pages 619-645, September.
    25. Silvia Novo & Germán Aneiros & Philippe Vieu, 2019. "Automatic and location-adaptive estimation in functional single-index regression," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 31(2), pages 364-392, April.
    26. Gneiting, Tilmann & Raftery, Adrian E., 2007. "Strictly Proper Scoring Rules, Prediction, and Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 359-378, March.
    27. Aneiros, Germán & Vieu, Philippe, 2014. "Variable selection in infinite-dimensional problems," Statistics & Probability Letters, Elsevier, vol. 94(C), pages 12-20.
    28. Piotr Kokoszka & Matthew Reimherr, 2013. "Determining the order of the functional autoregressive model," Journal of Time Series Analysis, Wiley Blackwell, vol. 34(1), pages 116-129, January.
    29. Berkes, István & Horváth, Lajos & Rice, Gregory, 2016. "On the asymptotic normality of kernel estimators of the long run covariance of functional time series," Journal of Multivariate Analysis, Elsevier, vol. 144(C), pages 150-175.
    30. Klepsch, J. & Klüppelberg, C., 2017. "An innovations algorithm for the prediction of functional linear processes," Journal of Multivariate Analysis, Elsevier, vol. 155(C), pages 252-271.
    31. Lam, Clifford & Yao, Qiwei, 2012. "Factor modeling for high-dimensional time series: inference for the number of factors," LSE Research Online Documents on Economics 45684, London School of Economics and Political Science, LSE Library.
    32. Gregory Rice & Han Lin Shang, 2017. "A Plug-in Bandwidth Selection Procedure for Long-Run Covariance Estimation with Stationary Functional Time Series," Journal of Time Series Analysis, Wiley Blackwell, vol. 38(4), pages 591-609, July.
    33. Aneiros, Germán & Cao, Ricardo & Fraiman, Ricardo & Genest, Christian & Vieu, Philippe, 2019. "Recent advances in functional data analysis and high-dimensional statistics," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 3-9.
    34. Peter Hall & Giles Hooker, 2016. "Truncated linear models for functional data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(3), pages 637-653, June.
    35. Antoniadis A. & Fan J., 2001. "Regularization of Wavelet Approximations," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 939-967, September.
    36. Chiou, Jeng-Min & Yang, Ya-Fang & Chen, Yu-Ting, 2016. "Multivariate functional linear regression and prediction," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 301-312.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Cees Diks & Bram Wouters, 2023. "Noise reduction for functional time series," Papers 2307.02154, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cees Diks & Bram Wouters, 2023. "Noise reduction for functional time series," Papers 2307.02154, arXiv.org.
    2. Gao, Yuan & Shang, Han Lin & Yang, Yanrong, 2019. "High-dimensional functional time series forecasting: An application to age-specific mortality rates," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 232-243.
    3. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    4. Elías, Antonio & Jiménez, Raúl & Shang, Han Lin, 2022. "On projection methods for functional time series forecasting," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    5. Han Lin Shang & Yang Yang & Fearghal Kearney, 2019. "Intraday forecasts of a volatility index: functional time series methods with dynamic updating," Annals of Operations Research, Springer, vol. 282(1), pages 331-354, November.
    6. Barigozzi, Matteo & Trapani, Lorenzo, 2020. "Sequential testing for structural stability in approximate factor models," Stochastic Processes and their Applications, Elsevier, vol. 130(8), pages 5149-5187.
    7. Shang, Han Lin & Kearney, Fearghal, 2022. "Dynamic functional time-series forecasts of foreign exchange implied volatility surfaces," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1025-1049.
    8. Han Lin Shang & Yang Yang, 2021. "Forecasting Australian subnational age-specific mortality rates," Journal of Population Research, Springer, vol. 38(1), pages 1-24, March.
    9. Han Lin Shang & Rob J Hyndman, 2016. "Grouped functional time series forecasting: An application to age-specific mortality rates," Monash Econometrics and Business Statistics Working Papers 4/16, Monash University, Department of Econometrics and Business Statistics.
    10. Shang, Han Lin, 2017. "Functional time series forecasting with dynamic updating: An application to intraday particulate matter concentration," Econometrics and Statistics, Elsevier, vol. 1(C), pages 184-200.
    11. Yuefeng Han & Rong Chen & Dan Yang & Cun-Hui Zhang, 2020. "Tensor Factor Model Estimation by Iterative Projection," Papers 2006.02611, arXiv.org, revised May 2022.
    12. Zhaoxing Gao & Ruey S. Tsay, 2020. "A Two-Way Transformed Factor Model for Matrix-Variate Time Series," Papers 2011.09029, arXiv.org.
    13. Chen Tang & Yanlin Shi, 2021. "Forecasting High-Dimensional Financial Functional Time Series: An Application to Constituent Stocks in Dow Jones Index," JRFM, MDPI, vol. 14(8), pages 1-13, July.
    14. Shang Han Lin, 2020. "A Comparison of Hurst Exponent Estimators in Long-range Dependent Curve Time Series," Journal of Time Series Econometrics, De Gruyter, vol. 12(1), pages 1-39, January.
    15. Vieu, Philippe, 2018. "On dimension reduction models for functional data," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 134-138.
    16. Yuefeng Han & Rong Chen & Cun-Hui Zhang, 2020. "Rank Determination in Tensor Factor Model," Papers 2011.07131, arXiv.org, revised May 2022.
    17. Yoshimasa Uematsu & Takashi Yamagata, 2019. "Estimation of Weak Factor Models," ISER Discussion Paper 1053r, Institute of Social and Economic Research, Osaka University, revised Mar 2020.
    18. Han Lin Shang & Kaiying Ji, 2023. "Forecasting intraday financial time series with sieve bootstrapping and dynamic updating," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(8), pages 1973-1988, December.
    19. Yuefeng Han & Cun-Hui Zhang & Rong Chen, 2021. "CP Factor Model for Dynamic Tensors," Papers 2110.15517, arXiv.org.
    20. Gao, Zhaoxing & Tsay, Ruey S., 2023. "A Two-Way Transformed Factor Model for Matrix-Variate Time Series," Econometrics and Statistics, Elsevier, vol. 27(C), pages 83-101.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:189:y:2022:i:c:s0047259x2100141x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.