Printed from https://ideas.repec.org/a/spr/stpapr/v61y2020i2d10.1007_s00362-017-0953-1.html

Component-wise outlier detection methods for robustifying multivariate functional samples

Author

Listed:
  • Francesca Ieva

    (Politecnico di Milano)

  • Anna Maria Paganoni

    (Politecnico di Milano)

Abstract

We propose a new method for detecting outliers in multivariate functional data. We exploit the joint use of two different depth measures and generalize the outliergram to the multivariate functional framework, aiming to detect and discard both shape and magnitude outliers. The main application is robustifying reference samples composed of G different known groups, to be used, for example, in classification procedures. We assess the method's performance by means of a simulation study, comparing it with other outlier detection methods. Finally, we consider a real dataset: we classify data by minimizing a suitable distance from the centers of the reference groups. We compare the performance of supervised classification on test sets when the algorithm is trained on the original dataset and on the robustified one, respectively.
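
To make the ingredients of the abstract concrete, the following Python sketch illustrates depth-based screening and nearest-center classification for multivariate functional data. It is not the authors' procedure: the abstract does not name the two depth measures or the distance, so the sketch assumes the modified band depth averaged over components with equal weights, a fixed trimming fraction in place of the outliergram rule, and an L2 distance to each group's deepest curve. All function names and the `drop_fraction` parameter are hypothetical.

```python
import numpy as np


def modified_band_depth(curves):
    """Modified band depth (bands built from pairs of curves).

    curves: array of shape (n_curves, n_points), assumed free of ties.
    """
    n = curves.shape[0]
    # Rank (1..n) of every curve at every evaluation point.
    ranks = curves.argsort(axis=0).argsort(axis=0) + 1
    below = ranks - 1          # curves lying under this one at each point
    above = n - ranks          # curves lying over this one at each point
    # Pairs whose band contains the curve: one below and one above,
    # plus the n - 1 pairs that include the curve itself.
    pairs = below * above + (n - 1)
    return pairs.mean(axis=1) / (n * (n - 1) / 2)


def multivariate_depth(sample, weights=None):
    """Weighted average of component-wise depths (assumed aggregation).

    sample: array of shape (n_curves, n_components, n_points).
    """
    p = sample.shape[1]
    w = np.full(p, 1.0 / p) if weights is None else np.asarray(weights, float)
    depths = np.stack(
        [modified_band_depth(sample[:, k, :]) for k in range(p)], axis=1
    )
    return depths @ w


def robustify(sample, drop_fraction=0.1):
    """Hypothetical trimming rule: keep curves whose depth exceeds the
    drop_fraction quantile. A stand-in for the outliergram-based rule."""
    depth = multivariate_depth(sample)
    keep = depth > np.quantile(depth, drop_fraction)
    return sample[keep], keep


def classify(curve, groups):
    """Assign a curve of shape (n_components, n_points) to the group whose
    deepest curve is closest in L2 distance (assumed distance and center)."""
    centers = [g[np.argmax(multivariate_depth(g))] for g in groups]
    dists = [np.sqrt(((curve - c) ** 2).sum()) for c in centers]
    return int(np.argmin(dists))


if __name__ == "__main__":
    t = np.linspace(0.0, 1.0, 20)
    base = np.stack([np.sin(2 * np.pi * t), np.cos(2 * np.pi * t)])
    group_a = base[None] + (np.arange(9) * 0.1)[:, None, None]
    group_b = group_a + 5.0
    # Contaminate group A with one shifted curve, then trim by depth.
    dirty = np.concatenate([group_a, (base + 10.0)[None]])
    clean, keep = robustify(dirty)
    print("kept curves:", keep)
    print("label:", classify(base + 0.45, [clean, group_b]))
```

In this sketch, depth alone cannot tell a magnitude outlier from the most extreme regular curve, so a fixed trimming fraction may discard both.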

Suggested Citation

  • Francesca Ieva & Anna Maria Paganoni, 2020. "Component-wise outlier detection methods for robustifying multivariate functional samples," Statistical Papers, Springer, vol. 61(2), pages 595-614, April.
  • Handle: RePEc:spr:stpapr:v:61:y:2020:i:2:d:10.1007_s00362-017-0953-1
    DOI: 10.1007/s00362-017-0953-1

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00362-017-0953-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00362-017-0953-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mia Hubert & Peter Rousseeuw & Pieter Segaert, 2015. "Multivariate functional outlier detection," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 177-202, July.
    2. Dai, Wenlin & Genton, Marc G., 2019. "Directional outlyingness for multivariate functional data," Computational Statistics & Data Analysis, Elsevier, vol. 131(C), pages 50-65.
    3. Francesca Ieva & Anna Paganoni, 2015. "Discussion of “multivariate functional outlier detection” by M. Hubert, P. Rousseeuw and P. Segaert," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 217-221, July.
    4. Oluwasegun Taiwo Ojo & Antonio Fernández Anta & Rosa E. Lillo & Carlo Sguera, 2022. "Detecting and classifying outliers in big functional data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(3), pages 725-760, September.
    5. Nagy, Stanislav & Ferraty, Frédéric, 2019. "Data depth for measurable noisy random functions," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 95-114.
    6. Davy Paindaveine & Germain Van Bever, 2017. "Halfspace Depths for Scatter, Concentration and Shape Matrices," Working Papers ECARES ECARES 2017-19, ULB -- Universite Libre de Bruxelles.
    7. Martínez-Hernández, Israel & Genton, Marc G. & González-Farías, Graciela, 2019. "Robust depth-based estimation of the functional autoregressive model," Computational Statistics & Data Analysis, Elsevier, vol. 131(C), pages 66-79.
    8. Mia Hubert & Peter Rousseeuw & Pieter Segaert, 2017. "Multivariate and functional classification using depth and distance," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 11(3), pages 445-466, September.
    9. Carlo Sguera & Sara López-Pintado, 2021. "A notion of depth for sparse functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(3), pages 630-649, September.
    10. Dai, Wenlin & Mrkvička, Tomáš & Sun, Ying & Genton, Marc G., 2020. "Functional outlier detection and taxonomy by sequential transformations," Computational Statistics & Data Analysis, Elsevier, vol. 149(C).
    11. Kuhnt, Sonja & Rehage, André, 2016. "An angle-based multivariate functional pseudo-depth for shape outlier detection," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 325-340.
    12. Qiu, Zhiping & Chen, Jianwei & Zhang, Jin-Ting, 2021. "Two-sample tests for multivariate functional data with applications," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    13. Yuan Yan & Marc Genton, 2015. "Discussion of “Multivariate functional outlier detection” by Mia Hubert, Peter Rousseeuw and Pieter Segaert," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 245-251, July.
    14. Weiyi Xie & Sebastian Kurtek & Karthik Bharath & Ying Sun, 2017. "A Geometric Approach to Visualization of Variability in Functional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 979-993, July.
    15. Zhu, Tianming & Zhang, Jin-Ting & Cheng, Ming-Yen, 2022. "One-way MANOVA for functional data via Lawley–Hotelling trace test," Journal of Multivariate Analysis, Elsevier, vol. 192(C).
    16. Jiménez Recaredo, Raúl José & Elías Fernández, Antonio, 2017. "Prediction Bands for Functional Data Based on Depth Measures," DES - Working Papers. Statistics and Econometrics. WS 24606, Universidad Carlos III de Madrid. Departamento de Estadística.
    17. Zhuo Qu & Wenlin Dai & Marc G. Genton, 2021. "Robust functional multivariate analysis of variance with environmental applications," Environmetrics, John Wiley & Sons, Ltd., vol. 32(1), February.
    18. Alvarez, Agustín & Boente, Graciela & Kudraszow, Nadia, 2019. "Robust sieve estimators for functional canonical correlation analysis," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 46-62.
    19. Ana Justel & Marcela Svarc, 2018. "A divisive clustering method for functional data with special consideration of outliers," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(3), pages 637-656, September.
    20. Graciela Boente & Matías Salibián-Barrera, 2021. "Robust functional principal components for sparse longitudinal data," METRON, Springer;Sapienza Università di Roma, vol. 79(2), pages 159-188, August.
