IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v161y2017icp12-31.html
   My bibliography  Save this article

Sparse representation of multivariate extremes with applications to anomaly detection

Author

Listed:
  • Goix, Nicolas
  • Sabourin, Anne
  • Clémençon, Stephan

Abstract

Capturing the dependence structure of multivariate extreme events is a major concern in many fields involving the management of risks stemming from multiple sources, e.g., portfolio monitoring, insurance, environmental risk management and anomaly detection. One convenient (nonparametric) characterization of extreme dependence in the framework of multivariate Extreme Value Theory (EVT) is the angular measure, which provides direct information about the probable “directions” of extremes, i.e., the relative contribution of each feature/coordinate of the largest observations. Modeling the angular measure in high-dimensional problems is a major challenge for the multivariate analysis of rare events. The present paper proposes a novel methodology aiming at exhibiting a particular kind of sparsity within the dependence structure of extremes. This is achieved by estimating the amount of mass spread by the angular measure on representative sets of directions corresponding to specific sub-cones of R+d. This dimension reduction technique paves the way towards scaling up existing multivariate EVT methods. Beyond a non-asymptotic study providing a theoretical validity framework for our method, we propose as a direct application a first anomaly detection algorithm based on multivariate EVT. This algorithm builds a sparse normal profile of extreme behaviors, to be confronted with new (possibly abnormal) extreme observations. Illustrative experimental results provide strong empirical evidence of the relevance of our approach.

Suggested Citation

  • Goix, Nicolas & Sabourin, Anne & Clémençon, Stephan, 2017. "Sparse representation of multivariate extremes with applications to anomaly detection," Journal of Multivariate Analysis, Elsevier, vol. 161(C), pages 12-31.
  • Handle: RePEc:eee:jmvana:v:161:y:2017:i:c:p:12-31
    DOI: 10.1016/j.jmva.2017.06.010
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X17304062
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2017.06.010?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Einmahl, J.H.J. & Krajina, A. & Segers, J., 2011. "An M-Estimator for Tail Dependence in Arbitrary Dimensions," Discussion Paper 2011-013, Tilburg University, Center for Economic Research.
    2. Einmahl, J. H.J. & Dekkers, A. L.M. & de Haan, L., 1989. "A moment estimator for the index of an extreme-value distribution," Other publications TiSEM 81970cb3-5b7a-4cad-9bf6-2, Tilburg University, School of Economics and Management.
    3. Drees, Holger & Huang, Xin, 1998. "Best Attainable Rates of Convergence for Estimators of the Stable Tail Dependence Function," Journal of Multivariate Analysis, Elsevier, vol. 64(1), pages 25-47, January.
    4. Sabourin, Anne & Naveau, Philippe, 2014. "Bayesian Dirichlet mixture model for multivariate extremes: A re-parametrization," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 542-567.
    5. Einmahl, J.H.J. & de Haan, L.F.M. & Piterbarg, V.I., 2001. "Nonparametric estimation of the spectral measure of an extreme value distribution," Other publications TiSEM c3485b9b-a0bd-456f-9baa-0, Tilburg University, School of Economics and Management.
    6. Beirlant, Jan & Escobar-Bach, Mikael & Goegebeur, Yuri & Guillou, Armelle, 2016. "Bias-corrected estimation of stable tail dependence function," Journal of Multivariate Analysis, Elsevier, vol. 143(C), pages 453-466.
    7. Einmahl, J.H.J. & Segers, J.J.J., 2008. "Maximum Empirical Likelihood Estimation of the Spectral Measure of an Extreme Value Distribution," Other publications TiSEM e9340b9a-fe69-4e77-8594-8, Tilburg University, School of Economics and Management.
    8. Anne‐Laure Fougères & John P. Nolan & Holger Rootzén, 2009. "Models for Dependent Extremes Using Stable Mixtures," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 36(1), pages 42-59, March.
    9. Cooley, Daniel & Davis, Richard A. & Naveau, Philippe, 2010. "The pairwise beta distribution: A flexible parametric multivariate model for extremes," Journal of Multivariate Analysis, Elsevier, vol. 101(9), pages 2103-2117, October.
    10. Einmahl, John H. J. & Li, Jun & Liu, Regina Y., 2009. "Thresholding Events of Extreme in Simultaneous Monitoring of Multiple Risks," Journal of the American Statistical Association, American Statistical Association, vol. 104(487), pages 982-992.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lehtomaa, Jaakko & Resnick, Sidney I., 2020. "Asymptotic independence and support detection techniques for heavy-tailed multivariate data," Insurance: Mathematics and Economics, Elsevier, vol. 93(C), pages 262-277.
    2. Marc Chataigner & Stéphane Crépey & Jiang Pu, 2020. "Nowcasting Networks," Post-Print hal-03910123, HAL.
    3. Mourahib, Anas & Kiriliouk, Anna & Segers, Johan, 2023. "Multivariate generalized Pareto distributions along extreme directions," LIDAM Discussion Papers ISBA 2023034, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    4. Maël Chiapino & Stephan Clémençon & Vincent Feuillard & Anne Sabourin, 2020. "A multivariate extreme value theory approach to anomaly clustering and visualization," Computational Statistics, Springer, vol. 35(2), pages 607-628, June.
    5. Simpson, Emma S. & Wadsworth, Jennifer L. & Tawn, Jonathan A., 2021. "A geometric investigation into the tail dependence of vine copulas," Journal of Multivariate Analysis, Elsevier, vol. 184(C).
    6. Chiapino, Mael & Sabourin, Anne & Segers, Johan, 2018. "Identifying groups of variables with the potential of being large simultaneously," LIDAM Discussion Papers ISBA 2018006, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    7. Marc Chataigner & Stephane Crepey & Jiang Pu, 2020. "Nowcasting Networks," Papers 2011.13687, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hu, Shuang & Peng, Zuoxiang & Segers, Johan, 2022. "Modelling multivariate extreme value distributions via Markov trees," LIDAM Discussion Papers ISBA 2022021, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    2. Kiriliouk, Anna, 2020. "Hypothesis testing for tail dependence parameters on the boundary of the parameter space," Econometrics and Statistics, Elsevier, vol. 16(C), pages 121-135.
    3. Segers, Johan, 2012. "Max-Stable Models For Multivariate Extremes," LIDAM Discussion Papers ISBA 2012011, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    4. Sabourin, Anne, 2015. "Semi-parametric modeling of excesses above high multivariate thresholds with censored data," Journal of Multivariate Analysis, Elsevier, vol. 136(C), pages 126-146.
    5. Kiriliouk, Anna & Segers, Johan & Warchol, Michal, 2014. "Nonparametric estimation of extremal dependence," LIDAM Discussion Papers ISBA 2014044, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    6. Einmahl, J.H.J. & de Haan, L.F.M. & Krajina, A., 2009. "Estimating Extreme Bivariate Quantile Regions," Other publications TiSEM 007ce0a9-dd94-4301-ad62-1, Tilburg University, School of Economics and Management.
    7. Bücher, Axel & Volgushev, Stanislav & Zou, Nan, 2019. "On second order conditions in the multivariate block maxima and peak over threshold method," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 604-619.
    8. Kiriliouk, Anna, 2017. "Hypothesis testing for tail dependence parameters on the boundary of the parameter space with application to generalized max-linear models," LIDAM Discussion Papers ISBA 2017027, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    9. Sabourin, Anne & Naveau, Philippe, 2014. "Bayesian Dirichlet mixture model for multivariate extremes: A re-parametrization," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 542-567.
    10. de Carvalho, Miguel & Oumow, Boris & Segers, Johan & WarchoÅ‚, MichaÅ‚, 2012. "A Euclidean likelihood estimator for bivariate tail dependence," LIDAM Discussion Papers ISBA 2012013, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    11. Kiriliouk, Anna & Segers, Johan & Tafakori, Laleh, 2018. "An estimator of the stable tail dependence function based on the empirical beta copula," LIDAM Discussion Papers ISBA 2018029, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    12. Cui, Hengxin & Tan, Ken Seng & Yang, Fan & Zhou, Chen, 2022. "Asymptotic analysis of portfolio diversification," Insurance: Mathematics and Economics, Elsevier, vol. 106(C), pages 302-325.
    13. Krajina, A., 2010. "An M-estimator of multivariate tail dependence," Other publications TiSEM 66518e07-db9a-4446-81be-c, Tilburg University, School of Economics and Management.
    14. Padoan, Simone A., 2011. "Multivariate extreme models based on underlying skew-t and skew-normal distributions," Journal of Multivariate Analysis, Elsevier, vol. 102(5), pages 977-991, May.
    15. Mourahib, Anas & Kiriliouk, Anna & Segers, Johan, 2023. "Multivariate generalized Pareto distributions along extreme directions," LIDAM Discussion Papers ISBA 2023034, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    16. Cai, J., 2012. "Estimation concerning risk under extreme value conditions," Other publications TiSEM a92b089f-bc4c-41c2-b297-c, Tilburg University, School of Economics and Management.
    17. Kiriliouk, Anna & Segers, Johan & Tafakori, Laleh, 2017. "An estimator of the stable tail dependence function based on the empirical beta copula," LIDAM Discussion Papers ISBA 2017028, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    18. Khader Khadraoui & Pierre Ribereau, 2019. "Bayesian Inference with M-splines on Spectral Measure of Bivariate Extremes," Methodology and Computing in Applied Probability, Springer, vol. 21(3), pages 765-788, September.
    19. Gissibl, Nadine & Klüppelberg, Claudia & Otto, Moritz, 2018. "Tail dependence of recursive max-linear models with regularly varying noise variables," Econometrics and Statistics, Elsevier, vol. 6(C), pages 149-167.
    20. Carsten Bormann & Julia Schaumburg & Melanie Schienle, 2016. "Beyond Dimension two: A Test for Higher-Order Tail Risk," The Journal of Financial Econometrics, Society for Financial Econometrics, vol. 14(3), pages 552-580.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:161:y:2017:i:c:p:12-31. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.