IDEAS home Printed from https://ideas.repec.org/p/hal/journl/hal-05393158.html

CyberRisk Prediction using Machine Learning and Extreme Value Theory

Author

Listed:
  • Jules Sadefo Kamdem

    (MRE - Montpellier Recherche en Economie - UM - Université de Montpellier)

  • Danielle Selambi Kapsa

    (African Institute for Mathematical Sciences (AIMS), Limbe Crystal Gardens, P.O. Box 608, Limbe, South West Region, Cameroon)

Abstract

This paper develops a hybrid framework for quantifying the financial impact of data breaches by combining predictive machine learning with extreme value theory (EVT). Using incident-level breach data from the Privacy Rights Clearinghouse (PRC) covering the period 2005–2020, we first estimate the number of compromised records with a Random Forest model trained on organizational, temporal, and attack-type characteristics. We then analyze the tail behavior of the predicted losses to capture the fat-tailed distribution of cyber risks. Our results indicate that the distribution of affected records is well represented by a Fr´echet law, and we estimate the parameters of the Generalized Extreme Value (GEV) distribution to compute Value-at-Risk (VaR) at high confidence levels. This two-stage approach provides a rigorous assessment of maximum potential losses, addressing the question of cyber-risk insurability. By linking predictive accuracy with tail risk quantification, our findings deliver actionable insights for insurers, regulators, and organizations seeking to anticipate and manage the f inancial consequences of large-scale data breaches.

Suggested Citation

  • Jules Sadefo Kamdem & Danielle Selambi Kapsa, 2025. "CyberRisk Prediction using Machine Learning and Extreme Value Theory," Post-Print hal-05393158, HAL.
  • Handle: RePEc:hal:journl:hal-05393158
    as

    Download full text from publisher

    To our knowledge, this item is not available for download. To find whether it is available, there are three options:
    1. Check below whether another version of this item is available online.
    2. Check on the provider's web page whether it is in fact available.
    3. Perform a
    for a similarly titled item that would be available.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-05393158. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.