IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2507.08193.html
   My bibliography  Save this paper

Entity-Specific Cyber Risk Assessment using InsurTech Empowered Risk Factors

Author

Listed:
  • Jiayi Guo
  • Zhiyu Quan
  • Linfeng Zhang

Abstract

The lack of high-quality public cyber incident data limits empirical research and predictive modeling for cyber risk assessment. This challenge persists due to the reluctance of companies to disclose incidents that could damage their reputation or investor confidence. Therefore, from an actuarial perspective, potential resolutions conclude two aspects: the enhancement of existing cyber incident datasets and the implementation of advanced modeling techniques to optimize the use of the available data. A review of existing data-driven methods highlights a significant lack of entity-specific organizational features in publicly available datasets. To address this gap, we propose a novel InsurTech framework that enriches cyber incident data with entity-specific attributes. We develop various machine learning (ML) models: a multilabel classification model to predict the occurrence of cyber incident types (e.g., Privacy Violation, Data Breach, Fraud and Extortion, IT Error, and Others) and a multioutput regression model to estimate their annual frequencies. While classifier and regressor chains are implemented to explore dependencies among cyber incident types as well, no significant correlations are observed in our datasets. Besides, we apply multiple interpretable ML techniques to identify and cross-validate potential risk factors developed by InsurTech across ML models. We find that InsurTech empowered features enhance prediction occurrence and frequency estimation robustness compared to only using conventional risk factors. The framework generates transparent, entity-specific cyber risk profiles, supporting customized underwriting and proactive cyber risk mitigation. It provides insurers and organizations with data-driven insights to support decision-making and compliance planning.

Suggested Citation

  • Jiayi Guo & Zhiyu Quan & Linfeng Zhang, 2025. "Entity-Specific Cyber Risk Assessment using InsurTech Empowered Risk Factors," Papers 2507.08193, arXiv.org, revised Jul 2025.
  • Handle: RePEc:arx:papers:2507.08193
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2507.08193
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Zhiyu Quan & Changyue Hu & Panyi Dong & Emiliano A. Valdez, 2024. "Improving Business Insurance Loss Models by Leveraging InsurTech Innovation," Papers 2401.16723, arXiv.org.
    2. Lerman, Robert I. & Yitzhaki, Shlomo, 1984. "A note on the calculation and interpretation of the Gini index," Economics Letters, Elsevier, vol. 15(3-4), pages 363-368.
    3. Linfeng Zhang & Changyue Hu & Zhiyu Quan, 2025. "NLP-Powered Repository and Search Engine for Academic Papers: A Case Study on Cyber Risk Literature with CyLit," North American Actuarial Journal, Taylor & Francis Journals, vol. 29(2), pages 390-421, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Quentin Wodon, 2000. "Microdeterminants of consumption, poverty, growth, and inequality in Bangladesh," Applied Economics, Taylor & Francis Journals, vol. 32(10), pages 1337-1352.
    2. José Lorenzo, 2002. "E-Index for measuring concentration," International Advances in Economic Research, Springer;International Atlantic Economic Society, vol. 8(4), pages 357-361, November.
    3. Nantian Huang & Hua Peng & Guowei Cai & Jikai Chen, 2016. "Power Quality Disturbances Feature Selection and Recognition Using Optimal Multi-Resolution Fast S-Transform and CART Algorithm," Energies, MDPI, vol. 9(11), pages 1-21, November.
    4. D'Errico, Marco & Macchiarelli, Corrado & Serafini, Roberta, 2015. "Differently unequal: Zooming-in on the distributional dimensions of the crisis in euro area countries," Economic Modelling, Elsevier, vol. 48(C), pages 93-115.
    5. Stéphane Mussard & J. Sadefo Kamdem & Françoise Seyte & Michel Terraza, 2011. "Quadratic Pen'S Parade And The Computation Of The Gini Index," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 57(3), pages 583-587, September.
    6. Peter Martey Addo & Dominique Guegan & Bertrand Hassani, 2018. "Credit Risk Analysis Using Machine and Deep Learning Models," Risks, MDPI, vol. 6(2), pages 1-20, April.
    7. Bénédicte H. Apouey & Jacques Silber, 2016. "Performance and Inequality in Health: A Comparison of Child and Maternal Health across Asia," Research on Economic Inequality, in: Inequality after the 20th Century: Papers from the Sixth ECINEQ Meeting, volume 24, pages 181-214, Emerald Group Publishing Limited.
    8. Masato Okamoto, 2009. "Decomposition of gini and multivariate gini indices," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 7(2), pages 153-177, June.
    9. Serfling, Robert & Xiao, Peng, 2007. "A contribution to multivariate L-moments: L-comoment matrices," Journal of Multivariate Analysis, Elsevier, vol. 98(9), pages 1765-1781, October.
    10. Adam Wagstaff & Eddy van Doorslaer, 2004. "Overall versus socioeconomic health inequality: a measurement framework and two empirical illustrations," Health Economics, John Wiley & Sons, Ltd., vol. 13(3), pages 297-301, March.
    11. Adam Wagstaff & Eddy Van Doorslaer, 1994. "Measuring inequalities in health in the presence of multiple‐category morbidity indicators," Health Economics, John Wiley & Sons, Ltd., vol. 3(4), pages 281-291, July.
    12. Quentin T. Wodon, 1999. "Between Group Inequality And Targeted Transfers," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 45(1), pages 21-39, March.
    13. Yoel Finkel & Yevgeny Artsev & Shlomo Yitzhaki, 2006. "Inequality measurement and the time structure of household income in Israel," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 4(2), pages 153-179, August.
    14. Schechtman, E. & Yitzhaki, S., 1999. "On the proper bounds of the Gini correlation," Economics Letters, Elsevier, vol. 63(2), pages 133-138, May.
    15. Joachim R. Frick & Jan Goebel & Edna Schechtman & Gert G. Wagner & Shlomo Yitzhaki, 2006. "Using Analysis of Gini (ANOGI) for Detecting Whether Two Subsamples Represent the Same Universe," Sociological Methods & Research, , vol. 34(4), pages 427-468, May.
    16. Simone Pellegrino, 2020. "The Gini Coefficient: Its Origins," Working papers 070, Department of Economics, Social Studies, Applied Mathematics and Statistics (Dipartimento di Scienze Economico-Sociali e Matematico-Statistiche), University of Torino.
      • Simone Pellegrino, 2024. "The Gini Coefficient: Its Origins," Working papers 086, Department of Economics, Social Studies, Applied Mathematics and Statistics (Dipartimento di Scienze Economico-Sociali e Matematico-Statistiche), University of Torino.
    17. Cameron Nadim Haddad & Daniel Gerszon Mahler & Carolina Diaz-Bonilla & Ruth Hill & Christoph Lakner & Gabriel Lara Ibarra, 2024. "The World Bank’s New Inequality Indicator : The Number of Countries with High Inequality," Policy Research Working Paper Series 10796, The World Bank.
    18. Branko Milanovic & Shlomo Yitzhak, 2006. "Decomposing World Income Distribution: Does The World Have A Middle Class?," IBT Journal of Business Studies (JBS), Ilma University, Faculty of Management Science, vol. 2(2), pages 88-110.
    19. Yves Tillé, 2016. "The legacy of Corrado Gini in survey sampling and inequality theory," METRON, Springer;Sapienza Università di Roma, vol. 74(2), pages 167-176, August.
    20. H. Eme Ichoku & William Fonta & Michael Thiede, 2011. "Socioeconomic gradients in self-rated health: a developing country case study of Enugu State, Nigeria," Economic Change and Restructuring, Springer, vol. 44(3), pages 179-202, August.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2507.08193. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.