IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2507.01991.html
   My bibliography  Save this paper

FinAI-BERT: A Transformer-Based Model for Sentence-Level Detection of AI Disclosures in Financial Reports

Author

Listed:
  • Muhammad Bilal Zafar

Abstract

The proliferation of artificial intelligence (AI) in financial services has prompted growing demand for tools that can systematically detect AI-related disclosures in corporate filings. While prior approaches often rely on keyword expansion or document-level classification, they fall short in granularity, interpretability, and robustness. This study introduces FinAI-BERT, a domain-adapted transformer-based language model designed to classify AI-related content at the sentence level within financial texts. The model was fine-tuned on a manually curated and balanced dataset of 1,586 sentences drawn from 669 annual reports of U.S. banks (2015 to 2023). FinAI-BERT achieved near-perfect classification performance (accuracy of 99.37 percent, F1 score of 0.993), outperforming traditional baselines such as Logistic Regression, Naive Bayes, Random Forest, and XGBoost. Interpretability was ensured through SHAP-based token attribution, while bias analysis and robustness checks confirmed the model's stability across sentence lengths, adversarial inputs, and temporal samples. Theoretically, the study advances financial NLP by operationalizing fine-grained, theme-specific classification using transformer architectures. Practically, it offers a scalable, transparent solution for analysts, regulators, and scholars seeking to monitor the diffusion and framing of AI across financial institutions.

Suggested Citation

  • Muhammad Bilal Zafar, 2025. "FinAI-BERT: A Transformer-Based Model for Sentence-Level Detection of AI Disclosures in Financial Reports," Papers 2507.01991, arXiv.org.
  • Handle: RePEc:arx:papers:2507.01991
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2507.01991
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Zhou, Linjiang & Shi, Xiaochuan & Bao, Yaxiong & Gao, Lihua & Ma, Chao, 2023. "Explainable artificial intelligence for digital finance and consumption upgrading," Finance Research Letters, Elsevier, vol. 58(PC).
    2. Basnet, Anup & Elias, Maxim & Salganik-Shoshan, Galla & Walker, Thomas & Zhao, Yunfei, 2025. "Analyzing the market's reaction to AI narratives in corporate filings," International Review of Financial Analysis, Elsevier, vol. 105(C).
    3. Chen, Xun-Qi & Ma, Chao-Qun & Ren, Yi-Shuai & Lei, Yu-Tian & Huynh, Ngoc Quang Anh & Narayan, Seema, 2023. "Explainable artificial intelligence in finance: A bibliometric review," Finance Research Letters, Elsevier, vol. 56(C).
    4. Xiaoqian Zhu & Huidong Wu & Yanpeng Chang & Jianping Li, 2025. "Accounting fraud detection through textual risk disclosures in annual reports: From the perspective of SEC guidelines," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 65(2), pages 1837-1862, June.
    5. Tim Loughran & Bill Mcdonald, 2011. "When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10‐Ks," Journal of Finance, American Finance Association, vol. 66(1), pages 35-65, February.
    6. Elaine Henry & James Thewissen & Wouter Torsin, 2023. "International Earnings Announcements: Tone, Forward-looking Statements, and Informativeness," European Accounting Review, Taylor & Francis Journals, vol. 32(2), pages 275-309, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xing, Manjiang & Gong, Chi & Moon, Gyu-Hyen & Ge, Xiaohong, 2025. "Digital economy, dual innovation capability and enterprise labor productivity," International Review of Financial Analysis, Elsevier, vol. 101(C).
    2. Darko B. Vuković & Senanu Dekpo-Adza & Stefana Matović, 2025. "AI integration in financial services: a systematic review of trends and regulatory challenges," Humanities and Social Sciences Communications, Palgrave Macmillan, vol. 12(1), pages 1-29, December.
    3. Zhang, Xinyu & Liu, Yixuan & Yu, Shaoze & Lin, Ouwen & Meng, Lu, 2025. "Impact of environmental protection tax on enterprise digital transformation: Evidence from Chinese listed firms," International Review of Economics & Finance, Elsevier, vol. 97(C).
    4. Van, Huong Vu & Kim, Cuong Ly, 2025. "Does increased digital transformation decrease corporate cash holdings? The moderating role of business environment quality," Finance Research Letters, Elsevier, vol. 75(C).
    5. Zhang, Dongyang, 2024. "The pathway to curb greenwashing in sustainable growth: The role of artificial intelligence," Energy Economics, Elsevier, vol. 133(C).
    6. Li, Yuantian & Xu, Haoyu & Li, Huiru & Xu, Yingming, 2025. "Digital economy, environmental regulations, and green economic development," Finance Research Letters, Elsevier, vol. 75(C).
    7. Chishti, Muhammad Zubair & Xia, Xiqiang & Du, Anna Min & Özkan, Oktay, 2025. "Digital financial inclusion, the belt and road initiative, and the Paris agreement: Impacts on energy transition grid costs," Finance Research Letters, Elsevier, vol. 72(C).
    8. Xu, Guiyang, 2025. "KIBS Clustering, digital financial inclusion, and economic growth quality," Finance Research Letters, Elsevier, vol. 72(C).
    9. Wang, Wei & Cao, QiangYuan & Li, Zhouyang & Zhu, JinYing, 2025. "Digital transformation and corporate environmental performance," Finance Research Letters, Elsevier, vol. 76(C).
    10. Li, Haibo & Xiao, Biyun, 2025. "Strengthening provincial export resilience through the digital economy: Analysis of impacts, global value chain integration, and regional disparities in China," Finance Research Letters, Elsevier, vol. 76(C).
    11. Zhou, Ling & Li, Yang, 2025. "Commercial credit supply in the digital economy era: Impact mechanisms and differences in property rights," International Review of Financial Analysis, Elsevier, vol. 99(C).
    12. Ding, Qiaoying & He, Wensheng & Deng, Yanfang, 2025. "Can tax reduction incentive policy promote corporate digital and intelligent transformation?," International Review of Financial Analysis, Elsevier, vol. 99(C).
    13. Cheng, Hang & Ruan, Ping & Wang, Peng, 2024. "The impact of surging computing power on enterprise digital transformation–Based on quasi-natural experiments set up by the National Supercomputing Center," Finance Research Letters, Elsevier, vol. 65(C).
    14. Chen, Biao & Lin, Huan & Shan, Biaoan & Xiao, Yundong, 2024. "Government investment in science and technology, digital transformation, and innovation in manufacturing enterprises," Finance Research Letters, Elsevier, vol. 69(PB).
    15. Lyu, Jiayu & Li, Lu & Liu, Yuqi & Deng, Qian, 2025. "Promoting common prosperity: How do digital capability and financial literacy matter?," International Review of Financial Analysis, Elsevier, vol. 97(C).
    16. Curi, Claudia & Mancuso, Paolo & Scarpa, Alessandro, 2025. "State-owned enterprises: A bibliometric review and research agenda," Finance Research Letters, Elsevier, vol. 74(C).
    17. Xu, Zhan & Wang, Solomon & Ye, Junchen, 2024. "The effect of digitization on corporate fraud detection evidence from China," International Review of Financial Analysis, Elsevier, vol. 96(PB).
    18. Liu, Xukang & Ma, Chaoqun & Ren, Yi-Shuai, 2024. "How AI powers ESG performance in China's digital frontier?," Finance Research Letters, Elsevier, vol. 70(C).
    19. Gu, Chen & Kurov, Alexander & Wolfe, Marketa Halova, 2018. "Relief Rallies after FOMC Announcements as a Resolution of Uncertainty," Journal of Empirical Finance, Elsevier, vol. 49(C), pages 1-18.
    20. Müller, Karsten, 2020. "German forecasters' narratives: How informative are German business cycle forecast reports?," Working Papers 23, German Research Foundation's Priority Programme 1859 "Experience and Expectation. Historical Foundations of Economic Behaviour", Humboldt University Berlin.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2507.01991. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.