Author
Listed:
- Jacek Krzysztof Jakubczak
(Faculty of Economics, Maria Curie-Skłodowska University, 20-031 Lublin, Poland)
- Dorota Chmielewska-Muciek
(Faculty of Journalism, Information and Book Studies, University of Warsaw, 00-927 Warsaw, Poland)
- Katarzyna Iwanicka
(Faculty of Journalism, Information and Book Studies, University of Warsaw, 00-927 Warsaw, Poland)
Abstract
The rapid expansion of sustainability reporting under the EU Corporate Sustainability Reporting Directive (CSRD) has intensified concerns about greenwashing, particularly in visual communication within ESG reports. Recent advances in multimodal artificial intelligence offer new possibilities for automated detection, yet their reliability in non-English corporate reporting contexts remains unclear. This study evaluates the greenwashing detection capabilities of three leading multimodal AI systems—ChatGPT 5.1, Claude 4.5 Sonnet, and Gemini 2.5 Flash—using a purposively selected sample of 20 Polish ESG reports benchmarked against ESRS-aligned performance scores from the national “Ranking ESG”. A standardized auditing prompt was applied across all tools to generate comparable assessments of visual greenwashing. Contrary to theoretical expectations and all four hypotheses, the models did not demonstrate negative correlations between performance and AI-detected greenwashing; instead, high-performing firms frequently received higher greenwashing scores. Dimensional analyses showed inconsistent and often contradictory evaluations across Environmental, Social, and Governance pillars, while inter-tool reliability proved extremely low (Krippendorff’s α ≈ 0). These findings indicate that current multimodal AI systems conflate communication sophistication with deceptive intent and lack sufficient contextual understanding for ESG assurance. The study highlights significant methodological limitations and outlines directions for developing domain-specific, ESRS-aligned AI tools for greenwashing detection.
Suggested Citation
Jacek Krzysztof Jakubczak & Dorota Chmielewska-Muciek & Katarzyna Iwanicka, 2025.
"Evaluating Multimodal AI for Greenwashing Detection: A Comparative Analysis of ChatGPT, Claude, and Gemini in ESG Reports,"
Sustainability, MDPI, vol. 18(1), pages 1-18, December.
Handle:
RePEc:gam:jsusta:v:18:y:2025:i:1:p:236-:d:1826451
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:18:y:2025:i:1:p:236-:d:1826451. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.