IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v17y2025i2p59-d1582738.html
   My bibliography  Save this article

A Methodological Framework for AI-Driven Textual Data Analysis in Digital Media

Author

Listed:
  • Douglas Cordeiro

    (Faculty of Information and Communication, Federal University of Goiás, Goiânia 74690-900, GO, Brazil)

  • Carlos Lopezosa

    (Faculty of Information and Audiovisual Media, University of Barcelona, 08193 Barcelona, Spain)

  • Javier Guallar

    (Faculty of Information and Audiovisual Media, University of Barcelona, 08193 Barcelona, Spain)

Abstract

The growing volume of textual data generated on digital media platforms presents significant challenges for the analysis and interpretation of information. This article proposes a methodological approach that combines artificial intelligence (AI) techniques and statistical methods to explore and analyze textual data from digital media. The framework, titled DAFIM (Data Analysis Framework for Information and Media), includes strategies for data collection through APIs and web scraping, textual data processing, and data enrichment using AI solutions, including named entity recognition (people, locations, objects, and brands) and the detection of clickbait in news. Sentiment analysis and text clustering techniques are integrated to support content analysis. The potential applications of this methodology include social networks, news aggregators, news portals, and newsletters, offering a robust framework for studying digital data and supporting informed decision-making. The proposed framework is validated through a case study involving data extracted from the Google News aggregation platform, focusing on the Israel–Lebanon conflict. This demonstrates the framework’s capability to uncover narrative patterns, content trends, and clickbait detection while also highlighting its advantages and limitations.

Suggested Citation

  • Douglas Cordeiro & Carlos Lopezosa & Javier Guallar, 2025. "A Methodological Framework for AI-Driven Textual Data Analysis in Digital Media," Future Internet, MDPI, vol. 17(2), pages 1-26, February.
  • Handle: RePEc:gam:jftint:v:17:y:2025:i:2:p:59-:d:1582738
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/17/2/59/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/17/2/59/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Asma Khanom & Damon Kiesow & Matt Zdun & Chi-Ren Shyu, 2023. "The News Crawler: A Big Data Approach to Local Information Ecosystems," Media and Communication, Cogitatio Press, vol. 11(3), pages 318-329.
    2. Brittany I. Davidson & Darja Wischerath & Daniel Racek & Douglas A. Parry & Emily Godwin & Joanne Hinds & Dirk Linden & Jonathan F. Roscoe & Laura Ayravainen & Alicia G. Cork, 2023. "Platform-controlled social media APIs threaten open science," Nature Human Behaviour, Nature, vol. 7(12), pages 2054-2057, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Casas, Andreu & Dagher, Georgia & O'Loughlin, Ben, 2025. "Academic Access to Social Media Data for the Study of Political Online Safety," SocArXiv 7pcjd, Center for Open Science.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:17:y:2025:i:2:p:59-:d:1582738. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.