IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v11y2026i2p25-d1850953.html

TGEconomicDataset: A Collection of Russian-Language Economic Telegram Channels and a Synthetic Data Generation Framework for Continuous Authentication

Author

Listed:
  • Elena Luneva

    (Department of Comprehensive Information Security of Electronic Computer Systems, Tomsk State University of Control Systems and Radioelectronics, 634050 Tomsk, Russia)

  • Pavel Banokin

    (Department of Comprehensive Information Security of Electronic Computer Systems, Tomsk State University of Control Systems and Radioelectronics, 634050 Tomsk, Russia)

  • Alexander Shelupanov

    (Department of Comprehensive Information Security of Electronic Computer Systems, Tomsk State University of Control Systems and Radioelectronics, 634050 Tomsk, Russia)

Abstract

Telegram, along with WhatsApp and Signal, has become very popular due to its hybrid capabilities, including both instant private and public messaging, making it an effective tool for quickly broadcasting content to a wide audience. This article presents TGEconomicDataset, a new dataset containing more than 2.9 million messages from the most popular Russian-language Telegram channels in the field of economics, as well as synthetically generated labeled mixtures of these channels. These mixtures are specifically designed to model authorship change scenarios for testing various methods for solving the problem of continuous authentication, which is of particular interest due to the need for organizations and companies to rely on data posted on social media. The presented dataset is enriched with quotes of important financial instruments such as gold futures, the USD/RUB currency pair, BRENT oil, the dollar index (DXY), and bitcoin (BTC), synchronized with the message timestamps. A detailed joint analysis of the collected data is provided. In addition to the presented dataset, we publish the scripts used to collect the data, integrate the financial indicators, and generate the synthetic mixtures for the continuous authentication task, ensuring full reproducibility of the research.

Suggested Citation

  • Elena Luneva & Pavel Banokin & Alexander Shelupanov, 2026. "TGEconomicDataset: A Collection of Russian-Language Economic Telegram Channels and a Synthetic Data Generation Framework for Continuous Authentication," Data, MDPI, vol. 11(2), pages 1-29, January.
  • Handle: RePEc:gam:jdataj:v:11:y:2026:i:2:p:25-:d:1850953
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/11/2/25/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/11/2/25/
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:11:y:2026:i:2:p:25-:d:1850953. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager The email address of this maintainer does not seem to be valid anymore. Please ask MDPI Indexing Manager to update the entry or send us the correct address (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.