IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v9y2024i1p14-d1317190.html
   My bibliography  Save this article

GeMSyD: Generic Framework for Synthetic Data Generation

Author

Listed:
  • Ramona Tolas

    (Computer Science Department, Technical University of Cluj Napoca, 400114 Cluj-Napoca, Romania)

  • Raluca Portase

    (Computer Science Department, Technical University of Cluj Napoca, 400114 Cluj-Napoca, Romania)

  • Rodica Potolea

    (Computer Science Department, Technical University of Cluj Napoca, 400114 Cluj-Napoca, Romania)

Abstract

In the era of data-driven technologies, the need for diverse and high-quality datasets for training and testing machine learning models has become increasingly critical. In this article, we present a versatile methodology, the Generic Methodology for Constructing Synthetic Data Generation (GeMSyD), which addresses the challenge of synthetic data creation in the context of smart devices. GeMSyD provides a framework that enables the generation of synthetic datasets, aligning them closely with real-world data. To demonstrate the utility of GeMSyD, we instantiate the methodology by constructing a synthetic data generation framework tailored to the domain of event-based data modeling, specifically focusing on user interactions with smart devices. Our framework leverages GeMSyD to create synthetic datasets that faithfully emulate the dynamics of human–device interactions, including the temporal dependencies. Furthermore, we showcase how the synthetic data generated using our framework can serve as a valuable resource for machine learning practitioners. By employing these synthetic datasets, we perform a series of experiments to evaluate the performance of a neural-network-based prediction model in the domain of smart device interaction. Our results underscore the potential of synthetic data in facilitating model development and benchmarking.

Suggested Citation

  • Ramona Tolas & Raluca Portase & Rodica Potolea, 2024. "GeMSyD: Generic Framework for Synthetic Data Generation," Data, MDPI, vol. 9(1), pages 1-28, January.
  • Handle: RePEc:gam:jdataj:v:9:y:2024:i:1:p:14-:d:1317190
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/9/1/14/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/9/1/14/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:9:y:2024:i:1:p:14-:d:1317190. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.