IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v5y2020i2p43-d350094.html
   My bibliography  Save this article

Guidelines for a Standardized Filesystem Layout for Scientific Data

Author

Listed:
  • Florian Spreckelsen

    (Max Planck Institute for Dynamics and Self-Organization, 37077 Göttingen, Germany
    Institute for the Dynamics of Complex Systems, Georg-August-Universität, 37077 Göttingen, Germany
    German Center for Cardiovascular Research (DZHK), partner site Göttingen, 37075 Göttingen, Germany)

  • Baltasar Rüchardt

    (Max Planck Institute for Dynamics and Self-Organization, 37077 Göttingen, Germany
    Institute for the Dynamics of Complex Systems, Georg-August-Universität, 37077 Göttingen, Germany
    German Center for Cardiovascular Research (DZHK), partner site Göttingen, 37075 Göttingen, Germany)

  • Jan Lebert

    (Max Planck Institute for Dynamics and Self-Organization, 37077 Göttingen, Germany
    Institute for the Dynamics of Complex Systems, Georg-August-Universität, 37077 Göttingen, Germany
    German Center for Cardiovascular Research (DZHK), partner site Göttingen, 37075 Göttingen, Germany
    Department of Cardiology and Pneumology, University Medical Center Göttingen, 37075 Göttingen, Germany)

  • Stefan Luther

    (Max Planck Institute for Dynamics and Self-Organization, 37077 Göttingen, Germany
    Institute for the Dynamics of Complex Systems, Georg-August-Universität, 37077 Göttingen, Germany
    German Center for Cardiovascular Research (DZHK), partner site Göttingen, 37075 Göttingen, Germany
    Institute of Pharmacology and Toxicology, University Medical Center Göttingen, 37075 Göttingen, Germany)

  • Ulrich Parlitz

    (Max Planck Institute for Dynamics and Self-Organization, 37077 Göttingen, Germany
    Institute for the Dynamics of Complex Systems, Georg-August-Universität, 37077 Göttingen, Germany
    German Center for Cardiovascular Research (DZHK), partner site Göttingen, 37075 Göttingen, Germany)

  • Alexander Schlemmer

    (Max Planck Institute for Dynamics and Self-Organization, 37077 Göttingen, Germany
    German Center for Cardiovascular Research (DZHK), partner site Göttingen, 37075 Göttingen, Germany)

Abstract

Storing scientific data on the filesystem in a meaningful and transparent way is no trivial task. In particular, when the data have to be accessed after their originator has left the lab, the importance of a standardized filesystem layout cannot be underestimated. It is desirable to have a structure that allows for the unique categorization of all kinds of data from experimental results to publications. They have to be accessible to a broad variety of workflows, e.g., via graphical user interface as well as via command line, in order to find widespread acceptance. Furthermore, the inclusion of already existing data has to be as simple as possible. We propose a three-level layout to organize and store scientific data that incorporates the full chain of scientific data management from data acquisition to analysis to publications. Metadata are saved in a standardized way and connect original data to analyses and publications as well as to their originators. A simple software tool to check a file structure for compliance with the proposed structure is presented.

Suggested Citation

  • Florian Spreckelsen & Baltasar Rüchardt & Jan Lebert & Stefan Luther & Ulrich Parlitz & Alexander Schlemmer, 2020. "Guidelines for a Standardized Filesystem Layout for Scientific Data," Data, MDPI, vol. 5(2), pages 1-13, April.
  • Handle: RePEc:gam:jdataj:v:5:y:2020:i:2:p:43-:d:350094
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/5/2/43/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/5/2/43/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Xiaogang Ma & Peter Fox & Curt Tilmes & Katharine Jacobs & Anne Waple, 2014. "Capturing provenance of global change information," Nature Climate Change, Nature, vol. 4(6), pages 409-413, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Henrik tom Wörden & Florian Spreckelsen & Stefan Luther & Ulrich Parlitz & Alexander Schlemmer, 2024. "Mapping Hierarchical File Structures to Semantic Data Models for Efficient Data Integration into Research Data Management Systems," Data, MDPI, vol. 9(2), pages 1-15, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Anne Waple & Sarah Champion & Kenneth Kunkel & Curt Tilmes, 2016. "Innovations in information management and access for assessments," Climatic Change, Springer, vol. 135(1), pages 69-83, March.
    2. Anne M. Waple & Sarah M. Champion & Kenneth E. Kunkel & Curt Tilmes, 2016. "Innovations in information management and access for assessments," Climatic Change, Springer, vol. 135(1), pages 69-83, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:5:y:2020:i:2:p:43-:d:350094. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.