Printed from https://ideas.repec.org/a/sae/envirb/v52y2025i4p1002-1013.html

Packaging code and data for reproducible research: A case study of journey time statistics

Author

Listed:
  • Federico Botta

    (Department of Computer Science, University of Exeter, Exeter, UK;
    Fellow, The Alan Turing Institute, UK)

  • Robin Lovelace
  • Laura Gilbert
  • Arthur Turrell

Abstract

The effective and ethical use of data to inform decision-making offers huge value to the public sector, especially when delivered by transparent, reproducible, and robust data processing workflows. One way that governments are unlocking this value is through making their data publicly available, allowing more people and organisations to derive insights. However, open data is not enough in many cases: publicly available datasets need to be accessible in an analysis-ready form from popular data science tools, such as R and Python, for them to realise their full potential. This paper explores ways to maximise the impact of open data with reference to a case study of packaging code to facilitate reproducible analysis. We present the jtstats project, which consists of a main Python package, and a smaller R version, for importing, processing, and visualising large and complex datasets representing journey times, for many transport modes and trip purposes at multiple geographic levels, released by the UK Department for Transport (DfT). jtstats shows how domain-specific packages can enable reproducible research within the public sector and beyond, saving duplicated effort and reducing the risks of errors from repeated analyses. We hope that the jtstats project inspires others, particularly those in the public sector, to add value to their datasets by making them more accessible.

Suggested Citation

  • Federico Botta & Robin Lovelace & Laura Gilbert & Arthur Turrell, 2025. "Packaging code and data for reproducible research: A case study of journey time statistics," Environment and Planning B, vol. 52(4), pages 1002-1013, May.
  • Handle: RePEc:sae:envirb:v:52:y:2025:i:4:p:1002-1013
    DOI: 10.1177/23998083241267331

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/23998083241267331
    Download Restriction: no

    File URL: https://libkey.io/10.1177/23998083241267331?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.