IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v14y2022i7p4211-d785405.html
   My bibliography  Save this article

Investigating the Potential of Data Science Methods for Sustainable Public Transport

Author

Listed:
  • Christine Keller

    (Institute for Ubiquitous Mobility Systems, Hochschule Karlsruhe University of Applied Sciences, 76133 Karlsruhe, Germany)

  • Felix Glück

    (Institute for Ubiquitous Mobility Systems, Hochschule Karlsruhe University of Applied Sciences, 76133 Karlsruhe, Germany)

  • Carl Friedrich Gerlach

    (Institute for Ubiquitous Mobility Systems, Hochschule Karlsruhe University of Applied Sciences, 76133 Karlsruhe, Germany)

  • Thomas Schlegel

    (Institute for Ubiquitous Mobility Systems, Hochschule Karlsruhe University of Applied Sciences, 76133 Karlsruhe, Germany)

Abstract

The planning and implementation of public transport involves many data sources. These data sources in turn generate a high volume of data, in a wide variety of formats and data rates. This phenomenon is reinforced by the ongoing digitization of public transport; new data sources have continuously emerged in public transport in recent years and decades. This results in a great potential for the application and utilization of data science methods in public transport. Using big data methods and sources can, or in some cases already does, contribute to a better understanding and the further optimization of public transport networks, public transport service and public transport in general. This paper classifies data sources in the field of public transport and examines systematically for which use cases the data are used or can be used. These steps contribute by structuring ongoing discussions about the application of data science in the public transport domain and illustrate the potential of the application of data science for public transport. We present several use cases in which we applied data science methods, such as machine learning and visualization to public transport data. Several of these projects use data from automated passenger information systems, a data source that has not been widely studied to date. We report our findings for these use cases and discuss the lessons learned, to inform future research on these use cases and discuss their potential. This paper concludes with a summary of the typical problems that occur when dealing with big public transport data and a discussion of solutions for these problems. This discussion identifies future work and topics worth investigating for public transport companies as well as for researchers. Working on these topics will, in our opinion, support the improvement of public transport towards the efficiency and attractiveness that is needed for public transport to play its essential role in future sustainable mobility. The application of these methods in public transport requires the collaboration of domain experts with researchers and data scientists, calling for a mutual understanding. This paper also contributes to this understanding by providing an overview of the methods that are already used, potential new use cases, data sources, challenges and possible solutions.

Suggested Citation

  • Christine Keller & Felix Glück & Carl Friedrich Gerlach & Thomas Schlegel, 2022. "Investigating the Potential of Data Science Methods for Sustainable Public Transport," Sustainability, MDPI, vol. 14(7), pages 1-26, April.
  • Handle: RePEc:gam:jsusta:v:14:y:2022:i:7:p:4211-:d:785405
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/14/7/4211/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/14/7/4211/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Timothy F. Welch & Alyas Widita, 2019. "Big data in public transportation: a review of sources and methods," Transport Reviews, Taylor & Francis Journals, vol. 39(6), pages 795-818, November.
    2. Li Cai & Sijin Li & Shipu Wang & Yu Liang, 2018. "GPS Trajectory Clustering and Visualization Analysis," Annals of Data Science, Springer, vol. 5(1), pages 29-42, March.
    3. Bagchi, M. & White, P.R., 2005. "The potential of public transport smart card data," Transport Policy, Elsevier, vol. 12(5), pages 464-474, September.
    4. Shefang Wang & Chaoru Lu & Chenhui Liu & Yue Zhou & Jun Bi & Xiaomei Zhao, 2020. "Understanding the Energy Consumption of Battery Electric Buses in Urban Public Transport Systems," Sustainability, MDPI, vol. 12(23), pages 1-12, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liao, Cong & Scheuer, Bronte, 2022. "Evaluating the performance of transit-oriented development in Beijing metro station areas: Integrating morphology and demand into the node-place model," Journal of Transport Geography, Elsevier, vol. 100(C).
    2. Liping Ge & Malek Sarhani & Stefan Voß & Lin Xie, 2021. "Review of Transit Data Sources: Potentials, Challenges and Complementarity," Sustainability, MDPI, vol. 13(20), pages 1-37, October.
    3. Kevin Credit & Zander Arnao, 2023. "A method to derive small area estimates of linked commuting trips by mode from open source LODES and ACS data," Environment and Planning B, , vol. 50(3), pages 709-722, March.
    4. Wang, Yihong & Correia, Gonçalo Homem de Almeida & de Romph, Erik & Timmermans, H.J.P., 2017. "Using metro smart card data to model location choice of after-work activities: An application to Shanghai," Journal of Transport Geography, Elsevier, vol. 63(C), pages 40-47.
    5. Apanasevic, Tatjana & Rudmark, Daniel, 2021. "Crowdsourcing and Public Transportation: Barriers and Opportunities," 23rd ITS Biennial Conference, Online Conference / Gothenburg 2021. Digital societies and industrial transformations: Policies, markets, and technologies in a post-Covid world 238005, International Telecommunications Society (ITS).
    6. Iván López & Pedro Luis Calvo & Gonzalo Fernández-Sánchez & Carlos Sierra & Roberto Corchero & Cesar Omar Chacón & Carlos de Juan & Daniel Rosas & Francisco Burgos, 2022. "Different Approaches for a Goal: The Electrical Bus-EMT Madrid as a Successful Case Study," Energies, MDPI, vol. 15(17), pages 1-24, August.
    7. Tao, Sui & Rohde, David & Corcoran, Jonathan, 2014. "Examining the spatial–temporal dynamics of bus passenger travel behaviour using smart card data and the flow-comap," Journal of Transport Geography, Elsevier, vol. 41(C), pages 21-36.
    8. Bantis, Thanos & Haworth, James, 2020. "Assessing transport related social exclusion using a capabilities approach to accessibility framework: A dynamic Bayesian network approach," Journal of Transport Geography, Elsevier, vol. 84(C).
    9. Mohammadi, Neda & Taylor, John E., 2017. "Urban energy flux: Spatiotemporal fluctuations of building energy consumption and human mobility-driven prediction," Applied Energy, Elsevier, vol. 195(C), pages 810-818.
    10. Masood Jafari Kang & Shervin Ataeian & S. M. Mahdi Amiripour, 2021. "A procedure for public transit OD matrix generation using smart card transaction data," Public Transport, Springer, vol. 13(1), pages 81-100, March.
    11. Benito Zaragozí & Sergio Trilles & Aaron Gutiérrez & Daniel Miravet, 2021. "Development of a Common Framework for Analysing Public Transport Smart Card Data," Energies, MDPI, vol. 14(19), pages 1-22, September.
    12. Zhu, Yiwen & Koutsopoulos, Haris N. & Wilson, Nigel H.M., 2017. "A probabilistic Passenger-to-Train Assignment Model based on automated data," Transportation Research Part B: Methodological, Elsevier, vol. 104(C), pages 522-542.
    13. Qingru Zou & Xiangming Yao & Peng Zhao & Heng Wei & Hui Ren, 2018. "Detecting home location and trip purposes for cardholders by mining smart card transaction data in Beijing subway," Transportation, Springer, vol. 45(3), pages 919-944, May.
    14. Erick Yohanes Kalengkongan & Wilson Bogar & Fitri H. Mamonto, 2022. "The Quality of Vehicles' Public Service Testing in The Tomohon Transportation Department," Technium Social Sciences Journal, Technium Science, vol. 32(1), pages 62-75, June.
    15. Simon Bell & Francesca Benatti & Neil R. Edwards & Robin Laney & David R. Morse & Lara Piccolo & Oliver Zanetti, 2018. "Smart Cities and M3: Rapid Research, Meaningful Metrics and Co-Design," Systemic Practice and Action Research, Springer, vol. 31(1), pages 27-53, February.
    16. De Zhao & Wei Wang & Amber Woodburn & Megan S. Ryerson, 2017. "Isolating high-priority metro and feeder bus transfers using smart card data," Transportation, Springer, vol. 44(6), pages 1535-1554, November.
    17. Marcin Połom & Paweł Wiśniewski, 2021. "Assessment of the Emission of Pollutants from Public Transport Based on the Example of Diesel Buses and Trolleybuses in Gdynia and Sopot," IJERPH, MDPI, vol. 18(16), pages 1-17, August.
    18. Hamed Faroqi & Mahmoud Mesbah & Jiwon Kim & Ali Khodaii, 2022. "Targeted Advertising in the Public Transit Network Using Smart Card Data," Networks and Spatial Economics, Springer, vol. 22(1), pages 97-124, March.
    19. Egu, Oscar & Bonnel, Patrick, 2020. "How comparable are origin-destination matrices estimated from automatic fare collection, origin-destination surveys and household travel survey? An empirical investigation in Lyon," Transportation Research Part A: Policy and Practice, Elsevier, vol. 138(C), pages 267-282.
    20. Wang, Zi-Jia & Jia, Hui-Hui & Dai, Fangzhou & Diao, Mi, 2022. "Understanding the ground access and airport choice behavior of air passengers using transit payment transaction data," Transport Policy, Elsevier, vol. 127(C), pages 179-190.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:14:y:2022:i:7:p:4211-:d:785405. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.