IDEAS home Printed from https://ideas.repec.org/p/osf/socarx/trbv9.html
   My bibliography  Save this paper

Mitigating Bias in Big Data for Transportation

Author

Listed:
  • Griffin, Greg Phillip

    (The University of Texas at San Antonio)

  • Mulhall, Megan
  • Simek, Chris
  • Riggs, William W.

Abstract

Emerging big data resources and practices provide opportunities to improve transportation safety planning and outcomes. However, researchers and practitioners recognise that big data from mobile phones, social media, and on-board vehicle systems include biases in representation and accuracy, related to transportation safety statistics. This study examines both the sources of bias and approaches to mitigate them through a review of published studies and interviews with experts. Coding of qualitative data enabled topical comparisons and reliability metrics. Results identify four categories of bias and mitigation approaches that concern transportation researchers and practitioners: sampling, measurement, demographics, and aggregation. This structure for understanding and working with bias in big data supports research with practical approaches for rapidly evolving transportation data sources.

Suggested Citation

  • Griffin, Greg Phillip & Mulhall, Megan & Simek, Chris & Riggs, William W., 2020. "Mitigating Bias in Big Data for Transportation," SocArXiv trbv9, Center for Open Science.
  • Handle: RePEc:osf:socarx:trbv9
    DOI: 10.31219/osf.io/trbv9
    as

    Download full text from publisher

    File URL: https://osf.io/download/5e21d350675e0e008c6b5eb7/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/trbv9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Piwek, Lukasz & Joinson, Adam & Morvan, James, 2015. "The use of self-monitoring solutions amongst cyclists: An online survey and empirical study," Transportation Research Part A: Policy and Practice, Elsevier, vol. 77(C), pages 126-136.
    2. Peng, Peng & Yang, Yu & Lu, Feng & Cheng, Shifen & Mou, Naixia & Yang, Ren, 2018. "Modelling the competitiveness of the ports along the Maritime Silk Road with big data," Transportation Research Part A: Policy and Practice, Elsevier, vol. 118(C), pages 852-867.
    3. Andrew Mondschein, 2015. "Five-star transportation: using online activity reviews to examine mode choice to non-work destinations," Transportation, Springer, vol. 42(4), pages 707-722, July.
    4. Mi Diao & Yi Zhu & Joseph Ferreira Jr & Carlo Ratti, 2016. "Inferring individual daily activities from mobile phone traces: A Boston example," Environment and Planning B, , vol. 43(5), pages 920-940, September.
    5. Erick Guerra & Eric A. Morris, 2018. "Cities, Automation, and the Self-parking Elephant in the Room," Planning Theory & Practice, Taylor & Francis Journals, vol. 19(2), pages 291-297, March.
    6. Zhao, Jinbao & Wang, Jian & Xing, Zhaomin & Luan, Xin & Jiang, Yang, 2018. "Weather and cycling: Mining big data to have an in-depth understanding of the association of weather variability with cycling on an off-road trail and an on-road bike lane," Transportation Research Part A: Policy and Practice, Elsevier, vol. 111(C), pages 119-135.
    7. Crystal Legacy & David Ashmore & Jan Scheurer & John Stone & Carey Curtis, 2019. "Planning the driverless city," Transport Reviews, Taylor & Francis Journals, vol. 39(1), pages 84-102, January.
    8. Patrick Bonnel & Caroline Bayart & Brett Smith, 2015. "Workshop Synthesis: Comparing and Combining Survey Modes," Post-Print halshs-01663724, HAL.
    9. Gürbüz, Feyza & Turna, Fatma, 2018. "Rule extraction for tram faults via data mining for safe transportation," Transportation Research Part A: Policy and Practice, Elsevier, vol. 116(C), pages 568-579.
    10. Lisa Schweitzer, 2014. "Planning and Social Media: A Case Study of Public Transit and Stigma on Twitter," Journal of the American Planning Association, Taylor & Francis Journals, vol. 80(3), pages 218-238, July.
    11. Abdel-Aty, Mohamed & Lee, Jaeyoung & Siddiqui, Chowdhury & Choi, Keechoo, 2013. "Geographical unit based analysis in the context of transportation safety planning," Transportation Research Part A: Policy and Practice, Elsevier, vol. 49(C), pages 62-75.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mohammad Anwar Alattar & Caitlin Cottrill & Mark Beecroft, 2021. "Sources and Applications of Emerging Active Travel Data: A Review of the Literature," Sustainability, MDPI, vol. 13(13), pages 1-17, June.
    2. Alattar, Mohammad Anwar & Cottrill, Caitlin & Beecroft, Mark, 2021. "Public participation geographic information system (PPGIS) as a method for active travel data acquisition," Journal of Transport Geography, Elsevier, vol. 96(C).
    3. Jiping Cao & Hartwig H. Hochmair & Fisal Basheeh, 2022. "The Effect of Twitter App Policy Changes on the Sharing of Spatial Information through Twitter Users," Geographies, MDPI, vol. 2(3), pages 1-14, September.
    4. Michał Zawodny & Maciej Kruszyna, 2022. "Proposals for Using the Advanced Tools of Communication between Autonomous Vehicles and Infrastructure in Selected Cases," Energies, MDPI, vol. 15(18), pages 1-15, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Andre Carrel & Raja Sengupta & Joan L. Walker, 2017. "The San Francisco Travel Quality Study: tracking trials and tribulations of a transit taker," Transportation, Springer, vol. 44(4), pages 643-679, July.
    2. Xin Li & Bingruo Duan, 2018. "Organizational microblogging for event marketing: a new approach to creative placemaking," International Journal of Urban Sciences, Taylor & Francis Journals, vol. 22(1), pages 59-79, January.
    3. Luca Staricco & Valentina Rappazzo & Jacopo Scudellari & Elisabetta Vitale Brovarone, 2019. "Toward Policies to Manage the Impacts of Autonomous Vehicles on the City: A Visioning Exercise," Sustainability, MDPI, vol. 11(19), pages 1-21, September.
    4. Rahat Sabyrbekov & Indra Overland, 2020. "Why Choose to Cycle in a Low-Income Country?," Sustainability, MDPI, vol. 12(18), pages 1-14, September.
    5. Sarker, Rumana Islam & Kaplan, Sigal & Mailer, Markus & Timmermans, Harry J.P., 2019. "Applying affective event theory to explain transit users’ reactions to service disruptions," Transportation Research Part A: Policy and Practice, Elsevier, vol. 130(C), pages 593-605.
    6. Yaxi Gong & Xiang Ji & Yuan Zhang & Shanshan Cheng, 2023. "Spatial Vitality Evaluation and Coupling Regulation Mechanism of a Complex Ecosystem in Lixiahe Plain Based on Multi-Source Data," Sustainability, MDPI, vol. 15(3), pages 1-32, January.
    7. Mariano J. Rabassa & Mariana Conte Grand & Christian M. García-Witulski, 2021. "Heat warnings and avoidance behavior: evidence from a bike-sharing system," Environmental Economics and Policy Studies, Springer;Society for Environmental Economics and Policy Studies - SEEPS, vol. 23(1), pages 1-28, January.
    8. Jia Guo & Yusak Susilo & Constantinos Antoniou & Anna Pernestål Brenden, 2020. "Influence of Individual Perceptions on the Decision to Adopt Automated Bus Services," Sustainability, MDPI, vol. 12(16), pages 1-13, August.
    9. Ghadiri, Mehdi & Rassafi, Amir Abbas & Mirbaha, Babak, 2019. "The effects of traffic zoning with regular geometric shapes on the precision of trip production models," Journal of Transport Geography, Elsevier, vol. 78(C), pages 150-159.
    10. Yang, Dong & Liao, Shiguan & Venus Lun, Y.H & Bai, Xiwen, 2023. "Towards sustainable port management: Data-driven global container ports turnover rate assessment," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 175(C).
    11. Hong, Jinhyun & Philip McArthur, David & Stewart, Joanna L., 2020. "Can providing safe cycling infrastructure encourage people to cycle more when it rains? The use of crowdsourced cycling data (Strava)," Transportation Research Part A: Policy and Practice, Elsevier, vol. 133(C), pages 109-121.
    12. Luo, Shuli & He, Sylvia Y. & Grant-Muller, Susan & Song, Linqi, 2023. "Influential factors in customer satisfaction of transit services: Using crowdsourced data to capture the heterogeneity across individuals, space and time," Transport Policy, Elsevier, vol. 131(C), pages 173-183.
    13. Fangye Du & Jiaoe Wang & Liang Mao & Jian Kang, 2024. "Daily rhythm of urban space usage: insights from the nexus of urban functions and human mobility," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-10, December.
    14. Shuli Luo & Sylvia Y He, 2021. "Using data mining to explore the spatial and temporal dynamics of perceptions of metro services in China: The case of Shenzhen," Environment and Planning B, , vol. 48(3), pages 449-466, March.
    15. Fabre, Léa & Bayart, Caroline & Bonnel, Patrick & Mony, Nicolas, 2023. "The potential of Wi-Fi data to estimate bus passenger mobility," Technological Forecasting and Social Change, Elsevier, vol. 192(C).
    16. Christine Eisenmann & Bastian Chlond & Clotilde Minster & Christian Jödden & Peter Vortisch, 2019. "Assessing the effects of a mixed-mode design in a longitudinal household travel survey," Transportation, Springer, vol. 46(5), pages 1737-1753, October.
    17. Aschauer, Florian & Hössinger, Reinhard & Jara-Diaz, Sergio & Schmid, Basil & Axhausen, Kay & Gerike, Regine, 2021. "Comprehensive data validation of a combined weekly time use and travel survey," Transportation Research Part A: Policy and Practice, Elsevier, vol. 153(C), pages 66-82.
    18. Luo, Shuli & He, Sylvia Y., 2021. "Understanding gender difference in perceptions toward transit services across space and time: A social media mining approach," Transport Policy, Elsevier, vol. 111(C), pages 63-73.
    19. Liu, Jie & Xu, Yubo & Wang, Lisong, 2022. "Fault information mining with causal network for railway transportation system," Reliability Engineering and System Safety, Elsevier, vol. 220(C).
    20. Millard-Ball, Adam, 2019. "The autonomous vehicle parking problem," Transport Policy, Elsevier, vol. 75(C), pages 99-108.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:socarx:trbv9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://arabixiv.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.