IDEAS home Printed from https://ideas.repec.org/a/eee/transb/v44y2010i5p662-673.html
   My bibliography  Save this article

Bayesian mixture modeling approach to account for heterogeneity in speed data

Author

Listed:
  • Park, Byung-Jung
  • Zhang, Yunlong
  • Lord, Dominique

Abstract

Speed is one of the most important parameters describing the condition of the traffic flow. Many analytical models related to traffic flow either produce speed as a performance measure, or use speed to determine other measures such as travel time, delay, and the level of service. Mathematical models or distributions used to describe speed characteristics are very useful, especially when they are utilized in the context of simulation and theoretical derivations. Traditionally, normal, log-normal and composite distributions have been the usual mathematical distributions to characterize speed data. These traditional distributions, however, often fail to produce an adequate goodness-of-fit when the empirical distribution of speed data exhibits bimodality (or multimodality), skewness, or excess kurtosis (peakness). This often occurs when the speed data are generated from several different sub-populations, for example, mixed traffic flow conditions or mixed vehicle compositions. The traditional modeling approach also lacks the ability to explain the underlying factors that lead to different speed distribution curves. The objective of this paper is to explore the applicability of the finite mixture of normal (Gaussian) distributions to capture the heterogeneity in vehicle speed data, and thereby explaining the aforementioned special characteristics. For the parameter estimation, Bayesian estimation method via Markov Chain Monte Carlo (MCMC) sampling is adopted. The field data collected on IH-35 in Texas is used to evaluate the proposed models. The results of this study show that the finite mixture of normal distributions can very effectively describe the heterogeneous speed data, and provide richer information usually not available from the traditional models. The finite mixture modeling produces an excellent fit to the multimodal speed distribution curve. Moreover, the causes of different speed distributions can be identified through investigating the components.

Suggested Citation

  • Park, Byung-Jung & Zhang, Yunlong & Lord, Dominique, 2010. "Bayesian mixture modeling approach to account for heterogeneity in speed data," Transportation Research Part B: Methodological, Elsevier, vol. 44(5), pages 662-673, June.
  • Handle: RePEc:eee:transb:v:44:y:2010:i:5:p:662-673
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0191-2615(10)00019-6
    Download Restriction: Full text for ScienceDirect subscribers only
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Sylvia. Richardson & Peter J. Green, 1997. "On Bayesian Analysis of Mixtures with an Unknown Number of Components (with discussion)," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 59(4), pages 731-792.
    2. Jie Q. Guo & Pravin K. Trivedi, 2002. "Flexible Parametric Models for Long‐tailed Patent Count Distributions," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 64(1), pages 63-82, February.
    3. Peiming Wang & Iain Cockburn & Martin L. Puterman, "undated". "A Mixed Poisson Regression Model for Analysis of Patent Data," Computing in Economics and Finance 1996 _049, Society for Computational Economics.
    4. David J. Spiegelhalter & Nicola G. Best & Bradley P. Carlin & Angelika Van Der Linde, 2002. "Bayesian measures of model complexity and fit," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(4), pages 583-639, October.
    5. Matthew Stephens, 2000. "Dealing with label switching in mixture models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 62(4), pages 795-809.
    6. Peter Lenk & Wayne DeSarbo, 2000. "Bayesian inference for finite mixtures of generalized linear models with random effects," Psychometrika, Springer;The Psychometric Society, vol. 65(1), pages 93-119, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jenelius, Erik & Koutsopoulos, Haris N., 2013. "Travel time estimation for urban road networks using low frequency probe vehicle data," Transportation Research Part B: Methodological, Elsevier, vol. 53(C), pages 64-81.
    2. Hu, Yang & Baraldi, Piero & Di Maio, Francesco & Zio, Enrico, 2015. "A particle filtering and kernel smoothing-based approach for new design component prognostics," Reliability Engineering and System Safety, Elsevier, vol. 134(C), pages 19-31.
    3. Kuo, Pei-Fen & Lord, Dominique, 2013. "Accounting for site-selection bias in before–after studies for continuous distributions: Characteristics and application using speed data," Transportation Research Part A: Policy and Practice, Elsevier, vol. 49(C), pages 256-269.
    4. Drobetz, Wolfgang & Merikas, Andreas & Merika, Anna & Tsionas, Mike G., 2014. "Corporate social responsibility disclosure: The case of international shipping," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 71(C), pages 18-44.
    5. Fang, Zhenyuan & Zhu, Shichao & Fu, Xin & Liu, Fang & Huang, Helai & Tang, Jinjun, 2022. "Multivariate analysis of traffic flow using copula-based model at an isolated road intersection," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 599(C).
    6. Parry, Katharina & Hazelton, Martin L., 2013. "Bayesian inference for day-to-day dynamic traffic models," Transportation Research Part B: Methodological, Elsevier, vol. 50(C), pages 104-115.
    7. Han, Gain & Sohn, Keemin, 2016. "Activity imputation for trip-chains elicited from smart-card data using a continuous hidden Markov model," Transportation Research Part B: Methodological, Elsevier, vol. 83(C), pages 121-135.
    8. Shen, Luou & Liu, Jiajun & Yao, Zhihong & Wu, Weitiao & Yang, Hongtai, 2022. "A new queue shock wave theory based on platoon dispersion modeling," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 603(C).
    9. Xiong, Yingge & Mannering, Fred L., 2013. "The heterogeneous effects of guardian supervision on adolescent driver-injury severities: A finite-mixture random-parameters approach," Transportation Research Part B: Methodological, Elsevier, vol. 49(C), pages 39-54.
    10. Li, Baibing, 2017. "Stochastic modeling for vehicle platoons (I): Dynamic grouping behavior and online platoon recognition," Transportation Research Part B: Methodological, Elsevier, vol. 95(C), pages 364-377.
    11. Lee, Minseo & Sohn, Keemin, 2015. "Inferring the route-use patterns of metro passengers based only on travel-time data within a Bayesian framework using a reversible-jump Markov chain Monte Carlo (MCMC) simulation," Transportation Research Part B: Methodological, Elsevier, vol. 81(P1), pages 1-17.
    12. Eduardo Gutiérrez González & Olga Vladimirovna Panteleeva, 2020. "A model for planning and optimizing an engineering company production," OPSEARCH, Springer;Operational Research Society of India, vol. 57(3), pages 669-699, September.
    13. Wu, Fuliang & Bektaş, Tolga & Dong, Ming & Ye, Hongbo & Zhang, Dali, 2021. "Optimal driving for vehicle fuel economy under traffic speed uncertainty," Transportation Research Part B: Methodological, Elsevier, vol. 154(C), pages 175-206.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yuan Fang & Dimitris Karlis & Sanjeena Subedi, 2022. "Infinite Mixtures of Multivariate Normal-Inverse Gaussian Distributions for Clustering of Skewed Data," Journal of Classification, Springer;The Classification Society, vol. 39(3), pages 510-552, November.
    2. Weber, Anett & Steiner, Winfried J., 2021. "Modeling price response from retail sales: An empirical comparison of models with different representations of heterogeneity," European Journal of Operational Research, Elsevier, vol. 294(3), pages 843-859.
    3. Brian Neelon & A. James O'Malley & Sharon-Lise T. Normand, 2011. "A Bayesian Two-Part Latent Class Model for Longitudinal Medical Expenditure Data: Assessing the Impact of Mental Health and Substance Abuse Parity," Biometrics, The International Biometric Society, vol. 67(1), pages 280-289, March.
    4. Komárek, Arnost, 2009. "A new R package for Bayesian estimation of multivariate normal mixtures allowing for selection of the number of components and interval-censored data," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 3932-3947, October.
    5. Roy Costilla & Ivy Liu & Richard Arnold & Daniel Fernández, 2019. "Bayesian model-based clustering for longitudinal ordinal data," Computational Statistics, Springer, vol. 34(3), pages 1015-1038, September.
    6. Arima, Serena & Basset, Alberto & Jona Lasinio, Giovanna & Pollice, Alessio & Rosati, Ilaria, 2013. "A hierarchical Bayesian model for the ecological status classification of lagoons," Ecological Modelling, Elsevier, vol. 263(C), pages 187-195.
    7. Cai, Jing-Heng & Song, Xin-Yuan & Lam, Kwok-Hap & Ip, Edward Hak-Sing, 2011. "A mixture of generalized latent variable models for mixed mode and heterogeneous data," Computational Statistics & Data Analysis, Elsevier, vol. 55(11), pages 2889-2907, November.
    8. Ungolo, Francesco & Kleinow, Torsten & Macdonald, Angus S., 2020. "A hierarchical model for the joint mortality analysis of pension scheme data with missing covariates," Insurance: Mathematics and Economics, Elsevier, vol. 91(C), pages 68-84.
    9. Papastamoulis, Panagiotis, 2018. "Overfitting Bayesian mixtures of factor analyzers with an unknown number of components," Computational Statistics & Data Analysis, Elsevier, vol. 124(C), pages 220-234.
    10. Kozumi, Hideo, 2004. "Posterior analysis of latent competing risk models by parallel tempering," Computational Statistics & Data Analysis, Elsevier, vol. 46(3), pages 441-458, June.
    11. Xiong, Yingge & Tobias, Justin L. & Mannering, Fred L., 2014. "The analysis of vehicle crash injury-severity data: A Markov switching approach with road-segment heterogeneity," Transportation Research Part B: Methodological, Elsevier, vol. 67(C), pages 109-128.
    12. Royce Anders & William Batchelder, 2015. "Cultural Consensus Theory for the Ordinal Data Case," Psychometrika, Springer;The Psychometric Society, vol. 80(1), pages 151-181, March.
    13. Lu, Xiaosun & Huang, Yangxin & Zhu, Yiliang, 2016. "Finite mixture of nonlinear mixed-effects joint models in the presence of missing and mismeasured covariate, with application to AIDS studies," Computational Statistics & Data Analysis, Elsevier, vol. 93(C), pages 119-130.
    14. Lubrano, Michel & Ndoye, Abdoul Aziz Junior, 2016. "Income inequality decomposition using a finite mixture of log-normal distributions: A Bayesian approach," Computational Statistics & Data Analysis, Elsevier, vol. 100(C), pages 830-846.
    15. Kazuhiko Kakamu, 2022. "Bayesian analysis of mixtures of lognormal distribution with an unknown number of components from grouped data," Papers 2210.05115, arXiv.org, revised Sep 2023.
    16. Terrance Savitsky & Daniel McCaffrey, 2014. "Bayesian Hierarchical Multivariate Formulation with Factor Analysis for Nested Ordinal Data," Psychometrika, Springer;The Psychometric Society, vol. 79(2), pages 275-302, April.
    17. Carnicero, José Antonio & Wiper, Michael Peter, 2008. "A semi-parametric model for circular data based on mixtures of beta distributions," DES - Working Papers. Statistics and Econometrics. WS ws081305, Universidad Carlos III de Madrid. Departamento de Estadística.
    18. S. Upadhyay & M. Peshwani, 2008. "Posterior analysis of lognormal regression models using the Gibbs sampler," Statistical Papers, Springer, vol. 49(1), pages 59-85, March.
    19. Xiong, Yingge & Mannering, Fred L., 2013. "The heterogeneous effects of guardian supervision on adolescent driver-injury severities: A finite-mixture random-parameters approach," Transportation Research Part B: Methodological, Elsevier, vol. 49(C), pages 39-54.
    20. Shiferaw Gurmu & Fidel Pérez-Sebastián, 2008. "Patents, R&D and lag effects: evidence from flexible methods for count panel data on manufacturing firms," Empirical Economics, Springer, vol. 35(3), pages 507-526, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:transb:v:44:y:2010:i:5:p:662-673. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/548/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.