IDEAS home Printed from https://ideas.repec.org/h/elg/eechap/21868_4.html
   My bibliography  Save this book chapter

The ensemble learning process for short-term prediction of traffic state on rural roads

In: Handbook on Artificial Intelligence and Transport

Author

Listed:
  • Arash Rasaizadi
  • Fateme Hafizi
  • Seyedehsan Seyedabrishami

Abstract

Rural road traffic data including, speed, volume, density, travel time, and traffic state are considered “Big Data” due to high volume, speed of data generation, and varieties including videos, texts, and quantitative data. The analysis and prediction of this data for the short-term future provide real-time information for road operators and passengers to better plan their trips. Among traffic parameters, speed, volume, density, and travel time are quantitative, and traffic state is qualitative—usually classified into light, semi-heavy, and heavy states. In this chapter, traffic data from rural roads covering a period of five years is applied to the calibration of statistical time series and machine learning algorithms to explore influential factors on a real-time traffic state. First, the traffic database is shown with calendar data including season, month, week, day, hour, holiday, sequence of holidays, and weather data obtained from meteorological stations. Among the aforementioned variables, several variables are defined in cyclical form, and others are converted to dummy variables. In the second step, time series regression, long short-term memory (LSTM), random forest (RF), support vector machine (SVM), and K-nearest neighbours (KNN) are trained by using the first three years of data as a training dataset, and their performance evaluated using the remaining two years as a test dataset. In terms of accuracy, the RF model showed superiority over other models (RF accuracy, 76.9%). In terms of balanced accuracy, SVM was more accurate than RF to predict light traffic state (SVM accuracy, 78.7%). The maximum balanced accuracy for predicting semi-heavy and heavy states was achieved by RF (69.9% and 53.8%, respectively). There was no single model with the highest accuracy for each month of a year. These two reasons were motivations to use the ordered logit (OL) model in the ensemble learning process. The input of this process is the output of the base models. Combining the outputs of single models provides a single output that is expected to be more accurate than the base models. After calibrating the OL model using the predictions from the base models for the first year of the test data set, all models were evaluated on data from the second year. The highest accuracy was obtained using the OL model in the ensemble learning process (an accuracy of 82.2%).

Suggested Citation

  • Arash Rasaizadi & Fateme Hafizi & Seyedehsan Seyedabrishami, 2023. "The ensemble learning process for short-term prediction of traffic state on rural roads," Chapters, in: Hussein Dia (ed.), Handbook on Artificial Intelligence and Transport, chapter 4, pages 102-123, Edward Elgar Publishing.
  • Handle: RePEc:elg:eechap:21868_4
    as

    Download full text from publisher

    File URL: https://www.elgaronline.com/doi/10.4337/9781803929545.00010
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:elg:eechap:21868_4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Darrel McCalla (email available below). General contact details of provider: http://www.e-elgar.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.