Skip to content

Forecast hourly interstate 94 traffic volume with different Deep Learning approaches

Notifications You must be signed in to change notification settings

lprtk/forecast-hourly-traffic-volume

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Time series and Deep Learning for traffic volume prediction

GitHub issues GitHub forks Github Stars Code style: black

MAP_i94

Table of contents

Content

This project focuses on modeling and forecasting hourly urban traffic volumes between two major U.S. cities in the state of Minnesota, Minneapolis and St. Paul, and evaluating the effects of various contextual characteristics such as days of the week, holidays, and weather conditions on traffic volumes.

The dataset used catalogs hourly traffic volume on westbound Interstate 94 for the Minnesota State Department of Transportation ATR 301 station, roughly halfway between Minneapolis and St Paul. Initially, we have 48,000 observations ranging from October 2, 2012 at 9:00 AM to September 30, 2018 at 11:00 PM. To complete our traffic volume regression task, the company OpenWeatherMap added contextual and meteorological data: national and regional holidays, amount of rain and snow (in mm) that fell in 1 hour, temperature, cloud cover rate.

The objective is to try a large spectrum of approaches from the simplest to the most complex and by confronting the traditional Econometric approach and the new techniques of Machine and Deep Learning: linear regression, LightGBM, GRU, LSTM, Bi-LSTM.
In a second step, we propose an opening towards Deep Hybrid Learning approaches with a CNN-LSTM model. Then, inspired by the ES-RNN, or MES-LSTM, observed during the Makridakis-4 (M4) forecasting contest, we wanted to try to implement a hybrid model merging a linear regression and the very powerful LightGBM. We operated very simply, the objective is to assign a weight α to the regression forecast and 1-α to the LightGBM forecast. This α coefficient is determined by modeling an exponential curve, with a certain coefficient, in order to give more strength to the short-term forecasts of the regression and then to reverse the trend afterwards so as to provide a greater weight to the long-term forecasts of the LightGBM model. The performances of these two merged models are not phenomenal but we wanted to explore this track in order to go further and consider more interesting combinations such as ARIMA-LSTM models.

Requirements

  • Python version 3.9.7

File details

  • forecast_hourly_traffic_volume
    • This is a .ipynb file which contains the code.
  • data
    • This folder contains the data.

Here is the project pattern:

- project 
    > forecast-hourly-traffic-volume
        > image 
            - map.JPG
        - forecast_hourly_traffic_volume.ipynb 
        > data 
            - interstate_traffic.csv

Features

My profilMy GitHubOriginal UCIML dataset