We are given a multivariate dataset consisting of 7 ordered features, with 68528 data points per feature. An overview of the 7 ordered features of the data frame follows.
We tried two scenarios:
a) Split the dataset into a training set and a test set.
b) Feed all examples into the training process and validate on the same data.
When we implemented the second scenario with a simple model, we obtained much better results, so in the subsequent experiments we consistently used the entire dataset for training during model selection.
We first had to make sure the dataset was clean and preprocessed: no wrong or NaN values and no significant time gaps in the data. Since ANNs are sensitive to the numerical scale of their inputs, we normalized all training inputs to lie within the same range.
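The report does not state which normalization was used; a minimal sketch, assuming per-feature min-max scaling to [0, 1], could look like this (the function name is ours):

```python
import numpy as np

def min_max_normalize(data):
    """Scale each feature (column) of a (n_samples, n_features)
    array to the [0, 1] range.

    Returns the scaled array plus the per-feature min and max,
    which are needed to apply the same transform to test data
    and to map predictions back to the original scale.
    """
    col_min = data.min(axis=0)
    col_max = data.max(axis=0)
    scaled = (data - col_min) / (col_max - col_min)
    return scaled, col_min, col_max
```

The statistics must be computed on the training data only and reused everywhere else, otherwise information leaks from the evaluation data into the model.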
Being a multivariate time series forecasting problem, we have to provide a prediction for each time step in the test prediction window. The networks we implemented mostly used the Mean Absolute Error (MAE) as a metric. The metric used to evaluate models and rank the teams on the Codalab leaderboard is the Root Mean Squared Error (RMSE).
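For reference, the two metrics can be computed as follows (a plain NumPy sketch, equivalent to the standard definitions):

```python
import numpy as np

def mae(y_true, y_pred):
    """Mean Absolute Error: average of |error|, the metric we
    monitor during training."""
    return np.mean(np.abs(y_true - y_pred))

def rmse(y_true, y_pred):
    """Root Mean Squared Error: square root of the average squared
    error, the metric used on the Codalab leaderboard."""
    return np.sqrt(np.mean((y_true - y_pred) ** 2))
```

RMSE penalizes large errors more heavily than MAE, so a model tuned on MAE is not automatically optimal for the leaderboard metric.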
In this section, we implemented several different model scenarios. We always kept the window equal to 200, the stride equal to 20, and the telescope equal to 50.
| Model (window=200, stride=20, telescope=50) | Result (RMSE) |
| --- | --- |
| Bidirectional LSTM with two-layer LSTM, different units | 176.7613220215 |
| One-dimensional CNN | 27.7518043518 |
| Simple GRU | 17.8791122437 |
| Simple LSTM | 16.9558753967 |
| Bidirectional LSTM without cascade | 16.2943630219 |
| Bidirectional LSTM with single-layer LSTM and single-layer GRU | 13.6941833496 |
| Bidirectional LSTM with two-layer LSTM | 11.5225048065 |
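To make the window, stride, and telescope parameters concrete, a possible sequence-building routine (our own sketch; the report does not show its exact slicing code) is:

```python
import numpy as np

def build_sequences(data, window=200, stride=20, telescope=50):
    """Slice a (time, features) array into (input, target) pairs.

    Each input covers `window` consecutive time steps; its target is
    the following `telescope` steps. Consecutive windows start
    `stride` steps apart.
    """
    X, y = [], []
    for start in range(0, len(data) - window - telescope + 1, stride):
        X.append(data[start:start + window])
        y.append(data[start + window:start + window + telescope])
    return np.array(X), np.array(y)
```

A smaller stride yields more (overlapping) training sequences from the same series, which is one reason the later experiments with stride 1-2 improve on stride 20.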
| Bidirectional LSTM with two-layer LSTM | Result (RMSE) |
| --- | --- |
| 1. Model with window=200, stride=10, telescope=100 | 10.0559186935 |
| 2. Model with window=200, stride=10, telescope=100 | 9.1587381363 |
| 3. Model with window=600, stride=2, telescope=864 | 4.7141537666 |
| 4. Model with window=600, stride=1, telescope=864 | 4.0347905159 |
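The report does not list the layer sizes of the winning architecture; a hypothetical Keras sketch of a bidirectional LSTM stacked on a second LSTM layer, with illustrative unit counts (128/64) and dropout rate, could look like this:

```python
import tensorflow as tf
from tensorflow.keras import layers, models

# Shapes from the report: window=600 input steps, telescope=864
# output steps, 7 features. Unit counts and dropout are assumptions.
WINDOW, TELESCOPE, N_FEATURES = 600, 864, 7

def build_model():
    inp = layers.Input(shape=(WINDOW, N_FEATURES))
    # Bidirectional LSTM feeding a second (plain) LSTM layer.
    x = layers.Bidirectional(layers.LSTM(128, return_sequences=True))(inp)
    x = layers.LSTM(64)(x)
    x = layers.Dropout(0.2)(x)
    # Predict all telescope steps for all features at once.
    x = layers.Dense(TELESCOPE * N_FEATURES)(x)
    out = layers.Reshape((TELESCOPE, N_FEATURES))(x)
    model = models.Model(inp, out)
    model.compile(optimizer='adam', loss='mse', metrics=['mae'])
    return model
```

The final Dense + Reshape pair produces the full (telescope, features) prediction window in a single forward pass.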
Even though our model already uses Dropout, we also implement early stopping to tackle overfitting:
```python
from tensorflow.keras.callbacks import EarlyStopping

early_stopping = EarlyStopping(monitor='val_loss',
                               mode='min',
                               patience=10,
                               restore_best_weights=True)
```
The goal is to stop training when the validation error starts increasing. When training plateaus, we reduce the learning rate:
```python
from tensorflow.keras.callbacks import ReduceLROnPlateau

reduce_lr = ReduceLROnPlateau(monitor='val_loss',
                              mode='min',
                              patience=5,
                              factor=0.5,
                              min_lr=1e-5)
```
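A minimal sketch of how the two callbacks are wired into training follows; the tiny Dense model and random data here are placeholders, not the real setup:

```python
import numpy as np
import tensorflow as tf
from tensorflow.keras.callbacks import EarlyStopping, ReduceLROnPlateau

callbacks = [
    EarlyStopping(monitor='val_loss', mode='min', patience=10,
                  restore_best_weights=True),
    ReduceLROnPlateau(monitor='val_loss', mode='min', patience=5,
                      factor=0.5, min_lr=1e-5),
]

# Placeholder model and data, just to show the fit() call shape.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(4,)),
    tf.keras.layers.Dense(1),
])
model.compile(optimizer='adam', loss='mse')

X = np.random.rand(64, 4).astype('float32')
y = np.random.rand(64, 1).astype('float32')
history = model.fit(X, y, validation_split=0.2, epochs=3,
                    batch_size=16, callbacks=callbacks, verbose=0)
```

Both callbacks watch `val_loss`, so a held-out validation split is required even when, as in our second scenario, all examples are otherwise used for training.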
After first inspecting the structure of our time series dataset and deciding to use all data for both training and validation, we tested different model architectures. In the first step of model selection, various models were evaluated with fixed window, stride and telescope values. Based on the results on Codalab, we kept the architecture with the lowest RMSE: the Bidirectional LSTM with a two-layer LSTM, with an RMSE of 11.5225048065.
In the second step of model selection, we tuned the hyperparameters of the architecture selected in the first step. The best result we obtained is an RMSE of 4.0347905159 with a two-layer bidirectional LSTM with window=600, stride=1 and telescope=864, which we therefore chose as our best model.