Machine Learning for System and Control

A final project of 5SC28-Machine Learning for System and Control 2021/2022 course at TU Eindhoven. The project is all about the unbalanced disk modelling and controlling it so that it can swing up and also having a $\pm10^{\circ}$ after reaching the $180^{\circ}$ swing-up for the multi-target policy.

Data-driven modelling (System Identification)

The modelling here are using the NARX model structure, where it is implemented in both the Gaussian Process and Artificial Neural Network using scikit-learn and PyTorch respectively. For the Gaussian Process, we managed to use the exact method of inference, thus it may take several hours to train the Gaussian Process. For the grid search Gaussian Process implementation, it provides a good approximations as well.

Data-driven control (Reinforcement Learning)

The overall objective is to control an unbalanced disk to swing-up or making a swing-up policy (see Gym Unbalacned Disk library by Gerben Beintema). There are several methods that we have done, which are:

DQN (Deep Q-Network) with stable-baselines3
A2C (Advantage Actor Critic) with PyTorch. Use the a2c_image.ipynb and a2c_eval.py for generating figures and evaluate the model respectively.
SAC (Soft Actor-Critic) with stable-baselines3
Classical Q-Learning (Tabular Q-learning)
Multi_SAC for multi-target policy $\pm10^{\circ}$ using the SAC method with stable-baselines3

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
data_sets		data_sets
models		models
.DS_Store		.DS_Store
.gitattributes		.gitattributes
.gitignore		.gitignore
ANN&GP.ipynb		ANN&GP.ipynb
DQN.ipynb		DQN.ipynb
Final_GP.ipynb		Final_GP.ipynb
Multi_SAC.ipynb		Multi_SAC.ipynb
README.md		README.md
SAC.ipynb		SAC.ipynb
UnbalancedDisc1.jpeg		UnbalancedDisc1.jpeg
a2c_eval.py		a2c_eval.py
a2c_image.ipynb		a2c_image.ipynb
actor_critic.py		actor_critic.py
clas-Q-learning.ipynb		clas-Q-learning.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Machine Learning for System and Control

Data-driven modelling (System Identification)

Data-driven control (Reinforcement Learning)

About

Releases

Packages

Languages

grafaelw/5SC28-ML4SC

Folders and files

Latest commit

History

Repository files navigation

Machine Learning for System and Control

Data-driven modelling (System Identification)

Data-driven control (Reinforcement Learning)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages