Dynamic Programming:
- Value Iteration
- Policy Iteration
Temporal-Difference Learning:
- Q-Learning
- SARSA
Policy Gradient Methods:
- REINFORCE
Reference: Reinforcement Learning: An Introduction by Sutton and Barto.
Nonparametric Methods:
- Histograms
- Kernel Density Estimators
- Nearest Neighbours
Reference: Pattern Recognition and Machine Learning by Bishop.
Mixture Models and EM:
- K-means Clustering
- Mixtures of Gaussians
- An Alternative View of EM
Reference: Pattern Recognition and Machine Learning by Bishop.
- Gaussian Processes
- Support Vector Machines
- Bayesian Optimization
- LQR
- Mixed Integer Programming
- OptNet