A hands-on project I started after finishing DeepMind's RL course. I first learned JAX and used it for the implementations, but debugging took too much time because documentation and forum discussions were scarce (mostly for Haiku). I decided to abandon the JAX version and re-design and re-implement the project in PyTorch instead.
🚧 The project is still in progress.