Note that there may be minor bugs in the code.
You can run this code on your own machine or on Google Colab.
- Local option: If you choose to run locally, you will need to install MuJoCo and some Python packages; see installation.md from homework 1 for instructions. If you completed this installation for homework 1, you do not need to repeat it.
- Colab: The first few sections of the notebook will install all required dependencies. You can try out the Colab option by clicking the badge below:
The following files have blanks to be filled with your solutions from homework 1. The relevant sections are marked with "TODO: get this from hw1".
You will then need to complete the following new files for homework 2. The relevant sections are marked with "TODO".
You will also want to look through scripts/run_hw2.py (if running locally) or scripts/run_hw2.ipynb (if running on Colab), though you will not need to edit these files beyond changing runtime arguments in the Colab notebook.
You will be running your policy gradients implementation in four experiments total, investigating the effects of design decisions like reward-to-go estimators, neural network baselines for variance reduction, and advantage normalization. See the assignment PDF for more details.
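Two of the design decisions mentioned above, reward-to-go return estimation and advantage normalization, can be sketched in a few lines. This is an illustrative sketch only, not the assignment's reference implementation; the function names and the discount parameter `gamma` are assumptions for the example:

```python
import numpy as np

def reward_to_go(rewards, gamma=0.99):
    # Discounted reward-to-go: the return at step t sums only rewards
    # from t onward, rather than the full-trajectory return.
    rtg = np.zeros(len(rewards))
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        rtg[t] = running
    return rtg

def normalize_advantages(adv, eps=1e-8):
    # Standardize advantages to zero mean and unit variance,
    # a common variance-reduction trick for policy gradients.
    adv = np.asarray(adv, dtype=float)
    return (adv - adv.mean()) / (adv.std() + eps)
```

For example, with `gamma=1.0` the rewards `[1, 1, 1]` yield reward-to-go values `[3, 2, 1]`: later timesteps do not receive credit for rewards that came before them.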
We have provided a snippet for reading your Tensorboard event files in scripts/read_results.py. Reading these event files and plotting them with matplotlib or seaborn will produce the cleanest results for your submission. For debugging purposes, we recommend visualizing the Tensorboard logs with `tensorboard --logdir data`.