Goal-Conditional Policy Gradients Quickly learn policies for continuous control in sparse reward environments Blog post with discussion here