1 parent 037d99c, commit 15aedd0
README.md
@@ -34,8 +34,10 @@ some example:
## Performance of Priority DQN
Training ran for 45k episodes; the maximum mean evaluation score is 7700 (evaluated over 50 episodes).
-
+
+
## Update
1. Add max steps and max illegal steps per episode;
+2. Add DQN agent and training information;