Base classes for Off Policy Agents #169

Sharad24 · 2020-06-12T18:59:58Z

This should make the code much more comprehensible, especially given the number of arguments we have, and at the same time resolve a lot of maintainability issues.

sampreet-arthi · 2020-06-15T17:18:02Z

There are also a lot of code duplication issues that show up in pylint. Maybe work on those too.

Sharad24 · 2020-06-16T20:39:15Z

Yup, that's the goal. Another thing to do would be to properly decide which parameters to keep in or remove from the agents, and which to add to or remove from the trainers.

sampreet-arthi · 2020-07-17T13:23:31Z

Since we already have the On Policy Agents, I'm renaming this to Off Policy. I'll raise a PR for this soon.

sampreet-arthi · 2020-07-18T20:29:27Z

I'm thinking of refactoring each of the individual off policy algorithms first so that the code is neater, more uniform and shorter.

sampreet-arthi · 2020-07-31T11:17:13Z

To-do:

Refactor DDPG
Refactor TD3
Refactor SAC
Finalise BaseAgent and Base OffPolicyAgent classes
Refactor Trainer and OffPolicyTrainer (the Trainer classes are also really long; it would be a good idea to shorten them first, and then separate them into multiple files if they're still too big)
Add CUDA support for all of them
Add support for Prioritized Experience Replay for all Off Policy algos
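
For illustration only, here is a minimal sketch of how a BaseAgent / OffPolicyAgent split could hold the shared arguments and replay logic in one place. It is not code from the repository: every name, parameter and default below (`gamma`, `lr`, `replay_size`, `batch_size`, `store_transition`, `sample_batch`, and the toy `RandomAgent`) is an assumption, and it uses only the standard library in place of a real network or the project's own buffer classes.

```python
import random
from abc import ABC, abstractmethod
from collections import deque


class BaseAgent(ABC):
    """Hypothetical base class: arguments common to every agent live here."""

    def __init__(self, env, gamma=0.99, lr=1e-3, seed=None):
        self.env = env
        self.gamma = gamma
        self.lr = lr
        self.rng = random.Random(seed)

    @abstractmethod
    def select_action(self, state):
        """Return an action for the given state."""

    @abstractmethod
    def update_params(self):
        """Run one learning update."""


class OffPolicyAgent(BaseAgent):
    """Hypothetical off-policy base: owns the replay buffer and sampling."""

    def __init__(self, env, replay_size=10000, batch_size=32, **kwargs):
        super().__init__(env, **kwargs)
        self.batch_size = batch_size
        # A bounded deque drops the oldest transition once it is full.
        self.replay_buffer = deque(maxlen=replay_size)

    def store_transition(self, state, action, reward, next_state, done):
        self.replay_buffer.append((state, action, reward, next_state, done))

    def sample_batch(self):
        # Uniform sampling without replacement, capped at the buffer size.
        k = min(self.batch_size, len(self.replay_buffer))
        return self.rng.sample(list(self.replay_buffer), k)


class RandomAgent(OffPolicyAgent):
    """Toy subclass: only the algorithm-specific parts are defined here."""

    def __init__(self, env, n_actions=2, **kwargs):
        super().__init__(env, **kwargs)
        self.n_actions = n_actions
        self.updates = 0

    def select_action(self, state):
        return self.rng.randrange(self.n_actions)

    def update_params(self):
        batch = self.sample_batch()
        self.updates += 1
        return len(batch)  # a real agent would compute a loss on the batch
```

With a layout along these lines, a concrete algorithm such as DDPG, TD3 or SAC would only need to override `select_action` and `update_params`, and a prioritized buffer could be swapped in by changing `OffPolicyAgent` alone.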

Sharad24 · 2020-08-26T10:14:04Z

Tracking in separate issues now: #263, #162 and #264.

sampreet-arthi changed the title ~~Base classes for agents~~ Base classes for Off Policy Agents Jul 17, 2020

sampreet-arthi self-assigned this Jul 18, 2020

sampreet-arthi mentioned this issue Jul 18, 2020

Splitting up DQN #198

Merged

sampreet-arthi mentioned this issue Jul 28, 2020

Use PushReplayBuffer in Off Policy Algos #165

Closed

Sharad24 mentioned this issue Aug 26, 2020

Prioritized Replay Buffer Support for Off Policy Agents #264

Open

Sharad24 closed this as completed Aug 26, 2020
