MJX Training Implementation #1

michael-lutz · 2024-05-23T23:59:04Z

The following is copied from kscalelabs/sim#14

This PR introduces a new way to massively scale up locomotion training. Building upon Brax, it ultimately uses the MJX physics engine for simulation.

Structure

Specifically, this PR includes the following directories:

Envs
Experiments
Utils
(example) Weights
train.py
play.py

Envs includes two types of Brax environments: DefaultHumanoidEnv and StompyEnv. Each environment includes a main class which implements the Brax environment interface and utilizes MJX for all physics calculations. One important thing to note is that reward functions are modular, allowing for quick experimentation.

Experiments includes two .yaml files that include sample configurations for model training.

Utils include default values, rendering rollouts, etc.

Weights currently include default humanoid weights (for locomotion) that should work out of the box

train.py and play.py both integrate with wandb. train.py utilizes the Brax implementation of PPO for now, but can be easily customized if needed.

Performance Samples

Training Curves

Example humanoid robot walking in MJX
https://github.com/kscalelabs/sim/assets/43460304/8e12b0e6-48ea-4af0-8283-1dc4880767b4

Humanoid trained in MJX, eval in CPU-based MuJoCo
https://github.com/kscalelabs/sim/assets/43460304/7f158aeb-6bc9-4056-bd1d-12882adbd13c

…x and mjx

budzianowski

A couple of nits, lgtm otherwise!

budzianowski · 2024-05-24T00:27:37Z

ksim/mjx_gym/envs/__init__.py

@@ -0,0 +1,10 @@
+from brax import envs


If you can omit putting stuff here, that would be preferable.

budzianowski · 2024-05-24T00:28:02Z

ksim/mjx_gym/envs/default_humanoid_env/default_humanoid.py

@@ -0,0 +1,152 @@
+import jax


Adding one line explanation would be useful.

budzianowski · 2024-05-24T00:28:08Z

ksim/mjx_gym/envs/default_humanoid_env/default_humanoid.py

@@ -0,0 +1,152 @@
+import jax
+import jax.numpy as jp


budzianowski · 2024-05-24T00:30:20Z

ksim/mjx_gym/envs/default_humanoid_env/default_humanoid.py

+            Observations of the environment.
+        """
+        position = data.qpos
+        if self._exclude_current_positions_from_observation:


Why this is needed?

codekansas

lgtm

michael-lutz added 20 commits May 23, 2024 21:11

feat: created new default humanoid environment class implementing bra…

3592d06

…x and mjx

feat: ran initial training with new MJX environment

033d01f

feat: added nicer training script

31c1b35

feat: added training config for stompy

3fa16ca

feat: storing weight checkpoints and added play

4df08c3

feat: added stomppy environment

d3a642c

feat: added model checkpointing

778476b

feat: printing when rendering

c443164

feat: removed unnecessary model checkpointing

e018d18

chore: cleaned up play script and added CPU-only rendering

9c18a08

chore: removed brax replication

ede4b2e

fix: fixing stompy environment import issues and created simplified mesh

23836dc

chore: sorting libraries, typing, etc

8eaf94a

chore: cleaned up training scripts and file organization

02c1097

chore: removed unused imports

f1b3424

fix: removing duplicate exclude definition in pyproject

58e84cc

fix: moved mjx_gym to ksim folder

a34e181

chore: removing duplicate folders

398ae86

feat: updated the readme with relevant getting-started information

8fe5c18

chore: fixed spacing and updated setup.py

8fc8fef

michael-lutz requested review from codekansas and budzianowski May 23, 2024 23:59

michael-lutz self-assigned this May 24, 2024

michael-lutz added the enhancement New feature or request label May 24, 2024

michael-lutz added 2 commits May 24, 2024 00:15

chore: updating pyproject

498953f

thing

b0ec4e2

budzianowski approved these changes May 24, 2024

View reviewed changes

codekansas and others added 3 commits May 24, 2024 00:40

fix some types

6d0bc2b

fix typing

c88da97

chore: fixing import ordering

2701d57

codekansas approved these changes May 24, 2024

View reviewed changes

michael-lutz enabled auto-merge (squash) May 24, 2024 02:25

michael-lutz disabled auto-merge May 24, 2024 02:26

michael-lutz merged commit 84606b8 into master May 24, 2024
1 check passed

michael-lutz deleted the transfer-branch branch May 24, 2024 02:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MJX Training Implementation #1

MJX Training Implementation #1

michael-lutz commented May 23, 2024

budzianowski left a comment

budzianowski May 24, 2024

budzianowski May 24, 2024

budzianowski May 24, 2024

budzianowski May 24, 2024

codekansas left a comment

MJX Training Implementation #1

MJX Training Implementation #1

Conversation

michael-lutz commented May 23, 2024

Structure

Performance Samples

budzianowski left a comment

Choose a reason for hiding this comment

budzianowski May 24, 2024

Choose a reason for hiding this comment

budzianowski May 24, 2024

Choose a reason for hiding this comment

budzianowski May 24, 2024

Choose a reason for hiding this comment

budzianowski May 24, 2024

Choose a reason for hiding this comment

codekansas left a comment

Choose a reason for hiding this comment