Action Normalization #57

cheng-chi · 2023-05-02T05:23:43Z

As per discussion with @snasiriany, this is my current implementation of action normalization which is required for diffusion policy integration. These code are not fully tested and is meant to be a starting point for discussions.

DO NOT MERGE

snasiriany

Looks mostly good! Left some comments, mainly about naming and doc strings.

Also, should we infer hdf5_normlize_action from the dataset, rather than manually specifying it in the config?

snasiriany · 2023-05-03T18:11:20Z

robomimic/algo/algo.py

@@ -421,7 +421,7 @@ class RolloutPolicy(object):
    """
    Wraps @Algo object to make it easy to run policies in a rollout loop.
    """
-    def __init__(self, policy, obs_normalization_stats=None):
+    def __init__(self, policy, obs_normalization_stats=None, action_normalization_stats=None):


can you add some comments in the function docstring for action_normalization_stats? Similar to how it's already done for obs_normalization_stats

snasiriany · 2023-05-03T18:13:15Z

robomimic/algo/algo.py

@@ -474,4 +475,7 @@ def __call__(self, ob, goal=None):
        if goal is not None:
            goal = self._prepare_observation(goal)
        ac = self.policy.get_action(obs_dict=ob, goal_dict=goal)
-        return TensorUtils.to_numpy(ac[0])
+        ac = TensorUtils.to_numpy(ac)


any reason for changing ac[0] to ac? Can we keep things as ac[0]?

snasiriany · 2023-05-03T18:14:01Z

robomimic/config/base_config.py

@@ -156,6 +156,8 @@ class has a default implementation that usually doesn't need to be overriden.
        # of each observation in each dimension, computed across the training set. See SequenceDataset.normalize_obs
        # in utils/dataset.py for more information.
        self.train.hdf5_normalize_obs = False
+
+        self.train.hdf5_normalize_action = False


can you add a comment to describe the use case (similar to rest of file)

snasiriany · 2023-05-03T18:14:54Z

robomimic/utils/dataset.py

@@ -30,6 +30,7 @@ def __init__(
        hdf5_cache_mode=None,
        hdf5_use_swmr=True,
        hdf5_normalize_obs=False,
+        hdf5_normalize_action=False,


function docstring needs comment for this attribute

snasiriany · 2023-05-03T18:16:16Z

robomimic/utils/obs_utils.py

@@ -499,6 +499,17 @@ def normalize_obs(obs_dict, obs_normalization_stats):

    return obs_dict

+def normalize_actions(actions, action_normalization_stats):


small nitpick: our convention here is to use " rather than '. can you make the style change?

also, both normalize_actions and unnormalize_actions need docstring

snasiriany · 2023-05-04T19:38:17Z

robomimic/utils/dataset.py

@@ -366,6 +372,99 @@ def get_obs_normalization_stats(self):
        assert self.hdf5_normalize_obs, "not using observation normalization!"
        return deepcopy(self.obs_normalization_stats)

+    def normalize_actions(self):


Naming of this function may be confused for normalize_actions in ObsUtils. How about renaming this to get_action_normalization_stats?

snasiriany · 2023-05-04T19:40:03Z

robomimic/utils/dataset.py

+            return obs_traj
+
+        ep = self.dataset.demos[0]
+        obs_traj = get_obs_traj(ep)


naming of obs here might be confused for observations that we pass into the policy (eg. images). Can we replace this term to be more general? And all other places where we name things with obs in this function

cheng-chi added 2 commits May 2, 2023 00:19

added action normalization stats to SequenceDataset

abbee68

initial implementation of action normalization

8662524

snasiriany requested changes May 4, 2023

View reviewed changes

cheng-chi added 5 commits May 8, 2023 20:28

added script to set attr

ae4bd7d

addressed most comments

ed5e8c6

debug checkout

9a92b36

fixed ac

a2c2e47

modified env_args also

c169db8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Action Normalization #57

Action Normalization #57

cheng-chi commented May 2, 2023 •

edited

Loading

snasiriany left a comment

snasiriany May 3, 2023

snasiriany May 3, 2023

snasiriany May 3, 2023

snasiriany May 3, 2023

snasiriany May 3, 2023

snasiriany May 3, 2023

snasiriany May 4, 2023

snasiriany May 4, 2023

		@@ -499,6 +499,17 @@ def normalize_obs(obs_dict, obs_normalization_stats):

		return obs_dict

		def normalize_actions(actions, action_normalization_stats):

Action Normalization #57

Are you sure you want to change the base?

Action Normalization #57

Conversation

cheng-chi commented May 2, 2023 • edited Loading

snasiriany left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cheng-chi commented May 2, 2023 •

edited

Loading