Added new curriculum mdp that allows modification on any environment parameters #2777

ooctipus · 2025-06-26T00:27:30Z

Description

This PR created two curriculum mdp that can change any parameter in env instance.
namely modify_term_cfg and modify_env_param.

modify_env_param is a more general version that can override any value belongs to env, but requires user to know the full path to the value.

modify_term_cfg only work with manager_term, but is a more user friendly version that simplify path specification, for example, instead of write "observation_manager.cfg.policy.joint_pos.noise", you instead write "observations.policy.joint_pos.noise", consistent with hydra overriding style

Besides path to value is needed, modify_fn, modify_params is also needed for telling the term how to modify.

Demo 1: difficulty-adaptive modification for all python native data type

# iv -> initial value, fv -> final value
def initial_final_interpolate_fn(env: ManagerBasedRLEnv, env_id, data, iv, fv, get_fraction):
    iv_, fv_ = torch.tensor(iv, device=env.device), torch.tensor(fv, device=env.device)
    fraction = eval(get_fraction)
    new_val = fraction * (fv_ - iv_) + iv_
    if isinstance(data, float):
        return new_val.item()
    elif isinstance(data, int):
        return int(new_val.item())
    elif isinstance(data, (tuple, list)):
        raw = new_val.tolist()
        # assume iv is sequence of all ints or all floats:
        is_int = isinstance(iv[0], int)
        casted = [int(x) if is_int else float(x) for x in raw]
        return tuple(casted) if isinstance(data, tuple) else casted
    else:
        raise TypeError(f"Does not support the type {type(data)}")

(float)

    joint_pos_unoise_min_adr = CurrTerm(
        func=mdp.modify_term_cfg,
        params={
            "address": "observations.policy.joint_pos.noise.n_min",
            "modify_fn": initial_final_interpolate_fn,
            "modify_params": {"iv": 0., "fv": -.1, "get_fraction": "env.command_manager.get_command("difficulty")"}
        }
    )

(tuple or list)

command_object_pose_xrange_adr = CurrTerm(
        func=mdp.modify_term_cfg,
        params={
            "address": "commands.object_pose.ranges.pos_x",
            "modify_fn": initial_final_interpolate_fn,
            "modify_params": {"iv": (-.5, -.5), "fv": (-.75, -.25), "get_fraction": "env.command_manager.get_command("difficulty")"}
        }
    )

Demo 3: overriding entire term on env_step counter rather than adaptive

def value_override(env: ManagerBasedRLEnv, env_id, data, new_val, num_steps):
    if env.common_step_counter > num_steps:
        return new_val
    return mdp.modify_term_cfg.NO_CHANGE

object_pos_curriculum = CurrTerm(
        func=mdp.modify_term_cfg,
        params={
            "address": "commands.object_pose",
            "modify_fn": value_override,
            "modify_params": {"new_val": <new_observation_term>, "num_step": 120000 }
        }
    )

Demo 4: overriding Tensor field within some arbitary class not visible from term_cfg
(you can see that 'address' is not as nice as mdp.modify_term_cfg)

def resample_bucket_range(env: ManagerBasedRLEnv, env_id, data, static_friction_range, dynamic_friction_range, restitution_range, num_steps):
    if env.common_step_counter > num_steps:
          range_list = [static_friction_range, dynamic_friction_range, restitution_range]
          ranges = torch.tensor(range_list, device="cpu")
          new_buckets = math_utils.sample_uniform(ranges[:, 0], ranges[:, 1], (len(data), 3), device="cpu")
          return new_buckets
    return mdp.modify_env_param.NO_CHANGE

object_physics_material_curriculum = CurrTerm(
        func=mdp.modify_env_param,
        params={
            "address": "event_manager.cfg.object_physics_material.func.material_buckets",
            "modify_fn": resample_bucket_range,
            "modify_params": {"static_friction_range": [.5, 1.], "dynamic_friction_range": [.3, 1.], "restitution_range": [0.0, 0.5], "num_step": 120000 }
        }
    )

Type of change

New feature (non-breaking change which adds functionality)

Checklist

I have run the pre-commit checks with ./isaaclab.sh --format
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
I have updated the changelog and the corresponding version in the extension's config/extension.toml file
I have added my name to the CONTRIBUTORS.md or my name already exists there

ooctipus · 2025-06-26T00:31:08Z

@jtigue-bdai Feel free to view and provide some feedback

jtigue-bdai

Thanks for this @ooctipus, we don't currently have tests for mdp terms but do you think you could put together a unit test for this? Because it has the potential for touching so many things I think it would be good to get some unit tests for it.

jtigue-bdai · 2025-06-26T12:57:53Z

source/isaaclab/isaaclab/envs/mdp/curriculums.py

+    Reads `cfg.params["address"]`, replaces only the first occurrence of "s."
+    with "_manager.cfg.", and then behaves identically to ModifyEnvParam.
+
+    for example: command_manager.cfg.object_pose.ranges.xpos -> commands.object_pose.ranges.xpos


In the example here can you show an example use of this so the syntax is clear?

source/isaaclab/isaaclab/envs/mdp/curriculums.py

jtigue-bdai · 2025-06-26T15:19:59Z

source/isaaclab/isaaclab/envs/mdp/curriculums.py

+    This term compiles getter/setter accessors for a target attribute (specified by
+    `cfg.params["address"]`) the first time it is called, then on each invocation
+    reads the current value, applies a user-provided `modify_fn`, and writes back
+    the result.


Can you add an example code snippet on how you would use this?

source/isaaclab/docs/CHANGELOG.rst

jtigue-bdai · 2025-06-26T15:24:37Z

source/isaaclab/isaaclab/envs/mdp/curriculums.py

+        if isinstance(self.container, tuple):
+            getter = lambda: self.container[self.last]
+
+            def setter(val):
+                tuple_list = list(self.container)
+                tuple_list[self.last] = val
+                self.container = tuple(tuple_list)
+
+        elif isinstance(self.container, dict):
+            getter = lambda: self.container[self.last]
+
+            def setter(val):
+                self.container[self.last] = val
+
+        elif isinstance(self.container, object):
+            getter = lambda: getattr(self.container, self.last)
+
+            def setter(val):
+                setattr(self.container, self.last, val)


do we need to add a condition for single values (i.e. int, float, bool, etc) or does the object condition handle this?

we don't need for single values, because the object condition handle this, the type check is checking the container not the last.

for example, observations.policy.joint_pos.unoise.n_min has value -0.1,
then the self.container becomes unoise, a object, self.last becomes n_min.

there are three kinds of container as far as I can think of, tuple, dict, or object. So the condition should be complete

source/isaaclab/test/envs/test_modify_env_param_curr_term.py

…hing, and wrote test for this modify_env_param and modify_term_cfg

ooctipus requested review from jsmith-bdai, kellyguo11 and Mayankm96 as code owners June 26, 2025 00:27

jtigue-bdai reviewed Jun 26, 2025

View reviewed changes

add modifycation to any env parameter curriculum mdp

986d36e

jtigue-bdai mentioned this pull request Jun 27, 2025

Adds modify_environment_parameter to curriculums #2696

Closed

6 tasks

ooctipus requested a review from pascal-roth as a code owner June 27, 2025 19:34

ooctipus force-pushed the feat/modify_env_param_curriculum branch from e28803f to 8957e93 Compare June 27, 2025 19:35

jtigue-bdai reviewed Jun 27, 2025

View reviewed changes

source/isaaclab/test/envs/test_modify_env_param_curr_term.py Outdated Show resolved Hide resolved

ooctipus added 2 commits June 27, 2025 14:26

introduce NO_CHANGE token to prevent modify_env_param from doing anyt…

5066bff

…hing, and wrote test for this modify_env_param and modify_term_cfg

pass pre-commit

60a0b87

ooctipus force-pushed the feat/modify_env_param_curriculum branch from 8957e93 to 60a0b87 Compare June 27, 2025 21:27

added support for list case with simpler setter compared to tuple

63e6f1d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added new curriculum mdp that allows modification on any environment parameters #2777

Added new curriculum mdp that allows modification on any environment parameters #2777

ooctipus commented Jun 26, 2025 •

edited

Loading

Uh oh!

ooctipus commented Jun 26, 2025

Uh oh!

jtigue-bdai left a comment

Uh oh!

jtigue-bdai Jun 26, 2025

Uh oh!

Uh oh!

jtigue-bdai Jun 26, 2025

Uh oh!

Uh oh!

jtigue-bdai Jun 26, 2025

Uh oh!

ooctipus Jun 26, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Added new curriculum mdp that allows modification on any environment parameters #2777

Are you sure you want to change the base?

Added new curriculum mdp that allows modification on any environment parameters #2777

Conversation

ooctipus commented Jun 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of change

Checklist

Uh oh!

ooctipus commented Jun 26, 2025

Uh oh!

jtigue-bdai left a comment

Choose a reason for hiding this comment

Uh oh!

jtigue-bdai Jun 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jtigue-bdai Jun 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jtigue-bdai Jun 26, 2025

Choose a reason for hiding this comment

Uh oh!

ooctipus Jun 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ooctipus commented Jun 26, 2025 •

edited

Loading

ooctipus Jun 26, 2025 •

edited

Loading