Accessing mean reward for curriculum learning #427
-
|
Hi, I’m currently exploring curriculum learning strategies with mjlab, and I was wondering whether there is an existing way to access the mean reward during training. At the moment, I’m basing the curriculum solely on the number of training steps, which works to some extent but is not very satisfying compared to a performance-based signal. Thanks in advance! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
|
Hi @MarcDcls, The answer is yes! Curriculum functions receive env.reward_manager._episode_sums["your_reward_term"][env_ids].mean()Cheers |
Beta Was this translation helpful? Give feedback.
Hi @MarcDcls,
The answer is yes! Curriculum functions receive
envas their first argument, so you have access to all managers, including the reward manager. You can compute the mean reward with something like:Cheers