Accessing mean reward for curriculum learning #427

MarcDcls · 2025-12-17T16:49:17Z

MarcDcls
Dec 17, 2025

Hi,

I’m currently exploring curriculum learning strategies with mjlab, and I was wondering whether there is an existing way to access the mean reward during training.

At the moment, I’m basing the curriculum solely on the number of training steps, which works to some extent but is not very satisfying compared to a performance-based signal.

Thanks in advance!

Answered by kevinzakka

Dec 18, 2025

Hi @MarcDcls,

The answer is yes! Curriculum functions receive env as their first argument, so you have access to all managers, including the reward manager. You can compute the mean reward with something like:

env.reward_manager._episode_sums["your_reward_term"][env_ids].mean()

Cheers

View full answer

kevinzakka · 2025-12-18T09:17:23Z

kevinzakka
Dec 18, 2025
Maintainer

Hi @MarcDcls,

The answer is yes! Curriculum functions receive env as their first argument, so you have access to all managers, including the reward manager. You can compute the mean reward with something like:

env.reward_manager._episode_sums["your_reward_term"][env_ids].mean()

Cheers

1 reply

MarcDcls Dec 18, 2025
Author

Nice 👍

Thanks for the quick answer!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Accessing mean reward for curriculum learning #427

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Accessing mean reward for curriculum learning #427

Uh oh!

MarcDcls Dec 17, 2025

Replies: 1 comment · 1 reply

Uh oh!

Uh oh!

kevinzakka Dec 18, 2025 Maintainer

Uh oh!

MarcDcls Dec 18, 2025 Author

MarcDcls
Dec 17, 2025

Replies: 1 comment 1 reply

kevinzakka
Dec 18, 2025
Maintainer

MarcDcls Dec 18, 2025
Author