Release 1.4.0 #260

Toni-SM · 2025-01-16T22:22:54Z

No description provided.

* Add mixed precision option into PPO algorithm * Expand mixed precision to forward passes during data sampling phase * Group setup statements * Remove unnecessary parenthesis from autocast function

* Call agent's pre-interaction during evaluation * Increase MINOR version and update CHANGELOG

* Add utility to tensorize and flatten gymnasium spaces * Add spaces utility to docs * Add test for space utils implementations * Add funtion to sample spaces by batches * Update test file * Allow None type * Update model defintions to support different input spaces * Update model definitions to support different input spaces * Update model definitions test file * Replace view by reshape to support unclear operations * Replace gym by gymnasium in wrapper base class annotation * Use space utils to handle spaces in gymnasium-based env wrappers * Replace gym with gymnasium in DeepMind wrapper test * Add utility to convert gym spaces to gymnasium spaces * Convert gym wrapper spaces to gymnasium * Create a common test file for hypothesis' strategies * Add test for convert_gym_space space utils * Remove gymnasium.spaces.Sequence from supported spaces * Add space utility to untensorize spaces * Update Isaac Gym preview wrapper to use space utils * Update Omniverse Isaac Gym wrapper to use space utils * Use spaces utils to process actions in Gym wrapper * Use spaces utils to process actions in Gymnasium wrapper * Use spaces utils to process actions in Isaac Lab wrapper * Raise exceptions when the spaces/values are not supported * Use spaces utils to process actions in Isaac Gym preview wrapper * Use spaces utils to process actions in Omniverse Isaac Gym wrapper * Use spaces utils to process actions in PettingZoo wrapper * Use spaces utils to process actions in DeepMind wrapper * Allow to remove batch dimension when converting gym space * Use spaces utils to process actions in Brax wrapper and allow operation with other spaces * Update spaces utils section in docs * Replace each _get_space_size method call by the compute_space_size utility * Add static method to parse jax device * Add space utility in jax * Add spaces utils test in jax * Rename spaces utility test in torch * Move spaces utils implementation in torch to its own file * Replace each _get_space_size method call by the compute_space_size utility in jax * Add spaces utils API in jax to docs * Update model definitions to support different input spaces in jax * Update model instantiators definitions test file in jax * Replace gym by gymnasium in wrapper base class annotation in jax * Add parameter to handle JAX and NumPy backends * Update Gymnasium wrapper to use space utils in jax * Update Gym wrapper to use space utils in jax * Update PettingZoo wrapper to use space utils in jax * Update Brax wrapper to use space utils in jax * Update Isaac Lab wrapper to use space utils in jax * Update Omniverse Isaac Gym wrapper to use space utils in jax * Update Isaac Gym preview wrapper to use space utils in jax * Move torch-based import statements * Tensorize and flatten Isaac Lab MARL env state * Add docstring to JAX config's parse_device function * Update docs and doctrings * Cache env info in Omniverse Isaac Gym wrapper * Update CHANGELOG * Improve CHANGELOG description * Improve JAX config parse_device implementation * Use gymnsasium batch utility to sample fundamental spaces

* Remove gym from skrl.utils * Remove gym from skrl.resources * Remove gym from skrl.multi_agents * Remove gym from skrl.models * Remove gym from skrl.memories * Remove gym from skrl.envs * Remove gym from skrl.agents * Update dependencies * Remove gym from docs * Update CHANGELOG

…nt step (#208) * Fix: with SAC, a new training batch should be sampled for each gradient_step * Apply format --------- Co-authored-by: yibo di <[email protected]> Co-authored-by: Toni-SM <[email protected]>

* Move sample inside gradient step loop in TD3 (RNN), DDPG (RNN), SAC, SAC (RNN), DQN and DDQN * Update CHANGELOG.md * Apply format --------- Co-authored-by: Deniz Seven <[email protected]> Co-authored-by: Toni-SM <[email protected]>

* Add class mapping for Categorical model * Apply format --------- Co-authored-by: Toni-SM <[email protected]>

* Add new pre-commit hooks (black, codespell, among others) * Remove yapf config and update isort

* Configure pre-commit hooks * Configure black in pre-commit and pyproject.toml * Configure codespell in pre-commit * Ignore formating config sections * Apply black forma to skrl folder * Apply black format to tests folder * Apply codespell * Update CHANGELOG

* Update docs configuration * Remove __init__ automethod entry in docs

* Add multivariate Gaussian model to runner in docs * Update shared model instantiator to allow specifying its structure * Update torch runner to specify shared model structure * Define model mixin from given structure * Use spaces utils to initialize jax model state dictionary * Add support for MultivariateGaussianMixin in shared models * Update model instantiators test in torch * Remove double argument definition * Parse device in jax spaces utils * Update model instantiators test in jax * Update CHANGELOG

…#233) * Add multivariate Gaussian model to runner in docs * Update shared model instantiator to allow specifying its structure * Update torch runner to specify shared model structure * Define model mixin from given structure * Use spaces utils to initialize jax model state dictionary * Add support for MultivariateGaussianMixin in shared models * Update model instantiators test in torch * Remove double argument definition * Parse device in jax spaces utils * Update model instantiators test in jax * Speed up distribution construction in PyTorch by disabling checking

* Add method to parse device in torch * Use ML framework configuration device parsing method to parse devices * Add option to validate parsed torch device * Add ML framework testing in jax * Add torch parse_device method to ML framework docs * Update docstrings and test content * Update docs * Disable device parsing validation when PyTorch config device

…m memory (#235) * Add multivariate Gaussian model to runner in docs * Update shared model instantiator to allow specifying its structure * Update torch runner to specify shared model structure * Define model mixin from given structure * Use spaces utils to initialize jax model state dictionary * Add support for MultivariateGaussianMixin in shared models * Update model instantiators test in torch * Remove double argument definition * Parse device in jax spaces utils * Update model instantiators test in jax * Speed up distribution construction in PyTorch by disabling checking * Replace torch BatchSampler for performance issue * Add base memory class test file in torch * Update CHANGELOG

* Add reduction parameter to gaussian_model instantiator * Specify shared model parameters' default values * Update CHANGELOG * Define reduction as string value

* Update PPO mixed-precision implementation in torch * Add A2C mixed precision support in torch * Add AMP mixed precision support in torch * Add CEM mixed precision support in torch * Add DDPG mixed precision support in torch * Add DQN and DDQN mixed precision support in torch * Add RPO mixed precision support in torch * Add SAC mixed precision support in torch * Add TD3 mixed precision support in torch * Update docs * Add PPO test in torch * Add agent tests in torch * Add agent tests in jax * Avoid TypeError: Got unsupported ScalarType BFloat16 * Update CHANGELOG

…itialize models' lazy modules (#247)

…ta, and the KL Adaptive learning rate scheduler (#253)

…ts and their models (#254)

…amp deprecation warning (#256) * Deal with PyTorch automatic mixed-precision deprecation warning * Add automatic mixed precision support for multi-agent

…on (#252)

lopatovsky and others added 30 commits September 25, 2024 22:11

Mixed double precision for PPO algorithm (#155)

bacb0f5

* Add mixed precision option into PPO algorithm * Expand mixed precision to forward passes during data sampling phase * Group setup statements * Remove unnecessary parenthesis from autocast function

Call agent's pre-interaction during evaluation (#210)

f585796

* Call agent's pre-interaction during evaluation * Increase MINOR version and update CHANGELOG

Merge branch 'main' into develop

ae4e09e

Fix: with SAC, a new training batch should be sampled for each gradie…

5fce807

…nt step (#208) * Fix: with SAC, a new training batch should be sampled for each gradient_step * Apply format --------- Co-authored-by: yibo di <[email protected]> Co-authored-by: Toni-SM <[email protected]>

Fix Sampling inside gradient loop issue (#183)

9252ec9

* Move sample inside gradient step loop in TD3 (RNN), DDPG (RNN), SAC, SAC (RNN), DQN and DDQN * Update CHANGELOG.md * Apply format --------- Co-authored-by: Deniz Seven <[email protected]> Co-authored-by: Toni-SM <[email protected]>

Add class mapping of categorical model (#216)

eff7295

* Add class mapping for Categorical model * Apply format --------- Co-authored-by: Toni-SM <[email protected]>

Update pre-commit hooks (#221)

88ac11f

* Add new pre-commit hooks (black, codespell, among others) * Remove yapf config and update isort

Docs update (#228)

bbe532d

* Update docs configuration * Remove __init__ automethod entry in docs

Update JAX installation warning note (#241)

d2aee9f

Shared model instantiator's default parameters (#242)

23b61dc

* Add reduction parameter to gaussian_model instantiator * Specify shared model parameters' default values * Update CHANGELOG * Define reduction as string value

Fix SAC experiment directory key name (#244)

f39aadc

Fix Optax's learning rate schedulers integration in JAX (#245)

e49f98f

Isaac Lab wrapper's multi-agent state retrieval with gymnasium 1.0

a7f82b2

Add method to initialize lazy modules' parameters

65da82b

Update examples that use model instantiators to the latest API and In…

deb28ff

…itialize models' lazy modules (#247)

Update environment loader and wrapper for Isaac Lab 2.0 (#248)

95663f2

Update Gymnasium checking for vectorized environments (#250)

e5c6b81

Update AMP agent to use the environment's terminated and truncated da…

95b0e02

…ta, and the KL Adaptive learning rate scheduler (#253)

Update runner implementations to support definition of arbitrary agen…

16224b0

…ts and their models (#254)

Fix multi-agent learning rate scheduler in JAX (#255)

9bda6c5

Add automatic mixed precision support for multi-agent and deal torch.…

d11a020

…amp deprecation warning (#256) * Deal with PyTorch automatic mixed-precision deprecation warning * Add automatic mixed precision support for multi-agent

Update gymnasium make vector API (#257)

cb21eba

Toni-SM added 3 commits January 16, 2025 15:22

Fix memory sampling when sequence_length is specified

7f9992b

Treat truncation signal when computing 'done' (environment reset) (#259)

663f546

Allow the use of the deterministic/stochastic actions during evaluati…

fdfb8a5

…on (#252)

Toni-SM merged commit 0c758b2 into main Jan 16, 2025
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release 1.4.0 #260

Release 1.4.0 #260

Toni-SM commented Jan 16, 2025

Release 1.4.0 #260

Release 1.4.0 #260

Conversation

Toni-SM commented Jan 16, 2025