-
-
Notifications
You must be signed in to change notification settings - Fork 928
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Proposal] Initial State Tracking - Bipedal Walker Hardcore #892
Comments
This should be possible with the options argument to pass an intended state for the environment to use |
Yes, but the options argument is not being used anywhere on the reset() method: It's possible that it was supposed to be included on the super().reset() method? |
What I mean is that this solution could be implemented using the |
Oh, I see, I'll make the changes then open a PR, thank you for your help! |
@pseudo-rnd-thoughts here's the PR: 1305. I'd appreciate your feedback, thanks in advance! |
Proposal
Create methods that are able to retrieve and set, the initial states that uniquely represents the terrain created for Bipedal Walker Hardcore.
Motivation
I'm currently working on a research on curriculum learning, based on a paper that used the bipedal walker hardcore problem to demonstrate that a selection on the initial conditions of the experiment could improve the overall performance. My current work implies extracting the states used to create the terrain, so that I can use it to automatically select this initial states along the experiment, and also set the states to generate a specific terrain. I was working on some changes in the bipedal environment locally, but it occurred to me that this could benefit other people that may have the same issue.
Pitch
The idea is to have a parameter in the
reset
method that allows the overwriting of the states used to generate the terrain, and also the persistence of those states, available through an attribute, similar to theterrain
attribute.Alternatives
We may have other ideas of how to do it, but I think the solution would involve a method to set the states and an attribute to access them. Feel free to make your suggestions 😄
Additional context
The paper: Automated curriculum learning for embodied agents a neuroevolutionary approach.
Current Fork: https://github.com/arthur-plautz/gym
Checklist
The text was updated successfully, but these errors were encountered: