-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Not working? #4
Comments
@piorunm I had similar results when running locally. After about 5min pygame gets really slow. After letter it train for another couple of hours, no improvement. |
This is not working at all. |
The original code has some issues... have a look at a working version here |
Thanks, this is very helpful. Any chance you could push your model? 700K iterations take a while on my machine. |
sure why not. A added my current checkpoint of 975k steps and added a USE_MODEL Mode which ignores the training aspect. |
Thanks @wh33ler Very nice improvement! |
Hi, @wh33ler is it normal that at 300k timesteps, the ai player moves almost the same as the first 3k steps? I cloned your repo, make a few irrelevant changes(such as rename variables), and I got stupid results. You disabled issue functionality in your repo so I have to question here, LOL. |
I am not sure what exactly you mean. It has been a while since I looked at it. But it might take some time until the AI gets it :) |
I think on line 22 you have to set USE_MODEL = False for the net to actually train. |
Did anyone try to learn it so the agent really can play pong? I tried to learn it for over 30h on Tesla K80 and it doesn't look good at all.
I have also once concern about saving and restoring the learned weights. I modified the code to save the session once every 100000 iterations and I restore it like that:
Is it me doing something wrong or there is a bug somewhere in the code preventing it from learn the pong?
The text was updated successfully, but these errors were encountered: