Sandbox repo for trying diffirent learning algorithms on the 'CartPole-v0' environment from OpenAI gym library