File tree Expand file tree Collapse file tree 2 files changed +4
-4
lines changed Expand file tree Collapse file tree 2 files changed +4
-4
lines changed Original file line number Diff line number Diff line change 5050 " Args:\n " ,
5151 " Q: A dictionary that maps from state -> action-values.\n " ,
5252 " Each value is a numpy array of length nA (see below)\n " ,
53- " epsilon: The probability to select a random action . float between 0 and 1.\n " ,
53+ " epsilon: The probability to select a random action. Float between 0 and 1.\n " ,
5454 " nA: Number of actions in the environment.\n " ,
5555 " \n " ,
5656 " Returns:\n " ,
8282 " num_episodes: Number of episodes to run for.\n " ,
8383 " discount_factor: Gamma discount factor.\n " ,
8484 " alpha: TD learning rate.\n " ,
85- " epsilon: Chance the sample a random action. Float betwen 0 and 1.\n " ,
85+ " epsilon: Chance to sample a random action. Float between 0 and 1.\n " ,
8686 " \n " ,
8787 " Returns:\n " ,
8888 " A tuple (Q, episode_lengths).\n " ,
Original file line number Diff line number Diff line change 4949 " Args:\n " ,
5050 " Q: A dictionary that maps from state -> action-values.\n " ,
5151 " Each value is a numpy array of length nA (see below)\n " ,
52- " epsilon: The probability to select a random action . float between 0 and 1.\n " ,
52+ " epsilon: The probability to select a random action. Float between 0 and 1.\n " ,
5353 " nA: Number of actions in the environment.\n " ,
5454 " \n " ,
5555 " Returns:\n " ,
8181 " num_episodes: Number of episodes to run for.\n " ,
8282 " discount_factor: Gamma discount factor.\n " ,
8383 " alpha: TD learning rate.\n " ,
84- " epsilon: Chance the sample a random action. Float betwen 0 and 1.\n " ,
84+ " epsilon: Chance to sample a random action. Float between 0 and 1.\n " ,
8585 " \n " ,
8686 " Returns:\n " ,
8787 " A tuple (Q, episode_lengths).\n " ,
You can’t perform that action at this time.
0 commit comments