Skip to content

Commit 8d0f765

Browse files
rossbarmelissawm
authored andcommitted
PERF: reduce RL episodes and batch size from 10 to 3.
1 parent 50934f7 commit 8d0f765

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

Diff for: content/tutorial-deep-reinforcement-learning-with-pong-from-pixels.md

+4-4
Original file line numberDiff line numberDiff line change
@@ -51,7 +51,7 @@ This tutorial can also be run locally in an isolated environment, such as [Virtu
5151
3. Create the policy (the neural network) and the forward pass
5252
4. Set up the update step (backpropagation)
5353
5. Define the discounted rewards (expected return) function
54-
6. Train the agent for 100 episodes
54+
6. Train the agent for 3 episodes
5555
7. Next steps
5656
8. Appendix
5757
- Notes on RL and deep RL
@@ -486,12 +486,12 @@ You can stop the training at any time or/and check saved MP4 videos of saved pla
486486

487487
+++ {"id": "gD6XBqUqfNOV"}
488488

489-
1. For demo purposes, let's limit the number of episodes for training to 10. If you are using hardware acceleration (CPUs and GPUs), you can increase the number to 1,000 or beyond. For comparison, Andrej Karpathy's original experiment took about 8,000 episodes.
489+
1. For demo purposes, let's limit the number of episodes for training to 3. If you are using hardware acceleration (CPUs and GPUs), you can increase the number to 1,000 or beyond. For comparison, Andrej Karpathy's original experiment took about 8,000 episodes.
490490

491491
```{code-cell} ipython3
492492
:id: TdRXrc37Rfvo
493493
494-
max_episodes = 10
494+
max_episodes = 3
495495
```
496496

497497
+++ {"id": "ORj7JFGB0Gy8"}
@@ -503,7 +503,7 @@ max_episodes = 10
503503
```{code-cell} ipython3
504504
:id: eKLLYUKbG-5A
505505
506-
batch_size = 10
506+
batch_size = 3
507507
learning_rate = 1e-4
508508
```
509509

0 commit comments

Comments
 (0)