Add guiding comments to advantage calculation in REINFORCE #5260

Triggered via pull request November 12, 2025 22:32
@bernhardpg
labeled #562
Status Failure
Total duration 17m 21s
Artifacts

main.yml

on: pull_request_target

Annotations

3 errors and 1 warning
dockerfile: Process completed with exit code 1.
ubuntu 24.04 noble: Process completed with exit code 3.
pip extra on noble: Process completed with exit code 1.
macos sonoma 14: Skipping pydantic: most recent version 2.12.4 not installed