This repository was archived by the owner on Aug 16, 2023. It is now read-only.
Detect when the nightly CI fails because of a cluster shutdown #150
Labels
area/ci
good first issue
Denotes an issue ready for a new contributor, according to the "help wanted" guidelines.
kind/flake
Categorizes issue or PR as related to a flaky test.
Every once in a while, the nightly testing fails because the cluster becomes unreachable:
This kind of failure is independent from the GPU Operator testing, and it should be made clear in the CI-Dashboard (Prow infrastructure restarts the testing when this happens). An orange dot could do the job, with a label
cluster issue detected
.To detect this, the
must-gather
script could simply create a filecluster-down
whenoc version
doesn't work.This the presence of this file would tell the
ci-dashboard
to set the orange flag.The text was updated successfully, but these errors were encountered: