This repository was archived by the owner on Aug 16, 2023. It is now read-only.
Cannot modify the GPU Operator ClusterPolicy before deploying it #140
Labels
kind/feature
Categorizes issue or PR as related to a new feature.
Currently, the GPU Operator ClusterPolicy is fetched from the ClusterServiceVersion
alm-example
and instantiated right away.However, in some cases, the default content is not the one we desire. See for instance this unmerged commit, where we need to set the
repoConfig
stanza when running with OCP 4.8 (using RHEL beta repositories).Another example would be when we want to customize the operator or operand image path to use custom ones.
The GPU Operator DaemonSets are never updated once created, so if they are created with the wrong values, the DaemonSets will never be fixed.
The hack above works (hopefully) because the driver container will fail to deploy without the right
repoConfig
configuration, so it's safe to manually delete it after the update, but in the general case, the Driver container should never be deleted once running, as thenvidia
driver cannot be removed from the kernel while other process (workload or operand) use it.We should find a way to allow patching the ClusterPolicy before deploying it. The solution should be generic, so that any kind of modification can be performed during the deployment.
The text was updated successfully, but these errors were encountered: