Skip to content

Commit 442526d

Browse files
authored
Merge pull request #653 from uniemimu/flag
more documentation for fractional resources
2 parents fa2a217 + b0130e6 commit 442526d

File tree

1 file changed

+14
-2
lines changed

1 file changed

+14
-2
lines changed

cmd/gpu_plugin/README.md

Lines changed: 14 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -11,6 +11,7 @@ Table of Contents
1111
* [Deploying as a DaemonSet](#deploying-as-a-daemonset)
1212
* [Build the plugin image](#build-the-plugin-image)
1313
* [Deploy plugin DaemonSet](#deploy-plugin-daemonset)
14+
* [Fractional resources](#fractional-resources)
1415
* [Deploy by hand](#deploy-by-hand)
1516
* [Build the plugin](#build-the-plugin)
1617
* [Run the plugin as administrator](#run-the-plugin-as-administrator)
@@ -45,6 +46,7 @@ passthrough and acceleration.
4546
| Flag | Argument | Default | Meaning |
4647
|:---- |:-------- |:------- |:------- |
4748
| -enable-monitoring | - | disabled | Enable 'i915_monitoring' resource that provides access to all Intel GPU devices on the node |
49+
| -resource-manager | - | disabled | Enable fractional resource management, [see also dependencies](#fractional-resources) |
4850
| -shared-dev-num | int | 1 | Number of containers that can share the same GPU device |
4951

5052
The plugin also accepts a number of other arguments (common to all plugins) related to logging.
@@ -134,7 +136,17 @@ $ kubectl apply -k deployments/gpu_plugin/overlays/nfd_labeled_nodes
134136
daemonset.apps/intel-gpu-plugin created
135137
```
136138

137-
The experimental fractional-resource feature can be enabled by running:
139+
#### Fractional resources
140+
141+
With the experimental fractional resource feature you can use additional kubernetes extended
142+
resources, such as GPU memory, which can then be consumed by deployments. PODs will then only
143+
deploy to nodes where there are sufficient amounts of the extended resources for the containers.
144+
145+
Enabling the fractional resource feature isn't quite as simple as just enabling the related
146+
command line flag. The DaemonSet needs additional RBAC-permissions
147+
and access to the kubelet podresources gRPC service, plus there are other dependencies to
148+
take care of, which are explained below. For the RBAC-permissions, gRPC service access and
149+
the flag enabling, it is recommended to use kustomization by running:
138150

139151
```bash
140152
$ kubectl apply -k deployments/gpu_plugin/overlays/fractional_resources
@@ -144,7 +156,7 @@ clusterrolebinding.rbac.authorization.k8s.io/resource-reader-rb created
144156
daemonset.apps/intel-gpu-plugin created
145157
```
146158

147-
Usage of fractional GPU resources, such as GPU memory, requires that the cluster has node
159+
Usage of these fractional GPU resources requires that the cluster has node
148160
extended resources with the name prefix `gpu.intel.com/`. Those can be created with NFD
149161
by running the [hook](/cmd/gpu_nfdhook/) installed by the plugin initcontainer. When fractional resources are
150162
enabled, the plugin lets a [scheduler extender](https://github.com/intel/platform-aware-scheduling/tree/master/gpu-aware-scheduling)

0 commit comments

Comments
 (0)