Skip to content

Commit bb17d68

Browse files
committed
Update CR samples
Signed-off-by: Shiva Krishna, Merla <[email protected]>
1 parent 4fa26ae commit bb17d68

File tree

2 files changed

+42
-3
lines changed

2 files changed

+42
-3
lines changed

config/samples/apps_v1alpha1_nimcache.yaml

Lines changed: 22 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,4 +6,25 @@ metadata:
66
app.kubernetes.io/managed-by: kustomize
77
name: nimcache-sample
88
spec:
9-
# TODO(user): Add fields here
9+
source:
10+
ngc:
11+
modelPuller: nvcr.io/nim/meta/llama3-8b-instruct:1.0.0
12+
pullSecret: ngc-secret
13+
authSecret: ngc-api-secret
14+
model:
15+
profiles: []
16+
autoDetect: true
17+
precision: "fp8"
18+
engine: "tensorrt_llm"
19+
qosProfile: "throughput"
20+
gpus:
21+
product: "l40s"
22+
ids:
23+
- "26b5"
24+
tensorParallelism: "1"
25+
storage:
26+
pvc:
27+
create: true
28+
storageClass: "local-path"
29+
size: "50Gi"
30+
volumeAccessMode: ReadWriteOnce

config/samples/apps_v1alpha1_nimservice.yaml

Lines changed: 20 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,24 @@ metadata:
44
labels:
55
app.kubernetes.io/name: k8s-nim-operator
66
app.kubernetes.io/managed-by: kustomize
7-
name: nimservice-sample
7+
name: meta-llama3-8b-instruct
88
spec:
9-
# TODO(user): Add fields here
9+
image:
10+
repository: nvcr.io/nim/meta/llama3-8b-instruct
11+
tag: 1.0.0
12+
pullPolicy: IfNotPresent
13+
pullSecrets:
14+
- ngc-secret
15+
authSecret: ngc-api-secret
16+
storage:
17+
nimCache:
18+
name: meta-llama3-8b-instruct
19+
profile: ''
20+
replicas: 1
21+
resources:
22+
limits:
23+
nvidia.com/gpu: 1
24+
expose:
25+
service:
26+
type: ClusterIP
27+
openaiPort: 8000

0 commit comments

Comments
 (0)