Skip to content

Commit

Permalink
Update CR samples
Browse files Browse the repository at this point in the history
Signed-off-by: Shiva Krishna, Merla <[email protected]>
  • Loading branch information
shivamerla committed Aug 21, 2024
1 parent 4fa26ae commit bb17d68
Show file tree
Hide file tree
Showing 2 changed files with 42 additions and 3 deletions.
23 changes: 22 additions & 1 deletion config/samples/apps_v1alpha1_nimcache.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -6,4 +6,25 @@ metadata:
app.kubernetes.io/managed-by: kustomize
name: nimcache-sample
spec:
# TODO(user): Add fields here
source:
ngc:
modelPuller: nvcr.io/nim/meta/llama3-8b-instruct:1.0.0
pullSecret: ngc-secret
authSecret: ngc-api-secret
model:
profiles: []
autoDetect: true
precision: "fp8"
engine: "tensorrt_llm"
qosProfile: "throughput"
gpus:
product: "l40s"
ids:
- "26b5"
tensorParallelism: "1"
storage:
pvc:
create: true
storageClass: "local-path"
size: "50Gi"
volumeAccessMode: ReadWriteOnce
22 changes: 20 additions & 2 deletions config/samples/apps_v1alpha1_nimservice.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,24 @@ metadata:
labels:
app.kubernetes.io/name: k8s-nim-operator
app.kubernetes.io/managed-by: kustomize
name: nimservice-sample
name: meta-llama3-8b-instruct
spec:
# TODO(user): Add fields here
image:
repository: nvcr.io/nim/meta/llama3-8b-instruct
tag: 1.0.0
pullPolicy: IfNotPresent
pullSecrets:
- ngc-secret
authSecret: ngc-api-secret
storage:
nimCache:
name: meta-llama3-8b-instruct
profile: ''
replicas: 1
resources:
limits:
nvidia.com/gpu: 1
expose:
service:
type: ClusterIP
openaiPort: 8000

0 comments on commit bb17d68

Please sign in to comment.