Skip to content

Commit a5d6921

Browse files
Update cert and missed Gateway API migration
1 parent 5b934df commit a5d6921

File tree

4 files changed

+36
-22
lines changed

4 files changed

+36
-22
lines changed

README.md

+6-1
Original file line numberDiff line numberDiff line change
@@ -44,4 +44,9 @@ This project showcases an educational and experimental setup, offering a startin
4444
- **Hybrid Cloud Deployments**: Adapt the setup for hybrid or multi-cloud Kubernetes deployments.
4545
- **Natural Language Processing (NLP)**: Implement AI-powered features such as text summarization, sentiment analysis, or chatbot functionality for applications requiring language understanding.
4646
- **Image and Video Processing**: Use AI models to enable facial recognition, object detection, image classification, or video analytics for multimedia applications.
47-
- **Image and Video Processing**: Use AI models to enable facial recognition, object detection, image classification, or video analytics for multimedia applications.
47+
- **Real-Time Data Stream Processing**: Integrate AI models to process and analyze high-velocity data streams (e.g., IoT sensor data, live event tracking, or financial market feeds) for real-time insights and predictions.
48+
- **AI-Powered Infrastructure Management**: Automate cluster health monitoring and resource allocation using predictive analytics to identify performance bottlenecks and self-heal infrastructure issues before they escalate.
49+
- **Scientific Simulations and Modeling**: Use AI to accelerate complex scientific simulations, such as climate modeling, molecular dynamics, or astrophysical computations, leveraging Kubernetes' scalable GPU resources.
50+
- **Context-Aware API Gateways**: Use AI models on Kubernetes endpoints to dynamically analyze incoming API requests and provide context-aware routing, such as adjusting traffic flow based on user behavior, request intent, or predicted resource demands. This can enhance scalability and improve user experience by intelligently prioritizing requests.
51+
- **Personalized Response Generation**: Deploy AI models on endpoints to deliver tailored responses to users, such as real-time content recommendations, adaptive UI/UX experiences, or personalized chatbot interactions. By integrating AI with Kubernetes, these models can scale based on traffic while ensuring low-latency, user-specific outputs for high-demand applications.
52+
- **Predictive Autoscaling for Endpoint Workloads**: Use AI models deployed on Kubernetes endpoints to predict traffic patterns and proactively scale resources. By analyzing historical and real-time data, the AI can optimize pod scaling to handle peak loads efficiently, reducing latency and preventing over-provisioning while ensuring seamless endpoint performance.

helm-chart/localai-values.yaml

+3-16
Original file line numberDiff line numberDiff line change
@@ -8,27 +8,14 @@ service:
88

99
# Gateway API disabled (using ingress instead)
1010
gateway:
11-
enabled: false
12-
13-
# Ingress configuration
14-
ingress:
1511
enabled: true
16-
className: nginx
17-
annotations:
18-
cert-manager.io/cluster-issuer: selfsigned-issuer
19-
hosts:
20-
- host: example.local
21-
paths:
22-
- path: /
23-
pathType: Prefix
24-
tls:
25-
- secretName: example-cert-tls
26-
hosts:
27-
- example.local
2812

2913
# LocalAI configuration
3014
localai:
3115
enabled: true
16+
env:
17+
CONTEXT_SIZE: "0"
18+
THREADS: "4"
3219
persistence:
3320
models:
3421
enabled: true

helm-chart/test-certificate.yaml

+17
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,17 @@
1+
apiVersion: cert-manager.io/v1
2+
kind: Certificate
3+
metadata:
4+
name: test-cert
5+
namespace: localai
6+
spec:
7+
secretName: test-cert-tls
8+
duration: 2160h # 90 days
9+
renewBefore: 360h # 15 days
10+
commonName: test.local
11+
dnsNames:
12+
- test.local
13+
issuerRef:
14+
name: selfsigned-issuer
15+
kind: ClusterIssuer
16+
group: cert-manager.io
17+

helm-chart/values.yaml

+10-5
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,13 @@
11
# LocalAI configuration
2-
fullnameOverride: "localai"
2+
3+
certificate:
4+
create: true
5+
name: localai-tls
6+
dnsNames:
7+
- ai.example.com
8+
issuerRef:
9+
name: selfsigned-issuer
10+
kind: ClusterIssuer
311

412
autoscaling:
513
enabled: false
@@ -83,10 +91,7 @@ gateway:
8391
mode: Terminate
8492
certificateRefs:
8593
- name: localai-tls
86-
kind: Secret
87-
hostnames:
88-
- "ai.example.com"
89-
rules:
94+
rules:
9095
- matches:
9196
- path:
9297
type: PathPrefix

0 commit comments

Comments
 (0)