
feat(kagent-adk): remove litellm as dependency from kagent-adk #1540

Merged
EItanya merged 9 commits into kagent-dev:main from jmhbh:feat/remove-litellm
Mar 26, 2026

Conversation

jmhbh (Contributor) commented Mar 24, 2026

Removes litellm as a dependency from kagent. litellm is now replaced with provider-specific SDKs.

Testing

ollama

  1. Deploy ollama:
```bash
kubectl apply -f - <<EOF
apiVersion: apps/v1
kind: Deployment
metadata:
  name: ollama
  namespace: kagent
spec:
  replicas: 1
  selector:
    matchLabels:
      app: ollama
  template:
    metadata:
      labels:
        app: ollama
    spec:
      containers:
      - name: ollama
        image: ollama/ollama:latest
        ports:
        - containerPort: 11434
        resources:
          requests:
            memory: "2Gi"
          limits:
            memory: "4Gi"
---
apiVersion: v1
kind: Service
metadata:
  name: ollama
  namespace: kagent
spec:
  selector:
    app: ollama
  ports:
  - port: 11434
    targetPort: 11434
EOF
```
  2. Pull a small model into the ollama deployment: `kubectl -n kagent exec -it deploy/ollama -- ollama pull llama3.2:1b`
  3. Create a model config and an agent:
```bash
kubectl apply -f - <<EOF
apiVersion: kagent.dev/v1alpha2
kind: ModelConfig
metadata:
  name: ollama-test-config
  namespace: kagent
spec:
  provider: Ollama
  model: llama3.2:1b
  ollama:
    host: "http://ollama:11434"
    options:
      num_ctx: "2048"
      temperature: "0.7"
      top_k: "40"
---
apiVersion: kagent.dev/v1alpha2
kind: Agent
metadata:
  name: ollama-test
  namespace: kagent
spec:
  type: Declarative
  description: "Ollama native SDK test"
  declarative:
    modelConfig: ollama-test-config
    systemMessage: "You are a helpful assistant. Answer concisely."
EOF
```
  4. Port-forward the UI and test the agent: `kubectl port-forward -n kagent svc/kagent-ui 3000:8080`
  5. Test memory recall by pulling an embedding model: `kubectl -n kagent exec -it deploy/ollama -- ollama pull nomic-embed-text`
  6. Create the embedding model config and an ollama memory test agent: `kubectl apply -f - <`

embedding - google

  1. Create the secret and model configs:
```bash
kubectl apply -f - <<EOF
apiVersion: v1
kind: Secret
metadata:
  name: gemini-api-key-secret
  namespace: kagent
type: Opaque
data:
  GOOGLE_API_KEY: <your_api_key>  # must be base64-encoded (or use stringData for plain text)
---
apiVersion: kagent.dev/v1alpha2
kind: ModelConfig
metadata:
  name: gemini-2-flash-config
  namespace: kagent
spec:
  model: gemini-2.0-flash
  provider: Gemini
  apiKeySecret: gemini-api-key-secret
  apiKeySecretKey: GOOGLE_API_KEY
  gemini: {}
---
apiVersion: kagent.dev/v1alpha2
kind: ModelConfig
metadata:
  name: gemini-embedding-config
  namespace: kagent
spec:
  model: gemini-embedding-001
  provider: Gemini
  apiKeySecret: gemini-api-key-secret
  apiKeySecretKey: GOOGLE_API_KEY
  gemini: {}
EOF
```
  2. Create the agent:
```bash
kubectl apply -f - <<EOF
apiVersion: kagent.dev/v1alpha2
kind: Agent
metadata:
  name: memory-openai-test
  namespace: kagent
spec:
  type: Declarative
  description: "Memory with Gemini embedding"
  declarative:
    modelConfig: gemini-2-flash-config
    systemMessage: "You are a helpful assistant with memory."
    memory:
      modelConfig: gemini-embedding-config
EOF
```
  3. Port-forward the UI and test the agent: `kubectl port-forward -n kagent svc/kagent-ui 3000:8080`

bedrock

  1. Create the aws-credentials secret:
```bash
kubectl -n kagent create secret generic aws-credentials \
  --from-literal=AWS_ACCESS_KEY_ID="$AWS_ACCESS_KEY_ID" \
  --from-literal=AWS_SECRET_ACCESS_KEY="$AWS_SECRET_ACCESS_KEY" \
  --from-literal=AWS_DEFAULT_REGION="<your_region>" \
  --from-literal=AWS_SESSION_TOKEN="$AWS_SESSION_TOKEN" \
  --dry-run=client -o yaml | kubectl apply -f -
```
  2. Create the model config and agent:
```yaml
apiVersion: kagent.dev/v1alpha2
kind: ModelConfig
metadata:
  name: bedrock-model-config
  namespace: kagent
spec:
  model: us.anthropic.claude-haiku-4-5-20251001-v1:0
  provider: Bedrock
  bedrock:
    region: us-east-1
---
apiVersion: kagent.dev/v1alpha2
kind: Agent
metadata:
  name: bedrock-test
  namespace: kagent
spec:
  type: Declarative
  description: "Bedrock Converse API test"
  declarative:
    systemMessage: "You are a helpful assistant. Answer concisely."
    modelConfig: bedrock-model-config
    deployment:
      env:
      - name: AWS_ACCESS_KEY_ID
        valueFrom:
          secretKeyRef:
            name: aws-credentials
            key: AWS_ACCESS_KEY_ID
      - name: AWS_SECRET_ACCESS_KEY
        valueFrom:
          secretKeyRef:
            name: aws-credentials
            key: AWS_SECRET_ACCESS_KEY
      - name: AWS_DEFAULT_REGION
        valueFrom:
          secretKeyRef:
            name: aws-credentials
            key: AWS_DEFAULT_REGION
      - name: AWS_SESSION_TOKEN
        valueFrom:
          secretKeyRef:
            name: aws-credentials
            key: AWS_SESSION_TOKEN
```

jmhbh added 2 commits March 24, 2026 19:04
iplay88keys (Contributor) left a comment (marked as duplicate):
Minor — stale LiteLLM references in docstrings

A few docstrings in _memory_service.py still reference LiteLLM after this change:

  • Line 27 (class docstring): "Generates embeddings using LiteLLM"
  • Lines 60, 71 (add_session_to_memory / _add_session_to_memory_background docstrings): "Optional ADK model object (e.g., LiteLlm, OpenAI)"
  • Line 447 (_summarize_session_content_async docstring): same

These are cosmetic but worth updating for accuracy.

Comment left by Claude on behalf of @iplay88keys

jmhbh and others added 3 commits March 25, 2026 16:58
@jmhbh jmhbh marked this pull request as ready for review March 25, 2026 23:24
@jmhbh jmhbh requested a review from EItanya as a code owner March 25, 2026 23:24
Copilot AI review requested due to automatic review settings March 25, 2026 23:24
@jmhbh jmhbh requested review from peterj and yuval-k as code owners March 25, 2026 23:24
Copilot AI left a comment:
Pull request overview

This PR removes the litellm dependency from kagent-adk by replacing LiteLLM-based model/embedding usage with provider-specific SDK implementations (Anthropic, Ollama, Bedrock, and OpenAI SDK calls), and adds unit tests to validate the new dispatch behavior.

Changes:

  • Drop litellm from kagent-adk dependencies and lockfile.
  • Replace LiteLLM model creation with native provider model classes (Anthropic/Ollama/Bedrock) and update model dispatch in types.py.
  • Rework embedding generation to call provider SDKs directly, and add new unit tests for embeddings and the new model adapters.

Reviewed changes

Copilot reviewed 12 out of 13 changed files in this pull request and generated 2 comments.

Summary per file:

  • python/uv.lock: Removes litellm (and its transitive deps like fastuuid) from the workspace lock.
  • python/packages/kagent-adk/pyproject.toml: Removes the litellm dependency; retains/uses provider SDK deps (openai/anthropic/boto3/ollama/numpy).
  • python/packages/kagent-adk/src/kagent/adk/_memory_service.py: Replaces LiteLLM embedding calls with provider-specific SDK embedding dispatch.
  • python/packages/kagent-adk/src/kagent/adk/types.py: Updates _create_llm_from_model_config to instantiate native Anthropic/Ollama/Bedrock implementations.
  • python/packages/kagent-adk/src/kagent/adk/models/_anthropic.py: Adds KAgentAnthropicLlm with base_url/headers and API key passthrough support.
  • python/packages/kagent-adk/src/kagent/adk/models/_bedrock.py: Adds KAgentBedrockLlm using Bedrock Converse / ConverseStream APIs via boto3.
  • python/packages/kagent-adk/src/kagent/adk/models/_ollama.py: Adds KAgentOllamaLlm using the native Ollama SDK and tool/function-call conversions.
  • python/packages/kagent-adk/src/kagent/adk/models/__init__.py: Exports new model classes instead of the removed LiteLLM wrapper.
  • python/packages/kagent-adk/src/kagent/adk/models/_litellm.py: Deletes the LiteLLM wrapper model class.
  • python/packages/kagent-adk/tests/unittests/test_embedding.py: Adds unit tests for embedding dispatch/truncation/normalization without LiteLLM.
  • python/packages/kagent-adk/tests/unittests/models/test_anthropic.py: Adds unit tests for the Anthropic adapter behavior.
  • python/packages/kagent-adk/tests/unittests/models/test_bedrock.py: Adds unit tests for the Bedrock adapter and client region selection.
  • python/packages/kagent-adk/tests/unittests/models/test_ollama.py: Adds unit tests for the Ollama adapter and option/header forwarding.


jmhbh and others added 2 commits March 25, 2026 19:49
EItanya (Contributor) commented Mar 26, 2026:

I threw a claude team at this just to double check, lmk what you think. I'm happy to merge without everything fixed to get rid of litellm quickly, but we're definitely going to need to do deeper testing on bedrock

Review: feat(kagent-adk): remove litellm as dependency

Bugs

1. _normalize_l2 returns np.ndarray — JSON serialization will fail
_memory_service.py ~line 295. After truncation, embeddings are passed through _normalize_l2() which returns a numpy array. When httpx tries to serialize this via json.dumps, it will raise TypeError: Object of type ndarray is not JSON serializable. This affects any embedding model returning vectors > 768 dims (e.g., text-embedding-3-large at 3072).
Fix: `embedding = self._normalize_l2(embedding).tolist()`
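A minimal sketch of the proposed fix, assuming a numpy-based `_normalize_l2` helper like the one described above; the function names here are illustrative, not kagent's exact API:

```python
import numpy as np

def _normalize_l2(vec):
    """Return the L2-normalized vector as a numpy array."""
    arr = np.asarray(vec, dtype=float)
    norm = np.linalg.norm(arr)
    return arr if norm == 0 else arr / norm

def truncate_and_normalize(embedding, dims=768):
    """Truncate to `dims` entries, re-normalize, and return a plain list.

    The trailing .tolist() is the fix: json.dumps cannot serialize an
    np.ndarray, so the result must be converted before it reaches httpx.
    """
    truncated = embedding[:dims]
    return _normalize_l2(truncated).tolist()
```

This only matters for models that return more than 768 dimensions, since shorter vectors skip the truncation path.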

2. Bedrock streaming blocks the event loop
_bedrock.py ~lines 177-218. asyncio.to_thread only wraps the initial converse_stream call, but the for event in stream_body loop iterates synchronously on the main thread. The entire streaming loop needs to be in a thread, or use aioboto3.
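One way to keep the whole loop off the event loop, sketched under the assumption that the client exposes a blocking `converse_stream()` returning a dict with an iterable `"stream"` (as boto3 does); all names here are illustrative:

```python
import asyncio
import queue
import threading

_SENTINEL = object()

async def iter_stream_in_thread(call_converse_stream):
    """Run both the converse_stream call and its iteration in a worker
    thread, yielding events back onto the event loop via a queue."""
    q: queue.Queue = queue.Queue()

    def worker():
        try:
            # The blocking call AND the blocking iteration both stay here.
            for event in call_converse_stream()["stream"]:
                q.put(event)
        finally:
            q.put(_SENTINEL)

    threading.Thread(target=worker, daemon=True).start()
    while True:
        # q.get() blocks too, so hop to a thread for each item.
        event = await asyncio.to_thread(q.get)
        if event is _SENTINEL:
            break
        yield event
```

Using aioboto3, as the review suggests, would avoid the hand-rolled queue entirely.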

3. Bedrock region config field silently ignored
_bedrock.py:_get_bedrock_client() only reads AWS_DEFAULT_REGION/AWS_REGION env vars. The Bedrock model config's region field is never passed through from types.py:518. Pre-existing issue, but the new dedicated function is the right place to fix it.

4. api_key_passthrough causes AttributeError for Bedrock/Ollama
types.py ~lines 499, 518. Neither KAgentBedrockLlm nor KAgentOllamaLlm implement set_passthrough_key() or have api_key_passthrough. If a user configures api_key_passthrough: true, the passthrough plugin will raise AttributeError. Either add no-op implementations or guard in the plugin.
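A sketch of the guard-in-the-plugin option; `set_passthrough_key` mirrors the method named above, while the helper itself is hypothetical:

```python
def apply_passthrough_key(llm, key):
    """Set the passthrough API key only on models that support it.

    Returns True if the key was applied, False if the model (e.g. a
    Bedrock or Ollama adapter) has no passthrough concept.
    """
    setter = getattr(llm, "set_passthrough_key", None)
    if callable(setter):
        setter(key)
        return True
    return False
```

The alternative fix, no-op `set_passthrough_key` implementations on the Bedrock/Ollama adapters, would keep the plugin untouched.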

Issues

5. No finish_reason or usage_metadata in Ollama responses
_ollama.py ~lines 205-226. The Ollama adapter never populates these fields, even though ChatResponse has done_reason, prompt_eval_count, and eval_count. Token counts are lost.
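A sketch of the mapping being suggested, assuming an Ollama ChatResponse-shaped dict with `done_reason` / `prompt_eval_count` / `eval_count`; the output field names are illustrative:

```python
def extract_ollama_usage(resp: dict) -> dict:
    """Map Ollama response metadata onto finish-reason/usage fields."""
    prompt_tokens = resp.get("prompt_eval_count", 0)
    completion_tokens = resp.get("eval_count", 0)
    return {
        "finish_reason": resp.get("done_reason"),
        "prompt_tokens": prompt_tokens,
        "completion_tokens": completion_tokens,
        "total_tokens": prompt_tokens + completion_tokens,
    }
```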

6. dimensions=768 incompatible with older OpenAI embedding models
_memory_service.py ~line 374. The dimensions parameter is only supported by text-embedding-3-* models. Passing it to text-embedding-ada-002 (common with Azure) will raise an API error. Pre-existing from the litellm code, but worth fixing now.
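A sketch of gating the parameter on the model family; the helper name and the prefix check are assumptions, not kagent code:

```python
def embedding_kwargs(model: str, dims: int = 768) -> dict:
    """Build embeddings.create kwargs, passing `dimensions` only to the
    text-embedding-3-* family that accepts it."""
    kwargs = {"model": model}
    if model.startswith("text-embedding-3"):
        kwargs["dimensions"] = dims
    # Older models (e.g. text-embedding-ada-002) reject `dimensions`;
    # rely on client-side truncation + re-normalization instead.
    return kwargs
```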

7. Missing Anthropic/Bedrock embedding providers
_memory_service.py ~lines 354-361. The embedding dispatch handles openai, azure_openai, ollama, vertex_ai, gemini — but not anthropic or bedrock. Bedrock supports embeddings (Titan). Should at minimum raise a clear error for unsupported providers rather than falling through silently.
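A sketch of the "raise a clear error" option; the provider set mirrors the dispatch described above, the function itself is hypothetical:

```python
SUPPORTED_EMBEDDING_PROVIDERS = {"openai", "azure_openai", "ollama", "vertex_ai", "gemini"}

def check_embedding_provider(provider: str) -> None:
    """Fail loudly for providers without embedding support instead of
    falling through silently."""
    if provider not in SUPPORTED_EMBEDDING_PROVIDERS:
        raise ValueError(
            f"Embedding provider {provider!r} is not supported; "
            f"choose one of {sorted(SUPPORTED_EMBEDDING_PROVIDERS)}"
        )
```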

8. New boto3 client created on every request
_bedrock.py ~line 152. Unlike Anthropic/Ollama which cache clients, Bedrock creates a fresh boto3.client per generate_content_async call. Should be cached.
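A sketch of per-region client caching; the factory callable stands in for `boto3.client("bedrock-runtime", region_name=...)` so the example stays self-contained:

```python
import threading

class ClientCache:
    """Create one client per region lazily and reuse it thereafter."""

    def __init__(self, factory):
        self._factory = factory  # e.g. lambda r: boto3.client("bedrock-runtime", region_name=r)
        self._clients = {}
        self._lock = threading.Lock()

    def get(self, region: str):
        with self._lock:
            client = self._clients.get(region)
            if client is None:
                client = self._factory(region)
                self._clients[region] = client
            return client
```

boto3 clients are documented as thread-safe for method calls, so sharing one per region is the usual pattern.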

9. Bedrock adapter silently ignores inferenceConfig
The Bedrock adapter doesn't forward inferenceConfig (temperature, maxTokens, topP, stopSequences) to the Converse API. Users' model config values are silently dropped with no indication.
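A sketch of assembling the Converse `inferenceConfig`; the camelCase keys follow the Bedrock Converse request schema, while the helper and its parameters are illustrative:

```python
def build_inference_config(temperature=None, max_tokens=None,
                           top_p=None, stop_sequences=None) -> dict:
    """Map model-config generation settings onto the Converse API's
    inferenceConfig shape, omitting anything unset."""
    config = {}
    if temperature is not None:
        config["temperature"] = temperature
    if max_tokens is not None:
        config["maxTokens"] = max_tokens
    if top_p is not None:
        config["topP"] = top_p
    if stop_sequences:
        config["stopSequences"] = list(stop_sequences)
    return config
```

An empty dict can simply be left out of the `converse()` call, so unset configs add nothing to the request.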

Comment left by Claude on behalf of @EItanya

supreme-gg-gg (Contributor) commented:

Quick comments based on my context:

  3. Bedrock region config field silently ignored
    _bedrock.py:_get_bedrock_client() only reads AWS_DEFAULT_REGION/AWS_REGION env vars. The Bedrock model config's region field is never passed through from types.py:518. Pre-existing issue, but the new dedicated function is the right place to fix it.

We already set the AWS_REGION env var during translation, so it should work fine

  6. dimensions=768 incompatible with older OpenAI embedding models
    _memory_service.py ~line 374. The dimensions parameter is only supported by text-embedding-3-* models. Passing it to text-embedding-ada-002 (common with Azure) will raise an API error. Pre-existing from the litellm code, but worth fixing now.

Some models do not allow configuring embedding dimensions (they return a fixed-size vector longer than 768); that is the purpose of truncation and re-normalization. According to prior research this works fine in most cases, as long as the call to the model returns a vector longer than 768.

  7. Missing Anthropic/Bedrock embedding providers
    _memory_service.py ~lines 354-361. The embedding dispatch handles openai, azure_openai, ollama, vertex_ai, gemini — but not anthropic or bedrock. Bedrock supports embeddings (Titan). Should at minimum raise a clear error for unsupported providers rather than falling through silently.

Probably out of scope, we might want to rework the embedding interface in the future for wider support

jmhbh added 2 commits March 26, 2026 16:51
@EItanya EItanya merged commit 9dcee3a into kagent-dev:main Mar 26, 2026
22 checks passed