feat: implement backend for models ui (#690)

UriZafrir · urizaf-work · onematchfox · web-flow · commit 1797ed9f504d · 2025-08-08T11:01:29.000+02:00
* feat: implement backend for models ui Signed-off-by: urizaf-work <uri.zafrir@kaltura.com> Signed-off-by: urizaf <urizaf@gmail.com> * add missing providers so they all appear in ui Signed-off-by: urizaf-work <uri.zafrir@kaltura.com> Signed-off-by: urizaf <urizaf@gmail.com> * fix(ui): correctly display args for tool calls in chat (#688) Signed-off-by: Brian Fox <878612+onematchfox@users.noreply.github.com> Signed-off-by: urizaf <urizaf@gmail.com> * update READMEs based on new architecture (#684) * update READMEs based on new architecture Signed-off-by: Eitan Yarmush <eitan.yarmush@solo.io> * PR comments Signed-off-by: Eitan Yarmush <eitan.yarmush@solo.io> * Update ui/README.md --------- Signed-off-by: Eitan Yarmush <eitan.yarmush@solo.io> Co-authored-by: Peter Jausovec <peterj@users.noreply.github.com> Signed-off-by: urizaf <urizaf@gmail.com> * [FIX ] - fixes adk performance tuning (#689) * - fixes adk performance tuning - dependency versions update Signed-off-by: Dmytro Rashko <dmitriy.rashko@amdocs.com> * fix VERSION in case forked repository without tags Signed-off-by: Dmytro Rashko <dmitriy.rashko@amdocs.com> * revert uv version Signed-off-by: Dmytro Rashko <dmitriy.rashko@amdocs.com> * updated golden e2e Signed-off-by: Dmytro Rashko <dmitriy.rashko@amdocs.com> * fix helm unit tests Signed-off-by: Eitan Yarmush <eitan.yarmush@solo.io> --------- Signed-off-by: Dmytro Rashko <dmitriy.rashko@amdocs.com> Signed-off-by: Eitan Yarmush <eitan.yarmush@solo.io> Co-authored-by: Eitan Yarmush <eitan.yarmush@solo.io> Signed-off-by: urizaf <urizaf@gmail.com> * feat: make streaming buffer size configurable (#696) * feat: make streaming buffer size configurable Signed-off-by: Eitan Yarmush <eitan.yarmush@solo.io> * switch to resource quantities for buffer size Signed-off-by: Eitan Yarmush <eitan.yarmush@solo.io> --------- Signed-off-by: Eitan Yarmush <eitan.yarmush@solo.io> Signed-off-by: urizaf <urizaf@gmail.com> * eitanya/fix-python-release (#698) Signed-off-by: Eitan Yarmush <eitan.yarmush@solo.io> Signed-off-by: urizaf <urizaf@gmail.com> * EP-685-kmcp (#686) * EP-685-kmcp Signed-off-by: Eitan Yarmush <eitan.yarmush@solo.io> * Update design/EP-685-kmcp.md Signed-off-by: Eitan Yarmush <eitan.yarmush@solo.io> --------- Signed-off-by: Eitan Yarmush <eitan.yarmush@solo.io> Co-authored-by: Lin Sun <lin.sun@solo.io> Signed-off-by: urizaf <urizaf@gmail.com> * use the types defined in pkg/client/model.go Signed-off-by: urizaf <urizaf@gmail.com> * fix(ui): correct link to switch agent from within chat (#667) Signed-off-by: Brian Fox <878612+onematchfox@users.noreply.github.com> Co-authored-by: Peter Jausovec <peterj@users.noreply.github.com> Signed-off-by: urizaf <urizaf@gmail.com> * fix(ui): display description for agent tools when editing an agent (#692) Signed-off-by: Brian Fox <878612+onematchfox@users.noreply.github.com> Co-authored-by: Peter Jausovec <peterj@users.noreply.github.com> Signed-off-by: urizaf <urizaf@gmail.com> * fix(controller): watch secondary resources instead of updating unowned resources (#703) * fix(controller): watch secrets from agents controller Ref: https://book.kubebuilder.io/reference/watching-resources/secondary-resources-not-owned Signed-off-by: Brian Fox <878612+onematchfox@users.noreply.github.com> * fix(controller): watch memory from agents controller Signed-off-by: Brian Fox <878612+onematchfox@users.noreply.github.com> * fix(controller): watch toolservers from agents controller Signed-off-by: Brian Fox <878612+onematchfox@users.noreply.github.com> * fix(controller): watch modelconfig from agent controller Signed-off-by: Brian Fox <878612+onematchfox@users.noreply.github.com> * fix(controller): watch secrets from model config controller Agent watches ModelConfig -> ModelConfig watches Secret Signed-off-by: Brian Fox <878612+onematchfox@users.noreply.github.com> * refactor(controller): consistent error logging Signed-off-by: Brian Fox <878612+onematchfox@users.noreply.github.com> * refactor(controller): remove `reconcileAgents` This isn't needed any more - we only ever reconcile a single agent at a time now. Signed-off-by: Brian Fox <878612+onematchfox@users.noreply.github.com> * fix(controller): ensure api key secret exists for model config Signed-off-by: Brian Fox <878612+onematchfox@users.noreply.github.com> * refactor(controller): explicitly set error to nil for memory status Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Brian Fox <878612+onematchfox@users.noreply.github.com> --------- Signed-off-by: Brian Fox <878612+onematchfox@users.noreply.github.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: urizaf <urizaf@gmail.com> * change to correct Gemini icon Signed-off-by: urizaf <urizaf@gmail.com> --------- Signed-off-by: urizaf-work <uri.zafrir@kaltura.com> Signed-off-by: urizaf <urizaf@gmail.com> Signed-off-by: Brian Fox <878612+onematchfox@users.noreply.github.com> Signed-off-by: Eitan Yarmush <eitan.yarmush@solo.io> Signed-off-by: Dmytro Rashko <dmitriy.rashko@amdocs.com> Co-authored-by: urizaf-work <uri.zafrir@kaltura.com> Co-authored-by: Brian Fox <878612+onematchfox@users.noreply.github.com> Co-authored-by: Eitan Yarmush <eitan.yarmush@solo.io> Co-authored-by: Peter Jausovec <peterj@users.noreply.github.com> Co-authored-by: Dmytro Rashko <dmitriy.rashko@amdocs.com> Co-authored-by: Lin Sun <lin.sun@solo.io> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
diff --git a/go/internal/httpserver/handlers/models.go b/go/internal/httpserver/handlers/models.go
@@ -3,10 +3,12 @@ package handlers
 import (
 	"net/http"
 
-	"github.com/kagent-dev/kagent/go/internal/httpserver/errors"
+	"github.com/kagent-dev/kagent/go/pkg/client/api"
+	kclient "github.com/kagent-dev/kagent/go/pkg/client"
 	ctrllog "sigs.k8s.io/controller-runtime/pkg/log"
 )
 
+
 // ModelHandler handles model requests
 type ModelHandler struct {
 	*Base
@@ -22,8 +24,49 @@ func (h *ModelHandler) HandleListSupportedModels(w ErrorResponseWriter, r *http.
 
 	log.Info("Listing supported models")
 
-	// TODO: Implement this
+	// Create a map of provider names to their supported models
+	// The keys need to match what the UI expects (camelCase for API keys)
+	supportedModels := kclient.ProviderModels{
+		"openAI": {
+			{Name: "gpt-4o", FunctionCalling: true},
+			{Name: "gpt-4-turbo", FunctionCalling: true},
+			{Name: "gpt-4", FunctionCalling: true},
+			{Name: "gpt-3.5-turbo", FunctionCalling: true},
+		},
+		"anthropic": {
+			{Name: "claude-3-opus-20240229", FunctionCalling: true},
+			{Name: "claude-3-sonnet-20240229", FunctionCalling: true},
+			{Name: "claude-3-haiku-20240307", FunctionCalling: true},
+			{Name: "claude-2.1", FunctionCalling: false},
+			{Name: "claude-2.0", FunctionCalling: false},
+		},
+		"azureOpenAI": {
+			{Name: "gpt-4", FunctionCalling: true},
+			{Name: "gpt-35-turbo", FunctionCalling: true},
+		},
+		"ollama": {
+			{Name: "llama2", FunctionCalling: false},
+			{Name: "llama2:13b", FunctionCalling: false},
+			{Name: "llama2:70b", FunctionCalling: false},
+			{Name: "mistral", FunctionCalling: false},
+			{Name: "mixtral", FunctionCalling: false},
+		},
+		"gemini": {
+			{Name: "gemini-pro", FunctionCalling: true},
+			{Name: "gemini-pro-vision", FunctionCalling: false},
+		},
+		"geminiVertexAI": {
+			{Name: "gemini-pro", FunctionCalling: true},
+			{Name: "gemini-pro-vision", FunctionCalling: false},
+		},
+		"anthropicVertexAI": {
+			{Name: "claude-3-opus-20240229", FunctionCalling: true},
+			{Name: "claude-3-sonnet-20240229", FunctionCalling: true},
+			{Name: "claude-3-haiku-20240307", FunctionCalling: true},
+		},
+	}
 
-	w.RespondWithError(errors.NewNotImplementedError("Not implemented", nil))
-	return
+	log.Info("Successfully listed supported models", "count", len(supportedModels))
+	data := api.NewResponse(supportedModels, "Successfully listed supported models", false)
+	RespondWithJSON(w, http.StatusOK, data)
 }
diff --git a/go/internal/httpserver/handlers/providers.go b/go/internal/httpserver/handlers/providers.go
@@ -97,6 +97,9 @@ func (h *ProviderHandler) HandleListSupportedModelProviders(w ErrorResponseWrite
 		{v1alpha1.ModelProviderAnthropic, reflect.TypeOf(v1alpha1.AnthropicConfig{})},
 		{v1alpha1.ModelProviderAzureOpenAI, reflect.TypeOf(v1alpha1.AzureOpenAIConfig{})},
 		{v1alpha1.ModelProviderOllama, reflect.TypeOf(v1alpha1.OllamaConfig{})},
+		{v1alpha1.ModelProviderGemini, reflect.TypeOf(v1alpha1.GeminiConfig{})},
+		{v1alpha1.ModelProviderGeminiVertexAI, reflect.TypeOf(v1alpha1.GeminiVertexAIConfig{})},
+		{v1alpha1.ModelProviderAnthropicVertexAI, reflect.TypeOf(v1alpha1.AnthropicVertexAIConfig{})},
 	}
 
 	providersResponse := []map[string]interface{}{}
diff --git a/go/pkg/client/model.go b/go/pkg/client/model.go
@@ -1,8 +1,23 @@
 package client
 
+import (
+	"context"
+
+	"github.com/kagent-dev/kagent/go/pkg/client/api"
+)
+
+// ModelInfo represents information about a model
+type ModelInfo struct {
+	Name           string `json:"name"`
+	FunctionCalling bool   `json:"function_calling"`
+}
+
+// ProviderModels represents a map of provider names to their supported models
+type ProviderModels map[string][]ModelInfo
+
 // Model defines the model operations
 type Model interface {
-	// ListSupportedModels(ctx context.Context) (*api.StandardResponse[*client.ProviderModels], error)
+	ListSupportedModels(ctx context.Context) (*api.StandardResponse[ProviderModels], error)
 }
 
 // modelClient handles model-related requests
@@ -16,16 +31,16 @@ func NewModelClient(client *BaseClient) Model {
 }
 
 // ListSupportedModels lists all supported models
-// func (c *modelClient) ListSupportedModels(ctx context.Context) (*api.StandardResponse[*client.ProviderModels], error) {
-// 	resp, err := c.client.Get(ctx, "/api/models", "")
-// 	if err != nil {
-// 		return nil, err
-// 	}
-
-// 	var models api.StandardResponse[*client.ProviderModels]
-// 	if err := DecodeResponse(resp, &models); err != nil {
-// 		return nil, err
-// 	}
-
-// 	return &models, nil
-// }
+func (c *modelClient) ListSupportedModels(ctx context.Context) (*api.StandardResponse[ProviderModels], error) {
+	resp, err := c.client.Get(ctx, "/api/models", "")
+	if err != nil {
+		return nil, err
+	}
+
+	var models api.StandardResponse[ProviderModels]
+	if err := DecodeResponse(resp, &models); err != nil {
+		return nil, err
+	}
+
+	return &models, nil
+}
diff --git a/ui/src/components/ModelProviderCombobox.tsx b/ui/src/components/ModelProviderCombobox.tsx
@@ -10,6 +10,7 @@ import { OpenAI } from './icons/OpenAI';
 import { Anthropic } from './icons/Anthropic';
 import { Ollama } from './icons/Ollama';
 import { Azure } from './icons/Azure';
+import { Gemini } from './icons/Gemini';
 
 interface ComboboxOption {
     label: string; // e.g., "OpenAI - gpt-4o"
@@ -59,6 +60,9 @@ export function ModelProviderCombobox({
         'anthropic': Anthropic,
         'ollama': Ollama,
         'azure-openai': Azure,
+        'gemini': Gemini,
+        'gemini-vertex-ai': Gemini,
+        'anthropic-vertex-ai': Anthropic,
     };
 
     const getProviderIcon = (providerKey: ModelProviderKey | undefined): React.ReactNode | null => {
diff --git a/ui/src/components/icons/Gemini.tsx b/ui/src/components/icons/Gemini.tsx
@@ -0,0 +1,5 @@
+export function Gemini({ className }: { className?: string }) {
+  return (
+    <svg fill="none" xmlns="http://www.w3.org/2000/svg" viewBox="0 0 16 16" className={className}><path d="M16 8.016A8.522 8.522 0 008.016 16h-.032A8.521 8.521 0 000 8.016v-.032A8.521 8.521 0 007.984 0h.032A8.522 8.522 0 0016 7.984v.032z" fill="url(#prefix__paint0_radial_980_20147)"/><defs><radialGradient id="prefix__paint0_radial_980_20147" cx="0" cy="0" r="1" gradientUnits="userSpaceOnUse" gradientTransform="matrix(16.1326 5.4553 -43.70045 129.2322 1.588 6.503)"><stop offset=".067" stop-color="#9168C0"/><stop offset=".343" stop-color="#5684D1"/><stop offset=".672" stop-color="#1BA1E3"/></radialGradient></defs></svg>
+  );
+}
diff --git a/ui/src/components/onboarding/steps/ModelConfigStep.tsx b/ui/src/components/onboarding/steps/ModelConfigStep.tsx
@@ -26,7 +26,7 @@ import { k8sRefUtils } from '@/lib/k8sUtils';
 import { K8S_AGENT_DEFAULTS } from '../OnboardingWizard';
 import { NamespaceCombobox } from "@/components/NamespaceCombobox";
 
-const modelProviders = ["openai", "azure-openai", "anthropic", "ollama"] as const;
+const modelProviders = ["openai", "azure-openai", "anthropic", "ollama", "gemini", "gemini-vertex-ai", "anthropic-vertex-ai"] as const;
 const modelConfigSchema = z.object({
     providerName: z.enum(modelProviders, { required_error: "Please select a provider." }),
     configName: z.string().min(1, "Configuration name is required."),
@@ -190,6 +190,9 @@ export function ModelConfigStep({
                 payload.azureOpenAI = { azureEndpoint: values.azureEndpoint || "", apiVersion: values.azureApiVersion || "" }; break;
             case 'openai': payload.openAI = {}; break;
             case 'anthropic': payload.anthropic = {}; break;
+            case 'gemini': payload.gemini = {}; break;
+            case 'gemini-vertex-ai': payload.geminiVertexAI = {}; break;
+            case 'anthropic-vertex-ai': payload.anthropicVertexAI = {}; break;
             case 'ollama':
                 const modelTag = values.modelTag?.trim() || "";
                 if (modelTag && modelTag !== OLLAMA_DEFAULT_TAG) {
diff --git a/ui/src/lib/providers.ts b/ui/src/lib/providers.ts
@@ -1,6 +1,6 @@
 
-export type BackendModelProviderType = "OpenAI" | "AzureOpenAI" | "Anthropic" | "Ollama";
-export const modelProviders = ["openai", "azure-openai", "anthropic", "ollama"] as const;
+export type BackendModelProviderType = "OpenAI" | "AzureOpenAI" | "Anthropic" | "Ollama" | "Gemini" | "GeminiVertexAI" | "AnthropicVertexAI";
+export const modelProviders = ["openai", "azure-openai", "anthropic", "ollama", "gemini", "gemini-vertex-ai", "anthropic-vertex-ai"] as const;
 export type ModelProviderKey = typeof modelProviders[number];
 
 
@@ -41,6 +41,27 @@ export const PROVIDERS_INFO: {
         modelDocsLink: "https://github.com/kagent-dev/autogen/blob/main/python/packages/autogen-ext/src/autogen_ext/models/ollama/_model_info.py",
         help: "No API key needed. Ensure Ollama is running and accessible."
     },
+    gemini: {
+        name: "Gemini",
+        type: "Gemini",
+        apiKeyLink: "https://ai.google.dev/",
+        modelDocsLink: "https://ai.google.dev/docs",
+        help: "Get your API key from the Google AI Studio."
+    },
+    "gemini-vertex-ai": {
+        name: "Gemini Vertex AI",
+        type: "GeminiVertexAI",
+        apiKeyLink: "https://cloud.google.com/vertex-ai",
+        modelDocsLink: "https://cloud.google.com/vertex-ai/docs",
+        help: "Configure your Google Cloud project and credentials for Vertex AI."
+    },
+    "anthropic-vertex-ai": {
+        name: "Anthropic Vertex AI",
+        type: "AnthropicVertexAI",
+        apiKeyLink: "https://cloud.google.com/vertex-ai",
+        modelDocsLink: "https://cloud.google.com/vertex-ai/docs",
+        help: "Configure your Google Cloud project and credentials for Vertex AI."
+    },
 };
 
 export const isValidProviderInfoKey = (key: string): key is ModelProviderKey => {
@@ -54,6 +75,9 @@ export const getApiKeyForProviderFormKey = (providerFormKey: ModelProviderKey):
         case 'azure-openai': return 'azureOpenAI';
         case 'anthropic': return 'anthropic';
         case 'ollama': return 'ollama';
+        case 'gemini': return 'gemini';
+        case 'gemini-vertex-ai': return 'geminiVertexAI';
+        case 'anthropic-vertex-ai': return 'anthropicVertexAI';
         default: return providerFormKey;
     }
 };
diff --git a/ui/src/types/index.ts b/ui/src/types/index.ts
@@ -81,6 +81,32 @@ export interface OllamaConfigPayload {
   options?: Record<string, string>;
 }
 
+export interface GeminiConfigPayload {
+  baseUrl?: string;
+  temperature?: string;
+  maxTokens?: number;
+  topP?: string;
+  topK?: number;
+}
+
+export interface GeminiVertexAIConfigPayload {
+  project?: string;
+  location?: string;
+  temperature?: string;
+  maxTokens?: number;
+  topP?: string;
+  topK?: number;
+}
+
+export interface AnthropicVertexAIConfigPayload {
+  project?: string;
+  location?: string;
+  temperature?: string;
+  maxTokens?: number;
+  topP?: string;
+  topK?: number;
+}
+
 export interface CreateModelConfigPayload {
   ref: string;
   provider: Pick<Provider, "name" | "type">;
@@ -90,6 +116,9 @@ export interface CreateModelConfigPayload {
   anthropic?: AnthropicConfigPayload;
   azureOpenAI?: AzureOpenAIConfigPayload;
   ollama?: OllamaConfigPayload;
+  gemini?: GeminiConfigPayload;
+  geminiVertexAI?: GeminiVertexAIConfigPayload;
+  anthropicVertexAI?: AnthropicVertexAIConfigPayload;
 }
 
 export interface UpdateModelConfigPayload {
@@ -100,6 +129,9 @@ export interface UpdateModelConfigPayload {
   anthropic?: AnthropicConfigPayload;
   azureOpenAI?: AzureOpenAIConfigPayload;
   ollama?: OllamaConfigPayload;
+  gemini?: GeminiConfigPayload;
+  geminiVertexAI?: GeminiVertexAIConfigPayload;
+  anthropicVertexAI?: AnthropicVertexAIConfigPayload;
 }
 
 export interface MemoryResponse {

Original file line number	Diff line number	Diff line change
`@@ -97,6 +97,9 @@ func (h *ProviderHandler) HandleListSupportedModelProviders(w ErrorResponseWrite`
`97`	`97`	`{v1alpha1.ModelProviderAnthropic, reflect.TypeOf(v1alpha1.AnthropicConfig{})},`
`98`	`98`	`{v1alpha1.ModelProviderAzureOpenAI, reflect.TypeOf(v1alpha1.AzureOpenAIConfig{})},`
`99`	`99`	`{v1alpha1.ModelProviderOllama, reflect.TypeOf(v1alpha1.OllamaConfig{})},`
	`100`	`+ {v1alpha1.ModelProviderGemini, reflect.TypeOf(v1alpha1.GeminiConfig{})},`
	`101`	`+ {v1alpha1.ModelProviderGeminiVertexAI, reflect.TypeOf(v1alpha1.GeminiVertexAIConfig{})},`
	`102`	`+ {v1alpha1.ModelProviderAnthropicVertexAI, reflect.TypeOf(v1alpha1.AnthropicVertexAIConfig{})},`
`100`	`103`	`}`
`101`	`104`
`102`	`105`	`providersResponse := []map[string]interface{}{}`