Updates LLM performance matrix (#1245)

benironside · web-flow · commit 21777be53d75 · 2025-04-24T21:05:03.000Z
9.0 component of #1206 Updates the LLM matrix to reflect latest changes. [Preview](https://docs-v3-preview.elastic.dev/elastic/docs-content/pull/1245/solutions/security/ai/large-language-model-performance-matrix)
diff --git a/solutions/security/ai/large-language-model-performance-matrix.md b/solutions/security/ai/large-language-model-performance-matrix.md
@@ -24,17 +24,18 @@ Models from third-party LLM providers.
 
 | **Feature** | - | **Assistant - General** | **Assistant - {{esql}} generation** | **Assistant - Alert questions** | **Assistant - Knowledge retrieval** | **Attack Discovery** | **Automatic Migration** |
 | --- | --- | --- | --- | --- | --- | --- | --- |
-| **Model** | **Claude 3: Opus** | Excellent | Excellent | Excellent | Good      | Great     | Good
-|  | **Claude 3.7: Sonnet**      | Excellent | Excellent | Excellent | Excellent | Excellent | Excellent
-|  | **Claude 3.5: Sonnet v2**   | Excellent | Excellent | Excellent | Excellent | Great     | Excellent
-|  | **Claude 3.5: Sonnet**      | Excellent | Excellent | Excellent | Excellent | Excellent | Excellent
-|  | **Claude 3.5: Haiku**       | Excellent | Excellent | Excellent | Excellent | Poor      | Poor
-|  | **Claude 3: Haiku**         | Excellent | Excellent | Excellent | Excellent | Poor      | Poor
-|  | **GPT-4o**                  | Excellent | Excellent | Excellent | Excellent | Great     | Great
-|  | **GPT-4o-mini**             | Excellent | Great     | Great     | Great     | Poor      | Good
-|  | **Gemini 1.5 Pro 002**      | Excellent | Excellent | Excellent | Excellent | Excellent | Great
-|  | **Gemini 1.5 Flash 002**    | Excellent | Poor      | Good      | Excellent | Poor      | Excellent
-|  | **Gemini 2.0 Flash 001**    | Excellent | Excellent | Excellent | Excellent | Excellent | Excellent
+| **Model** | **Claude 3.7: Sonnet**      | Excellent | Excellent | Excellent | Excellent | Excellent | Excellent
+|           | **Claude 3.5: Sonnet v2**   | Excellent | Excellent | Excellent | Excellent | Great     | Excellent
+|           | **Claude 3.5: Sonnet**      | Excellent | Excellent | Excellent | Excellent | Excellent | Excellent
+|           | **Claude 3.5: Haiku**       | Excellent | Excellent | Excellent | Excellent | Poor      | Poor
+|           | **Claude 3: Haiku**         | Excellent | Excellent | Excellent | Excellent | Poor      | Poor
+|           | **GPT-4o**                  | Excellent | Excellent | Excellent | Excellent | Great     | Great
+|           | **GPT-4o-mini**             | Excellent | Great     | Great     | Great     | Poor      | Good
+|           | **GPT-4.1**                 | Excellent | Excellent | Excellent | Excellent | Excellent | Excellent
+|           | **Gemini 1.5 Pro 002**      | Excellent | Excellent | Excellent | Excellent | Excellent | Great
+|           | **Gemini 1.5 Flash 002**    | Excellent | Poor      | Good      | Excellent | Poor      | Excellent
+|           | **Gemini 2.0 Flash 001**    | Excellent | Excellent | Excellent | Excellent | Excellent | Excellent
+|           | **Gemini 2.5 Pro**          | Excellent | Excellent | Excellent | Excellent | Excellent | Excellent
 
 
 ## Open-source models [_open_source_models]
@@ -43,7 +44,8 @@ Models you can [deploy yourself](/solutions/security/ai/connect-to-own-local-llm
 
 | **Feature** | - | **Assistant - General** | **Assistant - {{esql}} generation** | **Assistant - Alert questions** | **Assistant - Knowledge retrieval** | **Attack Discovery** | **Automatic Migration**
 | --- | --- | --- | --- | --- | --- | --- |
-| **Model** | **Mistral Nemo** | Good | Good | Great | Good | Poor | Poor |
-|  | **LLama 3.2** | Good | Poor | Good | Poor | Poor | Good  |
-|  | **LLama 3.1 405b** | Good | Great | Good | Good | Poor | Poor |
-|  | **LLama 3.1 70b** | Good | Good | Poor | Poor | Poor | Good |
+| **Model** | **Mistral Nemo**   | Good | Good  | Great | Good | Poor | Poor |
+|           | **Mistral-Small-3.1-24B-Instruct-2503** | Excellent | Poor | Excellent | Excellent | Good | N/A
+|           | **LLama 3.2**      | Good | Poor  | Good  | Poor | Poor | Good |
+|           | **LLama 3.1 405b** | Good | Great | Good  | Good | Poor | Poor |
+|           | **LLama 3.1 70b**  | Good | Good  | Poor  | Poor | Poor | Good |