Skip to content

Commit 233b336

Browse files
authored
chore(model gallery): add mistralai_devstral-small-2505 (#5428)
Signed-off-by: Ettore Di Giacinto <[email protected]>
1 parent c587ac0 commit 233b336

File tree

1 file changed

+31
-0
lines changed

1 file changed

+31
-0
lines changed

gallery/index.yaml

Lines changed: 31 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -11806,6 +11806,37 @@
1180611806
- filename: TheDrummer_Rivermind-Lux-12B-v1-Q4_K_M.gguf
1180711807
sha256: ccaf2e49661ba692a27f06871fb792ff8b8c9632afe92ad89600e389f4ee8fc2
1180811808
uri: huggingface://bartowski/TheDrummer_Rivermind-Lux-12B-v1-GGUF/TheDrummer_Rivermind-Lux-12B-v1-Q4_K_M.gguf
11809+
- !!merge <<: *mistral03
11810+
name: "mistralai_devstral-small-2505"
11811+
urls:
11812+
- https://huggingface.co/mistralai/Devstral-Small-2505
11813+
- https://huggingface.co/bartowski/mistralai_Devstral-Small-2505-GGUF
11814+
description: |
11815+
Devstral is an agentic LLM for software engineering tasks built under a collaboration between Mistral AI and All Hands AI 🙌. Devstral excels at using tools to explore codebases, editing multiple files and power software engineering agents. The model achieves remarkable performance on SWE-bench which positionates it as the #1 open source model on this benchmark.
11816+
11817+
It is finetuned from Mistral-Small-3.1, therefore it has a long context window of up to 128k tokens. As a coding agent, Devstral is text-only and before fine-tuning from Mistral-Small-3.1 the vision encoder was removed.
11818+
11819+
For enterprises requiring specialized capabilities (increased context, domain-specific knowledge, etc.), we will release commercial models beyond what Mistral AI contributes to the community.
11820+
11821+
Learn more about Devstral in our blog post.
11822+
Key Features:
11823+
11824+
Agentic coding: Devstral is designed to excel at agentic coding tasks, making it a great choice for software engineering agents.
11825+
lightweight: with its compact size of just 24 billion parameters, Devstral is light enough to run on a single RTX 4090 or a Mac with 32GB RAM, making it an appropriate model for local deployment and on-device use.
11826+
Apache 2.0 License: Open license allowing usage and modification for both commercial and non-commercial purposes.
11827+
Context Window: A 128k context window.
11828+
Tokenizer: Utilizes a Tekken tokenizer with a 131k vocabulary size.
11829+
overrides:
11830+
mmproj: mmproj-mistralai_Devstral-Small-2505-f16.gguf
11831+
parameters:
11832+
model: mistralai_Devstral-Small-2505-Q4_K_M.gguf
11833+
files:
11834+
- filename: mistralai_Devstral-Small-2505-Q4_K_M.gguf
11835+
sha256: 6bcda763d93e24e1aa37972869d58dccb3cf79d6a42466fc39094ebbe3a72185
11836+
uri: huggingface://bartowski/mistralai_Devstral-Small-2505-GGUF/mistralai_Devstral-Small-2505-Q4_K_M.gguf
11837+
- filename: mmproj-mistralai_Devstral-Small-2505-f16.gguf
11838+
sha256: f5add93ad360ef6ccba571bba15e8b4bd4471f3577440a8b18785f8707d987ed
11839+
uri: huggingface://bartowski/mistralai_Devstral-Small-2505-GGUF/mmproj-mistralai_Devstral-Small-2505-f16.gguf
1180911840
- &mudler
1181011841
url: "github:mudler/LocalAI/gallery/mudler.yaml@master" ### START mudler's LocalAI specific-models
1181111842
name: "LocalAI-llama3-8b-function-call-v0.2"

0 commit comments

Comments
 (0)