| 11806 | 11806 | - filename: TheDrummer_Rivermind-Lux-12B-v1-Q4_K_M.gguf |
| 11807 | 11807 | sha256: ccaf2e49661ba692a27f06871fb792ff8b8c9632afe92ad89600e389f4ee8fc2 |
| 11808 | 11808 | uri: huggingface://bartowski/TheDrummer_Rivermind-Lux-12B-v1-GGUF/TheDrummer_Rivermind-Lux-12B-v1-Q4_K_M.gguf |
| 11809 | +- !!merge <<: *mistral03 |
| 11810 | + name: "mistralai_devstral-small-2505" |
| 11811 | + urls: |
| 11812 | + - https://huggingface.co/mistralai/Devstral-Small-2505 |
| 11813 | + - https://huggingface.co/bartowski/mistralai_Devstral-Small-2505-GGUF |
| 11814 | + description: | |
| 11815 | + Devstral is an agentic LLM for software engineering tasks built under a collaboration between Mistral AI and All Hands AI 🙌. Devstral excels at using tools to explore codebases, editing multiple files, and powering software engineering agents. The model achieves remarkable performance on SWE-bench, which positions it as the #1 open source model on this benchmark. |
| 11816 | + |
| 11817 | + It is fine-tuned from Mistral-Small-3.1 and therefore has a long context window of up to 128k tokens. As a coding agent, Devstral is text-only: the vision encoder was removed before fine-tuning from Mistral-Small-3.1. |
| 11818 | + |
| 11819 | + For enterprises requiring specialized capabilities (increased context, domain-specific knowledge, etc.), we will release commercial models beyond what Mistral AI contributes to the community. |
| 11820 | + |
| 11821 | + Learn more about Devstral in our blog post. |
| 11822 | + Key Features: |
| 11823 | + |
| 11824 | + Agentic coding: Devstral is designed to excel at agentic coding tasks, making it a great choice for software engineering agents. |
| 11825 | + Lightweight: with its compact size of just 24 billion parameters, Devstral is light enough to run on a single RTX 4090 or a Mac with 32GB RAM, making it an appropriate model for local deployment and on-device use. |
| 11826 | + Apache 2.0 License: Open license allowing usage and modification for both commercial and non-commercial purposes. |
| 11827 | + Context Window: A 128k context window. |
| 11828 | + Tokenizer: Utilizes a Tekken tokenizer with a 131k vocabulary size. |
| 11829 | + overrides: |
| 11830 | + mmproj: mmproj-mistralai_Devstral-Small-2505-f16.gguf |
| 11831 | + parameters: |
| 11832 | + model: mistralai_Devstral-Small-2505-Q4_K_M.gguf |
| 11833 | + files: |
| 11834 | + - filename: mistralai_Devstral-Small-2505-Q4_K_M.gguf |
| 11835 | + sha256: 6bcda763d93e24e1aa37972869d58dccb3cf79d6a42466fc39094ebbe3a72185 |
| 11836 | + uri: huggingface://bartowski/mistralai_Devstral-Small-2505-GGUF/mistralai_Devstral-Small-2505-Q4_K_M.gguf |
| 11837 | + - filename: mmproj-mistralai_Devstral-Small-2505-f16.gguf |
| 11838 | + sha256: f5add93ad360ef6ccba571bba15e8b4bd4471f3577440a8b18785f8707d987ed |
| 11839 | + uri: huggingface://bartowski/mistralai_Devstral-Small-2505-GGUF/mmproj-mistralai_Devstral-Small-2505-f16.gguf |
| 11809 | 11840 | - &mudler |
| 11810 | 11841 | url: "github:mudler/LocalAI/gallery/mudler.yaml@master" ### START mudler's LocalAI specific-models |
| 11811 | 11842 | name: "LocalAI-llama3-8b-function-call-v0.2" |
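
Each `files` entry added above pairs a `filename` with a `sha256` digest. As a rough illustration of how such a digest could be checked by hand after a download, here is a minimal Python sketch. The helper names `sha256_of_file` and `verify_file` are hypothetical and are not part of LocalAI; the sketch assumes the GGUF file has already been downloaded to the current directory, and it streams the file in chunks because these models are several gigabytes in size.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=1024 * 1024):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes (EOF).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_file(path, expected_sha256):
    """Return True if the file's digest matches the expected hex string."""
    return sha256_of_file(path) == expected_sha256.strip().lower()


if __name__ == "__main__":
    # File name and digest copied from the gallery entry in this diff.
    # The local path is an assumption: adjust it to wherever the file lives.
    model = Path("mistralai_Devstral-Small-2505-Q4_K_M.gguf")
    expected = "6bcda763d93e24e1aa37972869d58dccb3cf79d6a42466fc39094ebbe3a72185"
    if model.exists():
        print("checksum ok" if verify_file(model, expected) else "checksum MISMATCH")
    else:
        print(f"{model} not found; nothing to verify")
```

The same check applies to the `mmproj` file by substituting its `filename` and `sha256` values from the entry.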
|