Support DeepSeek-R1 Qwen #3431

cebtenzzre · 2025-01-28T21:35:44Z

This PR adds DeepSeek-R1 Qwen support by:

Rebasing llama.cpp on a slightly newer upstream commit (ggml-org/llama.cpp@a39ab216a from Oct 2 instead of ggml-org/llama.cpp@95bc82fbc from Sep 26)
Cherry-picking the one-line change required to load the DeepSeek GGUF
Adding a prompt template substitution that doesn't rely on things like namespace and str.split which Jinja2Cpp doesn't implement
Implementing a regex_replace function compatible with others seen in the wild to support the necessary text manipulation in the DeepSeek template.

Tested on DeepSeek-R1 Qwen 7B. Will update with results with 14B and 32B soon.

Note that the replacement prompt template removes the {{ bos_token }} from the Jinja template to avoid a double BOS, since at least the bartowski GGUF I tried specifies tokenizer.ggml.add_bos_token of true already.

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

cebtenzzre · 2025-01-28T21:40:10Z

The 14B and 32B models also work as expected. All three models (7B, 14B, 32B) work on both CUDA and Kompute.
edit: Also tested 1.5B successfully.

WuShichao · 2025-01-29T10:09:53Z

How about Vulkan backend? I can ran CUDA with DeepSeek-R1-Distill-Llama-8B-Q4.

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

cebtenzzre added 4 commits January 28, 2025 16:22

llama.cpp: update submodule for DeepSeek-R1 Qwen vocab support

aaa7c11

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

chatllm: implement regex_replace filter

3f67da2

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

jinja_replacements: add fixed template for DeepSeek R1 Qwen 7B

Loading
Loading status checks…

bae23e7

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

add changelog entry

Loading
Loading status checks…

11ae5e4

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

cebtenzzre marked this pull request as ready for review January 28, 2025 21:36

cebtenzzre requested a review from manyoso January 28, 2025 21:40

manyoso approved these changes Jan 28, 2025

View reviewed changes

manyoso merged commit 343a4b6 into main Jan 29, 2025
4 of 13 checks passed

cgivre pushed a commit to cgivre/gpt4all that referenced this pull request Feb 4, 2025

Support DeepSeek-R1 Qwen (nomic-ai#3431)

32badd2

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

cebtenzzre deleted the deepseek-r1 branch February 10, 2025 16:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support DeepSeek-R1 Qwen #3431

Support DeepSeek-R1 Qwen #3431

cebtenzzre commented Jan 28, 2025

cebtenzzre commented Jan 28, 2025 •

edited

Loading

WuShichao commented Jan 29, 2025

Support DeepSeek-R1 Qwen #3431

Support DeepSeek-R1 Qwen #3431

Conversation

cebtenzzre commented Jan 28, 2025

cebtenzzre commented Jan 28, 2025 • edited Loading

WuShichao commented Jan 29, 2025

cebtenzzre commented Jan 28, 2025 •

edited

Loading