Commit 8a8311c

Merge pull request #911 from kvcache-ai/patch_v0.2.3post2
🔧 update multi-gpu-fp8-linear and multi-gpu marlin yaml
2 parents 0e93a09 + 19f058e commit 8a8311c

2 files changed: +3 -3 lines changed

ktransformers/optimize/optimize_rules/DeepSeek-V3-Chat-multi-gpu-fp8-linear-ggml-experts.yaml

Lines changed: 2 additions & 2 deletions
@@ -10,15 +10,15 @@
     name: "^model\\.layers\\.(0|[1-9]|[12][0-9])\\."
     class: ktransformers.models.modeling_deepseek_v3.DeepseekV3RotaryEmbedding
   replace:
-    class: ktransformers.operators.RoPE.KMoEGateDeepSeekV3
+    class: ktransformers.operators.RoPE.YarnRotaryEmbeddingV3
     kwargs:
       generate_device: "cuda:0"
       prefill_device: "cuda:0"
 - match:
     name: "^model\\.layers\\.([3456][0-9])\\."
     class: ktransformers.models.modeling_deepseek_v3.DeepseekV3RotaryEmbedding
   replace:
-    class: ktransformers.operators.RoPE.KMoEGateDeepSeekV3
+    class: ktransformers.operators.RoPE.YarnRotaryEmbeddingV3
     kwargs:
       generate_device: "cuda:1"
       prefill_device: "cuda:1"
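
The two `match.name` patterns in this file split the rotary-embedding rules between `cuda:0` and `cuda:1` by layer index. The following is a minimal sketch for checking which layers each pattern selects. It is not part of the commit: the module-name suffix `self_attn.rotary_emb` and the layer count of 61 are assumptions made for illustration, and the patterns are written as the regex the YAML double-quoted strings unescape to.

```python
import re

# Patterns copied from the `match.name` fields in the hunk above.
# In the YAML, "\\." is a double-quoted escape for the regex "\.".
GPU0_PATTERN = re.compile(r"^model\.layers\.(0|[1-9]|[12][0-9])\.")
GPU1_PATTERN = re.compile(r"^model\.layers\.([3456][0-9])\.")


def layers_matching(pattern, num_layers):
    """Return the layer indices whose module name matches `pattern`."""
    return [
        i
        for i in range(num_layers)
        # Hypothetical module name; only the "model.layers.<i>." prefix matters.
        if pattern.match(f"model.layers.{i}.self_attn.rotary_emb")
    ]


NUM_LAYERS = 61  # assumption: set this to the model's actual layer count

gpu0 = layers_matching(GPU0_PATTERN, NUM_LAYERS)
gpu1 = layers_matching(GPU1_PATTERN, NUM_LAYERS)

print(f"cuda:0 layers: {gpu0[0]}..{gpu0[-1]} ({len(gpu0)} layers)")
print(f"cuda:1 layers: {gpu1[0]}..{gpu1[-1]} ({len(gpu1)} layers)")
```

Under these assumptions the first pattern selects layers 0 through 29 and the second selects layers 30 through 60, with no layer matched by both.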

ktransformers/optimize/optimize_rules/DeepSeek-V3-Chat-multi-gpu-marlin.yaml

Lines changed: 1 addition & 1 deletion
@@ -10,7 +10,7 @@
     name: "^model\\.layers\\.(0|[1-9]|[12][0-9])\\."
     class: ktransformers.models.modeling_deepseek_v3.DeepseekV3RotaryEmbedding
   replace:
-    class: ktransformers.operators.RoPE.KMoEGateDeepSeekV3
+    class: ktransformers.operators.RoPE.YarnRotaryEmbeddingV3
     kwargs:
       generate_device: "cuda:0"
       prefill_device: "cuda:0"
