fix: add gguf wrapper in quantized gemma3 #3220

junjunjd · 2025-12-02T02:28:59Z

Add Gguf wrapper in quantized Gemma3 loader to fix missing metadata lookups.

ivarflakstad · 2025-12-23T11:49:57Z

@junjunjd Thanks for this!
I tried running the embedding-gemma model mentioned in the issue and unfortunately this doesn't fix the issue. Luckily the problem isn't has complex as it may seem.
Default gemma3 uses the prefix gemma3.*, while the embedding-gemma model uses gemma-embedding.* (ref).
ModelWeights in quantized_gemma3.rs is hard coded to use gemma3.
I'll post the findings in the issue as well.

fix: add gguf wrapper in quantized gemma3

900e0d8

junjunjd force-pushed the fix/gemma3-gguf-loader branch from abd0e3d to 900e0d8 Compare December 6, 2025 04:55

ivarflakstad mentioned this pull request Dec 22, 2025

cannot find gemma3.attention.head_count in metadata #3215

Open

Merge branch 'main' into fix/gemma3-gguf-loader

3fabab8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: add gguf wrapper in quantized gemma3 #3220

fix: add gguf wrapper in quantized gemma3 #3220

Uh oh!

junjunjd commented Dec 2, 2025

Uh oh!

ivarflakstad commented Dec 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fix: add gguf wrapper in quantized gemma3 #3220

Are you sure you want to change the base?

fix: add gguf wrapper in quantized gemma3 #3220

Uh oh!

Conversation

junjunjd commented Dec 2, 2025

Uh oh!

ivarflakstad commented Dec 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants