Skip to content

Commit 901dba6

Browse files
authored
chore(model gallery): add gemma-3-27b-it-qat (#5124)
Signed-off-by: Ettore Di Giacinto <[email protected]>
1 parent b88a7a4 commit 901dba6

File tree

1 file changed

+18
-0
lines changed

1 file changed

+18
-0
lines changed

Diff for: gallery/index.yaml

+18
Original file line numberDiff line numberDiff line change
@@ -114,6 +114,24 @@
114114
- filename: gemma-3-4b-it-q4_0.gguf
115115
sha256: 2ca493d426ffcb43db27132f183a0230eda4a3621e58b328d55b665f1937a317
116116
uri: huggingface://vinimuchulski/gemma-3-4b-it-qat-q4_0-gguf/gemma-3-4b-it-q4_0.gguf
117+
- !!merge <<: *gemma3
118+
name: "gemma-3-27b-it-qat"
119+
urls:
120+
- https://huggingface.co/google/gemma-3-27b-it
121+
- https://huggingface.co/vinimuchulski/gemma-3-27b-it-qat-q4_0-gguf
122+
description: |
123+
This model corresponds to the 27B instruction-tuned version of the Gemma 3 model in GGUF format using Quantization Aware Training (QAT). The GGUF corresponds to Q4_0 quantization.
124+
125+
Thanks to QAT, the model is able to preserve similar quality as bfloat16 while significantly reducing the memory requirements to load the model.
126+
127+
You can find the half-precision version here.
128+
overrides:
129+
parameters:
130+
model: gemma-3-27b-it-q4_0.gguf
131+
files:
132+
- filename: gemma-3-27b-it-q4_0.gguf
133+
sha256: 45e586879bc5f5d7a5b6527e812952057ce916d9fc7ba16f7262ec9972c9e2a2
134+
uri: huggingface://vinimuchulski/gemma-3-27b-it-qat-q4_0-gguf/gemma-3-27b-it-q4_0.gguf
117135
- !!merge <<: *gemma3
118136
name: "qgallouedec_gemma-3-27b-it-codeforces-sft"
119137
urls:

0 commit comments

Comments
 (0)