📋 Model Description


license: apache-2.0

Grok-1 GGUF Quantizations

This repository contains unofficial GGUF quantizations of Grok-1, compatible with llama.cpp as of PR #6204 ("Add grok-1 support").

Updates

#### Native Split Support in llama.cpp

With native split support, there is no need to merge the split files before use. Download all splits and run llama.cpp with the first split, just as you previously would with a merged file. It detects the remaining splits and loads them as well.
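As a minimal sketch of that workflow, the POSIX shell helpers below list the split names for one quantization and check that every split is present before llama.cpp is started. The function names `split_names` and `check_splits` are hypothetical helpers, not part of llama.cpp; the file name pattern is taken from the files in this repository.

```shell
#!/bin/sh
# Print the expected split file names for one quantization.
# $1: quant name as used in the file names (e.g. IQ3_XS)
# $2: number of splits (9 for the quants in this repository)
split_names() {
    quant=$1
    total=$2
    i=1
    while [ "$i" -le "$total" ]; do
        printf 'grok-1-%s-split-%05d-of-%05d.gguf\n' "$quant" "$i" "$total"
        i=$((i + 1))
    done
}

# Report every split that is missing from a directory.
# Returns non-zero if at least one split is absent.
# $1: directory, $2: quant name, $3: number of splits
check_splits() {
    dir=$1
    missing=0
    for name in $(split_names "$2" "$3"); do
        if [ ! -f "$dir/$name" ]; then
            echo "missing: $dir/$name" >&2
            missing=1
        fi
    done
    return "$missing"
}
```

For example, `check_splits models IQ3_XS 9 && server --model models/grok-1-IQ3_XS-split-00001-of-00009.gguf -ngl 999` would start the server only when all nine splits are in `models/`.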

#### Direct Split Download from Hugging Face using llama.cpp

llama.cpp can also download the splits directly from Hugging Face, so a single command both downloads and runs the model:

server \
    --hf-repo Arki05/Grok-1-GGUF \
    --hf-file grok-1-IQ3_XS-split-00001-of-00009.gguf \
    --model models/grok-1-IQ3_XS-split-00001-of-00009.gguf \
    -ngl 999

And that is very cool (@phymbert)
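If the splits should be fetched ahead of time instead, a manual download could look roughly like the sketch below. It assumes the Hub serves files under the usual `https://huggingface.co/<repo>/resolve/main/<file>` layout and that `curl` is installed; `split_urls` and `download_splits` are hypothetical helper names introduced only for this example.

```shell
#!/bin/sh
# Print one download URL per split.
# Assumption: files are served under <repo>/resolve/main/<file>.
# $1: repository (e.g. Arki05/Grok-1-GGUF), $2: quant name, $3: number of splits
split_urls() {
    repo=$1
    quant=$2
    total=$3
    i=1
    while [ "$i" -le "$total" ]; do
        printf 'https://huggingface.co/%s/resolve/main/grok-1-%s-split-%05d-of-%05d.gguf\n' \
            "$repo" "$quant" "$i" "$total"
        i=$((i + 1))
    done
}

# Download every split into a target directory.
# -L follows redirects, -C - resumes a partially downloaded file.
# $1: target directory, remaining arguments are passed to split_urls
download_splits() {
    dest=$1
    shift
    mkdir -p "$dest"
    split_urls "$@" | while read -r url; do
        curl -L -C - -o "$dest/${url##*/}" "$url"
    done
}
```

Calling `download_splits models Arki05/Grok-1-GGUF IQ3_XS 9` would place the nine IQ3_XS splits in `models/`, after which the first split can be passed to llama.cpp with `--model` as usual.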

Available Quantizations

The following Quantizations are currently available for download:

| Quant | Split Files | Size |
| --- | --- | --- |
| Q2_K | 1-of-9, 2-of-9, 3-of-9, 4-of-9, 5-of-9, 6-of-9, 7-of-9, 8-of-9, 9-of-9 | 112.4 GB |
| IQ3_XS | 1-of-9, 2-of-9, 3-of-9, 4-of-9, 5-of-9, 6-of-9, 7-of-9, 8-of-9, 9-of-9 | 125.4 GB |
| Q4_K | 1-of-9, 2-of-9, 3-of-9, 4-of-9, 5-of-9, 6-of-9, 7-of-9, 8-of-9, 9-of-9 | 186.0 GB |
| Q6_K | 1-of-9, 2-of-9, 3-of-9, 4-of-9, 5-of-9, 6-of-9, 7-of-9, 8-of-9, 9-of-9 | 259.8 GB |

I would recommend the IQ3_XS version for now.

More quantizations will be uploaded soon. All current quants were created without an importance matrix.
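When choosing between the quants, the total sizes can serve as a rough lower bound on the storage needed. The sketch below picks the largest quant that fits a given budget in GB. The sizes are copied from the list of available quantizations; `largest_quant_within` is a hypothetical helper, and the budget is whatever the caller supplies, not a measured requirement.

```shell
#!/bin/sh
# Print the largest quant whose total size fits within a budget (in GB).
# Exits with status 1 if none of the listed quants fits.
# $1: budget in GB
largest_quant_within() {
    awk -v budget="$1" '
        BEGIN { best = 0 }
        # Keep the biggest entry that does not exceed the budget.
        $2 + 0 <= budget + 0 && $2 + 0 > best { best = $2 + 0; name = $1 }
        END { if (name != "") print name; else exit 1 }
    ' <<'EOF'
Q2_K 112.4
IQ3_XS 125.4
Q4_K 186.0
Q6_K 259.8
EOF
}
```

With this, `largest_quant_within 130` selects IQ3_XS, while a budget below 112.4 GB yields no output and a non-zero exit status.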

📂 GGUF File List

| Filename | Tags | Size |
| --- | --- | --- |
| grok-1-IQ3_XS-split-00001-of-00009.gguf | Recommended, LFS, Q3 | 16.33 GB |
| grok-1-IQ3_XS-split-00002-of-00009.gguf | LFS, Q3 | 14.25 GB |
| grok-1-IQ3_XS-split-00003-of-00009.gguf | LFS, Q3 | 14.15 GB |
| grok-1-IQ3_XS-split-00004-of-00009.gguf | LFS, Q3 | 14.55 GB |
| grok-1-IQ3_XS-split-00005-of-00009.gguf | LFS, Q3 | 14.37 GB |
| grok-1-IQ3_XS-split-00006-of-00009.gguf | LFS, Q3 | 14.15 GB |
| grok-1-IQ3_XS-split-00007-of-00009.gguf | LFS, Q3 | 14.15 GB |
| grok-1-IQ3_XS-split-00008-of-00009.gguf | LFS, Q3 | 14.89 GB |
| grok-1-IQ3_XS-split-00009-of-00009.gguf | LFS, Q3 | 3.9 GB |
| grok-1-Q2_K-split-00001-of-00009.gguf | LFS, Q2 | 13.87 GB |
| grok-1-Q2_K-split-00002-of-00009.gguf | LFS, Q2 | 12.87 GB |
| grok-1-Q2_K-split-00003-of-00009.gguf | LFS, Q2 | 12.87 GB |
| grok-1-Q2_K-split-00004-of-00009.gguf | LFS, Q2 | 13.25 GB |
| grok-1-Q2_K-split-00005-of-00009.gguf | LFS, Q2 | 13.07 GB |
| grok-1-Q2_K-split-00006-of-00009.gguf | LFS, Q2 | 12.89 GB |
| grok-1-Q2_K-split-00007-of-00009.gguf | LFS, Q2 | 12.87 GB |
| grok-1-Q2_K-split-00008-of-00009.gguf | LFS, Q2 | 13.17 GB |
| grok-1-Q2_K-split-00009-of-00009.gguf | LFS, Q2 | 3.35 GB |
| grok-1-Q4_K-split-00001-of-00009.gguf | LFS, Q4 | 24.11 GB |
| grok-1-Q4_K-split-00002-of-00009.gguf | LFS, Q4 | 20.73 GB |
| grok-1-Q4_K-split-00003-of-00009.gguf | LFS, Q4 | 21.02 GB |
| grok-1-Q4_K-split-00004-of-00009.gguf | LFS, Q4 | 21.21 GB |
| grok-1-Q4_K-split-00005-of-00009.gguf | LFS, Q4 | 21.34 GB |
| grok-1-Q4_K-split-00006-of-00009.gguf | LFS, Q4 | 20.83 GB |
| grok-1-Q4_K-split-00007-of-00009.gguf | LFS, Q4 | 20.83 GB |
| grok-1-Q4_K-split-00008-of-00009.gguf | LFS, Q4 | 23.04 GB |
| grok-1-Q4_K-split-00009-of-00009.gguf | LFS, Q4 | 5.95 GB |
| grok-1-Q6_K-split-00001-of-00009.gguf | LFS, Q6 | 30.4 GB |
| grok-1-Q6_K-split-00002-of-00009.gguf | LFS, Q6 | 28.86 GB |
| grok-1-Q6_K-split-00003-of-00009.gguf | LFS, Q6 | 28.86 GB |
| grok-1-Q6_K-split-00004-of-00009.gguf | LFS, Q6 | 29.72 GB |
| grok-1-Q6_K-split-00005-of-00009.gguf | LFS, Q6 | 29.32 GB |
| grok-1-Q6_K-split-00006-of-00009.gguf | LFS, Q6 | 28.86 GB |
| grok-1-Q6_K-split-00007-of-00009.gguf | LFS, Q6 | 28.86 GB |
| grok-1-Q6_K-split-00008-of-00009.gguf | LFS, Q6 | 29.56 GB |
| grok-1-Q6_K-split-00009-of-00009.gguf | LFS, Q6 | 7.52 GB |