π Model Description
license: apache-2.0
Grok-1 GGUF Quantizations
This repository contains unofficial GGUF Quantizations of Grok-1, compatible with llama.cpp
as of PR- Add grok-1 support #6204.
Updates
#### Native Split Support in llama.cpp
- The splits have been updated to utilize the improvements from PR: llamamodel_loader: support multiple split/shard GGUFs. As a result, manual merging with
gguf-split
is no longer required.
With this, there is no need to merge the split files before use. Just download all splits and run llama.cpp with the first split like you would previously. It'll detect the other splits and load them as well.
#### Direct Split Download from huggingface using llama.cpp
- Thanks to a new PR common: llamaloadmodelfrom_url split support #6192 from phymbert it's now possible load model splits from url.
That means this downloads and runs the model:
server \
--hf-repo Arki05/Grok-1-GGUF \
--hf-file grok-1-IQ3_XS-split-00001-of-00009.gguf \
--model models/grok-1-IQ3_XS-split-00001-of-00009.gguf \
-ngl 999
And that is very cool (@phymbert)
Available Quantizations
The following Quantizations are currently available for download:
Quant | Split Files | Size |
---|---|---|
Q2K | 1-of-9, 2-of-9, 3-of-9, 4-of-9, 5-of-9, 6-of-9, 7-of-9, 8-of-9, 9-of-9 | 112.4 GB |
IQ3XS | 1-of-9, 2-of-9, 3-of-9, 4-of-9, 5-of-9, 6-of-9, 7-of-9, 8-of-9, 9-of-9 | 125.4 GB |
Q4K | 1-of-9, 2-of-9, 3-of-9, 4-of-9, 5-of-9, 6-of-9, 7-of-9, 8-of-9, 9-of-9 | 186.0 GB |
Q6K | 1-of-9, 2-of-9, 3-of-9, 4-of-9, 5-of-9, 6-of-9, 7-of-9, 8-of-9, 9-of-9 | 259.8 GB |
IQ3_XS
version for now.
More Quantizations will be uploaded soon. All current Quants are created without any importance matrix.
π GGUF File List
π Filename | π¦ Size | β‘ Download |
---|---|---|
grok-1-IQ3_XS-split-00001-of-00009.gguf
Recommended
LFS
Q3
|
16.33 GB | Download |
grok-1-IQ3_XS-split-00002-of-00009.gguf
LFS
Q3
|
14.25 GB | Download |
grok-1-IQ3_XS-split-00003-of-00009.gguf
LFS
Q3
|
14.15 GB | Download |
grok-1-IQ3_XS-split-00004-of-00009.gguf
LFS
Q3
|
14.55 GB | Download |
grok-1-IQ3_XS-split-00005-of-00009.gguf
LFS
Q3
|
14.37 GB | Download |
grok-1-IQ3_XS-split-00006-of-00009.gguf
LFS
Q3
|
14.15 GB | Download |
grok-1-IQ3_XS-split-00007-of-00009.gguf
LFS
Q3
|
14.15 GB | Download |
grok-1-IQ3_XS-split-00008-of-00009.gguf
LFS
Q3
|
14.89 GB | Download |
grok-1-IQ3_XS-split-00009-of-00009.gguf
LFS
Q3
|
3.9 GB | Download |
grok-1-Q2_K-split-00001-of-00009.gguf
LFS
Q2
|
13.87 GB | Download |
grok-1-Q2_K-split-00002-of-00009.gguf
LFS
Q2
|
12.87 GB | Download |
grok-1-Q2_K-split-00003-of-00009.gguf
LFS
Q2
|
12.87 GB | Download |
grok-1-Q2_K-split-00004-of-00009.gguf
LFS
Q2
|
13.25 GB | Download |
grok-1-Q2_K-split-00005-of-00009.gguf
LFS
Q2
|
13.07 GB | Download |
grok-1-Q2_K-split-00006-of-00009.gguf
LFS
Q2
|
12.89 GB | Download |
grok-1-Q2_K-split-00007-of-00009.gguf
LFS
Q2
|
12.87 GB | Download |
grok-1-Q2_K-split-00008-of-00009.gguf
LFS
Q2
|
13.17 GB | Download |
grok-1-Q2_K-split-00009-of-00009.gguf
LFS
Q2
|
3.35 GB | Download |
grok-1-Q4_K-split-00001-of-00009.gguf
LFS
Q4
|
24.11 GB | Download |
grok-1-Q4_K-split-00002-of-00009.gguf
LFS
Q4
|
20.73 GB | Download |
grok-1-Q4_K-split-00003-of-00009.gguf
LFS
Q4
|
21.02 GB | Download |
grok-1-Q4_K-split-00004-of-00009.gguf
LFS
Q4
|
21.21 GB | Download |
grok-1-Q4_K-split-00005-of-00009.gguf
LFS
Q4
|
21.34 GB | Download |
grok-1-Q4_K-split-00006-of-00009.gguf
LFS
Q4
|
20.83 GB | Download |
grok-1-Q4_K-split-00007-of-00009.gguf
LFS
Q4
|
20.83 GB | Download |
grok-1-Q4_K-split-00008-of-00009.gguf
LFS
Q4
|
23.04 GB | Download |
grok-1-Q4_K-split-00009-of-00009.gguf
LFS
Q4
|
5.95 GB | Download |
grok-1-Q6_K-split-00001-of-00009.gguf
LFS
Q6
|
30.4 GB | Download |
grok-1-Q6_K-split-00002-of-00009.gguf
LFS
Q6
|
28.86 GB | Download |
grok-1-Q6_K-split-00003-of-00009.gguf
LFS
Q6
|
28.86 GB | Download |
grok-1-Q6_K-split-00004-of-00009.gguf
LFS
Q6
|
29.72 GB | Download |
grok-1-Q6_K-split-00005-of-00009.gguf
LFS
Q6
|
29.32 GB | Download |
grok-1-Q6_K-split-00006-of-00009.gguf
LFS
Q6
|
28.86 GB | Download |
grok-1-Q6_K-split-00007-of-00009.gguf
LFS
Q6
|
28.86 GB | Download |
grok-1-Q6_K-split-00008-of-00009.gguf
LFS
Q6
|
29.56 GB | Download |
grok-1-Q6_K-split-00009-of-00009.gguf
LFS
Q6
|
7.52 GB | Download |