π Model Description
basemodel: google/t5-v11-xxl library_name: gguf license: apache-2.0 quantized_by: city96 language: en
This is a GGUF conversion of Google's T5 v1.1 XXL encoder model.
The weights can be used with ./llama-embedding
or with the ComfyUI-GGUF custom node together with image generation models.
This is a non imatrix quant as llama.cpp doesn't support imatrix creation for T5 models at the time of writing. It's therefore recommended to use Q5KM or larger for the best results, although smaller models may also still provide decent results in resource constrained scenarios.
π GGUF File List
π Filename | π¦ Size | β‘ Download |
---|---|---|
t5-v1_1-xxl-encoder-Q3_K_L.gguf
LFS
Q3
|
2.29 GB | Download |
t5-v1_1-xxl-encoder-Q3_K_M.gguf
LFS
Q3
|
2.14 GB | Download |
t5-v1_1-xxl-encoder-Q3_K_S.gguf
LFS
Q3
|
1.96 GB | Download |
t5-v1_1-xxl-encoder-Q4_K_M.gguf
Recommended
LFS
Q4
|
2.7 GB | Download |
t5-v1_1-xxl-encoder-Q4_K_S.gguf
LFS
Q4
|
2.55 GB | Download |
t5-v1_1-xxl-encoder-Q5_K_M.gguf
LFS
Q5
|
3.15 GB | Download |
t5-v1_1-xxl-encoder-Q5_K_S.gguf
LFS
Q5
|
3.07 GB | Download |
t5-v1_1-xxl-encoder-Q6_K.gguf
LFS
Q6
|
3.64 GB | Download |
t5-v1_1-xxl-encoder-Q8_0.gguf
LFS
Q8
|
4.71 GB | Download |
t5-v1_1-xxl-encoder-f16.gguf
LFS
FP16
|
8.87 GB | Download |
t5-v1_1-xxl-encoder-f32.gguf
LFS
|
17.74 GB | Download |