📋 Model Description


base_model: google/t5-v1_1-xxl
library_name: gguf
license: apache-2.0
quantized_by: city96
language: en

This is a GGUF conversion of Google's T5 v1.1 XXL encoder model.

The weights can be used with llama.cpp's ./llama-embedding tool, or with the ComfyUI-GGUF custom node alongside image generation models.
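As a rough illustration of the ./llama-embedding route, the sketch below assembles the command line from Python. The `-m` (model file) and `-p` (prompt) options are standard llama.cpp flags; the helper name `build_embedding_command`, the binary location, the local file path, and the prompt are assumptions made for this example, not part of the original card.

```python
import shlex
import subprocess
from pathlib import Path


def build_embedding_command(model_path, prompt, binary="./llama-embedding"):
    """Return the argument list for a llama-embedding run.

    -m selects the GGUF model file and -p passes the prompt text.
    The default binary path is an assumption about where llama.cpp was built.
    """
    return [binary, "-m", str(model_path), "-p", prompt]


if __name__ == "__main__":
    # Hypothetical local path: point this at whichever file you downloaded.
    model = Path("t5-v1_1-xxl-encoder-Q5_K_M.gguf")
    cmd = build_embedding_command(model, "a photo of a cat")

    # Show the equivalent shell command.
    print(shlex.join(cmd))

    # Only run if both the binary and the model file are actually present.
    if model.exists() and Path(cmd[0]).exists():
        subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single string avoids shell-quoting problems when the prompt contains spaces or quotes.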

This is a non-imatrix quant, as llama.cpp doesn't support imatrix creation for T5 models at the time of writing. It's therefore recommended to use Q5_K_M or larger for the best results, although smaller quants may still provide decent results in resource-constrained scenarios.
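One way to apply this guidance is to pick the largest file that fits a given memory budget and then check whether it reaches the Q5_K_M threshold. The sketch below uses the file sizes from this card's file list; the function names and the `headroom_gb` margin are hypothetical, and file size is only a rough proxy for actual memory use at load time.

```python
# File sizes in GB, copied from the file list in this card.
QUANT_SIZES_GB = {
    "Q3_K_S": 1.96, "Q3_K_M": 2.14, "Q3_K_L": 2.29,
    "Q4_K_S": 2.55, "Q4_K_M": 2.7,
    "Q5_K_S": 3.07, "Q5_K_M": 3.15,
    "Q6_K": 3.64, "Q8_0": 4.71,
    "f16": 8.87, "f32": 17.74,
}


def pick_quant(budget_gb, headroom_gb=0.5):
    """Return the largest quant whose file fits in the budget, or None.

    headroom_gb is an assumed safety margin for runtime overhead.
    """
    usable = budget_gb - headroom_gb
    fitting = [(size, name) for name, size in QUANT_SIZES_GB.items() if size <= usable]
    if not fitting:
        return None
    # max() compares by size first, so this is the largest file that fits.
    return max(fitting)[1]


def meets_recommendation(name):
    """True if the quant is Q5_K_M or larger, judged by file size."""
    return QUANT_SIZES_GB[name] >= QUANT_SIZES_GB["Q5_K_M"]


if __name__ == "__main__":
    for budget in (2.0, 3.0, 4.0, 6.0):
        choice = pick_quant(budget)
        if choice is None:
            print(f"{budget} GB: nothing fits")
        else:
            print(f"{budget} GB: {choice}, recommended tier: {meets_recommendation(choice)}")
```

If the helper returns a quant below the recommended tier, that is the resource-constrained case described above: usable, but with some expected loss in quality.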

📂 GGUF File List

πŸ“ Filename πŸ“¦ Size ⚑ Download
t5-v1_1-xxl-encoder-Q3_K_L.gguf
LFS Q3
2.29 GB Download
t5-v1_1-xxl-encoder-Q3_K_M.gguf
LFS Q3
2.14 GB Download
t5-v1_1-xxl-encoder-Q3_K_S.gguf
LFS Q3
1.96 GB Download
t5-v1_1-xxl-encoder-Q4_K_M.gguf
Recommended LFS Q4
2.7 GB Download
t5-v1_1-xxl-encoder-Q4_K_S.gguf
LFS Q4
2.55 GB Download
t5-v1_1-xxl-encoder-Q5_K_M.gguf
LFS Q5
3.15 GB Download
t5-v1_1-xxl-encoder-Q5_K_S.gguf
LFS Q5
3.07 GB Download
t5-v1_1-xxl-encoder-Q6_K.gguf
LFS Q6
3.64 GB Download
t5-v1_1-xxl-encoder-Q8_0.gguf
LFS Q8
4.71 GB Download
t5-v1_1-xxl-encoder-f16.gguf
LFS FP16
8.87 GB Download
t5-v1_1-xxl-encoder-f32.gguf
LFS
17.74 GB Download
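The listed sizes can be turned into a rough bits-per-weight figure by comparing each file against the f16 file, which stores weights at about 16 bits each. This is only an estimate under that assumption: GGUF files also carry metadata, and some tensors may be kept at a different precision. The function name is hypothetical.

```python
# Size of the f16 file in GB, taken from the file list in this card.
F16_SIZE_GB = 8.87


def approx_bits_per_weight(size_gb, f16_size_gb=F16_SIZE_GB):
    """Estimate average bits per weight, assuming the f16 file is ~16 bits/weight."""
    return 16.0 * size_gb / f16_size_gb


if __name__ == "__main__":
    # Sizes in GB for a few of the listed files.
    for name, size in [("Q4_K_M", 2.7), ("Q5_K_M", 3.15), ("Q6_K", 3.64), ("Q8_0", 4.71)]:
        print(f"{name}: about {approx_bits_per_weight(size):.2f} bits per weight")
```

Read this way, the size column doubles as a quick guide to how aggressively each file is compressed relative to the f16 baseline.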