πŸ“‹ Model Description


base_model: google/umt5-xxl
library_name: gguf
license: apache-2.0
quantized_by: city96
language: en

This is a GGUF conversion of Google's UMT5 xxl model, specifically the encoder part.

The weights can be used with ./llama-embedding, or loaded through the ComfyUI-GGUF custom node for use alongside image/video generation models.
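As a rough sketch, a minimal llama.cpp invocation might look like the following (the model path here assumes the Q5_K_M file from the list further down; exact flags can vary between llama.cpp builds):

```sh
# Sketch: produce embeddings for a prompt with llama.cpp's llama-embedding tool.
# Swap in whichever quant file you downloaded.
./llama-embedding \
  -m umt5-xxl-encoder-Q5_K_M.gguf \
  -p "a cat sitting on a windowsill at sunset"
```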

This is a non-imatrix quant, as llama.cpp does not support imatrix creation for T5 models at the time of writing. It is therefore recommended to use Q5_K_M or larger for best results, although smaller quants may still give decent results in resource-constrained scenarios.

πŸ“‚ GGUF File List

πŸ“ Filename πŸ“¦ Size ⚑ Download
umt5-xxl-encoder-F16.gguf
LFS FP16
10.59 GB Download
umt5-xxl-encoder-F32.gguf
LFS
21.17 GB Download
umt5-xxl-encoder-Q3_K_M.gguf
LFS Q3
2.85 GB Download
umt5-xxl-encoder-Q3_K_S.gguf
LFS Q3
2.66 GB Download
umt5-xxl-encoder-Q4_K_M.gguf
Recommended LFS Q4
3.4 GB Download
umt5-xxl-encoder-Q4_K_S.gguf
LFS Q4
3.26 GB Download
umt5-xxl-encoder-Q5_K_M.gguf
LFS Q5
3.86 GB Download
umt5-xxl-encoder-Q5_K_S.gguf
LFS Q5
3.77 GB Download
umt5-xxl-encoder-Q6_K.gguf
LFS Q6
4.35 GB Download
umt5-xxl-encoder-Q8_0.gguf
LFS Q8
5.63 GB Download