mradermacher/translategemma-4b-it-GGUF

Name: mradermacher/translategemma-4b-it-GGUF
Author: mradermacher

High-quality GGUF model

28.4K 📥 Downloads

19 ❤️ Likes

14 📁 GGUF Files

34.71 GB 💾 Total Size

2 months ago 🔄 Last Updated

📋 Model Description

base_model: google/translategemma-4b-it extragatedbutton_content: Acknowledge license extragatedheading: Access Gemma on Hugging Face extragatedprompt: To access Gemma on Hugging Face, you’re required to review and agree to Google’s usage license. To do this, please ensure you’re logged in to Hugging Face and click below. Requests are processed immediately. language:

library_name: transformers license: gemma mradermacher: readme_rev: 1 quantized_by: mradermacher

About

static quants of https://huggingface.co/google/translategemma-4b-it

For a convenient overview and download list, visit our model page for this model.

weighted/imatrix quants are available at https://huggingface.co/mradermacher/translategemma-4b-it-i1-GGUF

Usage

If you are unsure how to use GGUF files, refer to one of TheBloke's
READMEs for
more details, including on how to concatenate multi-part files.

Provided Quants

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Link	Type	Size/GB	Notes
GGUF	mmproj-Q8_0	0.7	multi-modal supplement
GGUF	mmproj-f16	1.0	multi-modal supplement
GGUF	Q2_K	1.8
GGUF	Q3K_S	2.0
GGUF	Q3K_M	2.2	lower quality
GGUF	Q3K_L	2.3
GGUF	IQ4_XS	2.4
GGUF	Q4K_S	2.5	fast, recommended
GGUF	Q4K_M	2.6	fast, recommended
GGUF	Q5K_S	2.9
GGUF	Q5K_M	2.9
GGUF	Q6_K	3.3	very good quality
GGUF	Q8_0	4.2	fast, best quality
GGUF	f16	7.9	16 bpw, overkill

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

!image.png

And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9

FAQ / Model Request

See https://huggingface.co/mradermacher/model_requests for some answers to
questions you might have and/or if you want some other model quantized.

Thanks

I thank my company, nethype GmbH, for letting
me use its servers and providing upgrades to my workstation to enable
this work in my free time.

📂 GGUF File List

📁 Filename	📦 Size	⚡ Download
translategemma-4b-it.IQ4_XS.gguf LFS Q4	2.12 GB	Download
translategemma-4b-it.Q2_K.gguf LFS Q2	1.61 GB	Download
translategemma-4b-it.Q3_K_L.gguf LFS Q3	2.08 GB	Download
translategemma-4b-it.Q3_K_M.gguf LFS Q3	1.95 GB	Download
translategemma-4b-it.Q3_K_S.gguf LFS Q3	1.8 GB	Download
translategemma-4b-it.Q4_K_M.gguf Recommended LFS Q4	2.32 GB	Download
translategemma-4b-it.Q4_K_S.gguf LFS Q4	2.21 GB	Download
translategemma-4b-it.Q5_K_M.gguf LFS Q5	2.64 GB	Download
translategemma-4b-it.Q5_K_S.gguf LFS Q5	2.57 GB	Download
translategemma-4b-it.Q6_K.gguf LFS Q6	2.97 GB	Download
translategemma-4b-it.Q8_0.gguf LFS Q8	3.85 GB	Download
translategemma-4b-it.f16.gguf LFS FP16	7.23 GB	Download
translategemma-4b-it.mmproj-Q8_0.gguf LFS Q8	563.98 MB	Download
translategemma-4b-it.mmproj-f16.gguf LFS FP16	811.82 MB	Download

📊 Model Information

🆔 Model ID: mradermacher/translategemma-4b-it-GGUF

📅 Created: 2 months ago

🔄 Last Updated: 2 months ago

📥 Downloads: 28.4K

❤️ Likes: 19

🎯 Difficulty: Intermediate

⚙️ Quantization: Q4, Q2, Q3, Q5, Q6, Q8, FP16

🏷️ Tags

transformersggufenbase_model:google/translategemma-4b-itbase_model:quantized:google/translategemma-4b-itlicense:gemmaendpoints_compatibleregion:usconversational

🔗 Related Links

🤗 Visit HuggingFace ⚡ Quick Download