---
license: apache-2.0
tags:
  - gguf
  - qwen
  - qwen3
  - qwen3-coder
  - qwen3-coder-30B
  - qwen3-coder-30B-gguf
  - llama.cpp
  - quantized
  - text-generation
  - reasoning
  - agent
  - multilingual
base_model: Qwen/Qwen3-Coder-30B-A3B-Instruct
author: geoffmunn
pipeline_tag: text-generation
language:
  - en
  - zh
  - es
  - fr
  - de
  - ru
  - ar
  - ja
  - ko
  - hi
---

# Qwen3-Coder-30B-A3B-Instruct-f16-GGUF

## πŸ“‹ Model Description

This is a GGUF-quantized version of the Qwen/Qwen3-Coder-30B-A3B-Instruct language model.

Converted for use with llama.cpp, LM Studio, OpenWebUI, GPT4All, and more.

## πŸ’‘ Key Features

Qwen3-Coder-30B-A3B-Instruct is a Mixture-of-Experts coding model (~30B total parameters, ~3B active per token, hence "A3B") tuned for code generation, agentic tool use, and multilingual instruction following.

## Available Quantizations (from f16)

| Level | Quality | Speed | Size | Recommendation |
|-------|---------|-------|------|----------------|
| Q2_K | Minimal | ⚑ Fast | 11.30 GB | Only on severely memory-constrained systems. |
| Q3_K_S | Low-Medium | ⚑ Fast | 13.30 GB | Minimal viability; avoid unless space-limited. |
| Q3_K_M | Low-Medium | ⚑ Fast | 14.70 GB | Acceptable for basic interaction. |
| Q4_K_S | Practical | ⚑ Fast | 17.50 GB | Good balance for mobile/embedded platforms. |
| Q4_K_M | Practical | ⚑ Fast | 18.60 GB | Best overall choice for most users. |
| Q5_K_S | Max Reasoning | 🐒 Medium | 21.10 GB | Slight quality gain; good for testing. |
| Q5_K_M | Max Reasoning | 🐒 Medium | 21.70 GB | Best quality available. Recommended. |
| Q6_K | Near-FP16 | 🐌 Slow | 25.10 GB | Diminishing returns. Only if RAM allows. |
| Q8_0 | Lossless* | 🐌 Slow | 32.50 GB | Maximum fidelity. Ideal for archival. |

## πŸ’‘ Recommendations by Use Case

- πŸ’» Standard Laptop (i5/M1 Mac): Q5_K_M (optimal quality)
- 🧠 Reasoning, Coding, Math: Q5_K_M or Q6_K
- πŸ” RAG, Retrieval, Precision Tasks: Q6_K or Q8_0
- πŸ€– Agent & Tool Integration: Q5_K_M
- πŸ› οΈ Development & Testing: Test from Q4_K_M up to Q8_0

## Usage

Load this model using:

- OpenWebUI – self-hosted AI interface with RAG & tools
- LM Studio – desktop app with GPU support
- GPT4All – private, offline AI chatbot
- Or directly via llama.cpp

Each quantized model includes its own README.md and shares a common MODELFILE.
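If you prefer scripting instead of a GUI, a minimal sketch using llama-cpp-python (an assumption; any GGUF-capable loader works, and the filename below is the recommended quant from the file list):

```python
# Hedged sketch: load a downloaded quant with llama-cpp-python (assumed
# installed via `pip install llama-cpp-python`). Guarded so it only runs
# the model when the file is actually present locally.
import os

MODEL = "Qwen3-Coder-30B-A3B-Instruct-f16:Q4_K_M.gguf"

if os.path.exists(MODEL):
    from llama_cpp import Llama

    llm = Llama(model_path=MODEL, n_ctx=4096)
    out = llm.create_chat_completion(
        messages=[{"role": "user", "content": "Write a Python hello world."}]
    )
    print(out["choices"][0]["message"]["content"])
else:
    print(f"download {MODEL} first")
```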

## Author

πŸ‘€ Geoff Munn (@geoffmunn)
πŸ”— Hugging Face Profile

## Disclaimer

This is a community conversion for local inference. Not affiliated with Alibaba Cloud or the Qwen team.

## πŸ“‚ GGUF File List

πŸ“ Filename πŸ“¦ Size ⚑ Download
Qwen3-Coder-30B-A3B-Instruct-f16-imatrix-4697-coder.gguf
LFS FP16
116.38 MB Download
Qwen3-Coder-30B-A3B-Instruct-f16-imatrix-4697-generic.gguf
LFS FP16
116.38 MB Download
Qwen3-Coder-30B-A3B-Instruct-f16-imatrix:Q3_K_HIFI.gguf
LFS Q3
19.05 GB Download
Qwen3-Coder-30B-A3B-Instruct-f16-imatrix:Q3_K_M.gguf
LFS Q3
17.28 GB Download
Qwen3-Coder-30B-A3B-Instruct-f16-imatrix:Q3_K_S.gguf
LFS Q3
16.26 GB Download
Qwen3-Coder-30B-A3B-Instruct-f16:Q2_K.gguf
LFS Q2
10.49 GB Download
Qwen3-Coder-30B-A3B-Instruct-f16:Q3_K_HIFI.gguf
LFS Q3
15.69 GB Download
Qwen3-Coder-30B-A3B-Instruct-f16:Q3_K_M.gguf
LFS Q3
13.7 GB Download
Qwen3-Coder-30B-A3B-Instruct-f16:Q3_K_S.gguf
LFS Q3
12.38 GB Download
Qwen3-Coder-30B-A3B-Instruct-f16:Q4_K_HIFI.gguf
LFS Q4
19.05 GB Download
Qwen3-Coder-30B-A3B-Instruct-f16:Q4_K_M.gguf
Recommended LFS Q4
17.28 GB Download
Qwen3-Coder-30B-A3B-Instruct-f16:Q4_K_S.gguf
LFS Q4
16.26 GB Download
Qwen3-Coder-30B-A3B-Instruct-f16:Q5_K_M.gguf
LFS Q5
20.23 GB Download
Qwen3-Coder-30B-A3B-Instruct-f16:Q5_K_S.gguf
LFS Q5
19.63 GB Download
Qwen3-Coder-30B-A3B-Instruct-f16:Q6_K.gguf
LFS Q6
23.37 GB Download
Qwen3-Coder-30B-A3B-Instruct-f16:Q8_0.gguf
LFS Q8
30.25 GB Download
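After downloading, you can sanity-check that a file really is GGUF by reading its header: the format starts with the 4-byte magic `GGUF` followed by a little-endian uint32 version. A minimal sketch (the synthetic file below is only for demonstration):

```python
import os
import struct
import tempfile

def read_gguf_header(path: str) -> tuple[str, int]:
    """Read the 4-byte magic and uint32 version from a GGUF file."""
    with open(path, "rb") as f:
        magic = f.read(4)
        if magic != b"GGUF":
            raise ValueError(f"not a GGUF file: magic={magic!r}")
        (version,) = struct.unpack("<I", f.read(4))
    return magic.decode("ascii"), version

# Demonstrate with a synthetic header (current GGUF files use version 3):
with tempfile.NamedTemporaryFile(delete=False, suffix=".gguf") as tmp:
    tmp.write(b"GGUF" + struct.pack("<I", 3))
    path = tmp.name
print(read_gguf_header(path))  # -> ('GGUF', 3)
os.unlink(path)
```

A truncated or interrupted download will usually fail this check immediately, before you spend time loading a 17+ GB file.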