AesSedai/Qwen3.5-397B-A17B-GGUF

Name: AesSedai/Qwen3.5-397B-A17B-GGUF
Author: AesSedai

High-quality GGUF model

3.7K 📥 Downloads

16 ❤️ Likes

4 📁 GGUF Files

3.99 GB 💾 Total Size

1 day ago 🔄 Last Updated

📋 Model Description

base_model:

Qwen/Qwen3.5-397B-A17B

This repo contains specialized MoE-quants for Qwen3.5-397B-A17B. The idea being that given the huge size of the FFN tensors compared to the rest of the tensors in the model,
it should be possible to achieve a better quality while keeping the overall size of the entire model smaller compared to a similar naive quantization.
To that end, the quantization type default is kept in high quality and the FFN UP + FFN GATE tensors are quanted down along with the FFN DOWN tensors.

Quant	Size	Mixture	PPL	1-(Mean PPL(Q)/PPL(base))	KLD
Q5KM	273.49 GiB (5.93 BPW)	Q80 / Q5K / Q5K / Q6K	4.617400 ± 0.057235	+0.0156%	0.002553 ± 0.000078
Q5KS	257.55 GiB (5.58 BPW)	Q80 / Q5K / Q5K / Q5K	4.620864 ± 0.057279	+0.0907%	0.002903 ± 0.000085
Q4KM	227.55 GiB (4.93 BPW)	Q80 / Q4K / Q4K / Q5K	4.624688 ± 0.057341	+0.1735%	0.004496 ± 0.000117
IQ4XS	176.92 GiB (3.83 BPW)	Q80 / IQ3S / IQ3S / IQ4_XS	4.653226 ± 0.057738	+0.7916%	0.011963 ± 0.000309
IQ3S	136.31 GiB (2.95 BPW)	Q6K / IQ2S / IQ2S / IQ3_S	4.745153 ± 0.059208	+2.7828%	0.033163 ± 0.000791

!kldgraph !pplgraph

📂 GGUF File List

📁 Filename	📦 Size	⚡ Download
mmproj-Qwen3.5-397B-A17B-BF16.gguf Recommended LFS FP16	879.01 MB	Download
mmproj-Qwen3.5-397B-A17B-F16.gguf LFS FP16	875.63 MB	Download
mmproj-Qwen3.5-397B-A17B-F32.gguf LFS	1.7 GB	Download
mmproj-Qwen3.5-397B-A17B-Q8_0.gguf LFS Q8	595.31 MB	Download

📊 Model Information

🆔 Model ID: AesSedai/Qwen3.5-397B-A17B-GGUF

📅 Created: 2 weeks ago

🔄 Last Updated: 1 day ago

📥 Downloads: 3.7K

❤️ Likes: 16

🎯 Difficulty: Beginner

⚙️ Quantization: FP16, Q8

🏷️ Tags

ggufbase_model:Qwen/Qwen3.5-397B-A17Bbase_model:quantized:Qwen/Qwen3.5-397B-A17Bendpoints_compatibleregion:usimatrixconversational

🔗 Related Links

🤗 Visit HuggingFace ⚡ Quick Download