---
base_model:
  - ArliAI/GLM-4.5-Air-Derestricted
base_model_relation: quantized
quantized_by: ddh0
license: mit
---

# GLM-4.5-Air-Derestricted-GGUF

This repository contains several custom GGUF quantizations of ArliAI/GLM-4.5-Air-Derestricted, to be used with llama.cpp.

The naming scheme for these custom quantizations is as follows:

`ModelName-DefaultType-FFN-UpType-GateType-DownType.gguf`

Where `DefaultType` refers to the default tensor type, and `UpType`, `GateType`, and `DownType` refer to the tensor types used for the `ffn_up_exps`, `ffn_gate_exps`, and `ffn_down_exps` tensors, respectively.
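The naming scheme above can be decoded mechanically. Here is a small sketch of such a parser (a hypothetical helper, not part of this repo or llama.cpp):

```python
def parse_quant_filename(filename: str) -> dict:
    """Parse a filename of the form
    ModelName-DefaultType-FFN-UpType-GateType-DownType.gguf
    into its parts. When there is no -FFN- override segment, all
    three expert tensor types fall back to the default type."""
    stem = filename.removesuffix(".gguf")
    if "-FFN-" in stem:
        prefix, ffn = stem.split("-FFN-")
        up, gate, down = ffn.split("-")
        model, default = prefix.rsplit("-", 1)
        return {"model": model, "default": default,
                "ffn_up_exps": up, "ffn_gate_exps": gate,
                "ffn_down_exps": down}
    model, default = stem.rsplit("-", 1)
    return {"model": model, "default": default,
            "ffn_up_exps": default, "ffn_gate_exps": default,
            "ffn_down_exps": default}
```

For example, `GLM-4.5-Air-Derestricted-Q8_0-FFN-IQ4_XS-IQ4_XS-Q5_0.gguf` parses to default `Q8_0` with `ffn_up_exps`/`ffn_gate_exps` at `IQ4_XS` and `ffn_down_exps` at `Q5_0`.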

## Quantizations

These quantizations use Q8_0 for all tensors by default, including the dense FFN block. Only the conditional experts are downgraded; the shared expert is always kept at Q8_0. They were quantized using my own imatrix (the calibration text corpus can be found here).
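A mix like this can be reproduced with recent llama.cpp builds, which let `llama-quantize` override the type of tensors matching a name pattern. The exact flag spelling and paths below are assumptions; check `llama-quantize --help` on your build:

```shell
# Sketch: Q8_0 default, with only the conditional expert tensors
# downgraded via per-tensor overrides. The shared-expert tensors
# (ffn_*_shexp) do not match these patterns, so they stay at the
# Q8_0 default. Paths and imatrix file are placeholders.
./llama-quantize \
    --imatrix imatrix.dat \
    --tensor-type ffn_up_exps=iq4_xs \
    --tensor-type ffn_gate_exps=iq4_xs \
    --tensor-type ffn_down_exps=q5_0 \
    GLM-4.5-Air-Derestricted-bf16.gguf \
    GLM-4.5-Air-Derestricted-Q8_0-FFN-IQ4_XS-IQ4_XS-Q5_0.gguf \
    Q8_0
```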

| Filename | Size (GB) | Size (GiB) | Average BPW | Direct link |
|---|---|---|---|---|
| GLM-4.5-Air-Derestricted-Q8_0-FFN-IQ4_XS-IQ4_XS-Q5_0.gguf (recommended) | 68.63 | 63.92 | 4.97 | Download |
| GLM-4.5-Air-Derestricted-Q8_0-FFN-Q5_K-Q5_K-Q8_0.gguf | 91.97 | 85.66 | 6.66 | Download |
| GLM-4.5-Air-Derestricted-Q8_0-FFN-Q6_K-Q6_K-Q8_0.gguf | 100.99 | 94.06 | 7.31 | Download |
| GLM-4.5-Air-Derestricted-Q8_0.gguf | 117.45 | 109.38 | 8.51 | Download |
| GLM-4.5-Air-Derestricted-bf16.gguf | 220.98 | 205.81 | 16.00 | Download 1/2 Β· Download 2/2 |
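Any of these files can be run directly with llama.cpp. A minimal invocation sketch (filename, context size, and GPU layer count are placeholders to adjust for your hardware):

```shell
# Run the recommended quant with llama.cpp's llama-cli.
./llama-cli \
    -m GLM-4.5-Air-Derestricted-Q8_0-FFN-IQ4_XS-IQ4_XS-Q5_0.gguf \
    -ngl 99 -c 8192 \
    -p "Hello"

# For the split bf16 files, point -m at the first shard
# (GLM-4.5-Air-Derestricted-bf16-00001-of-00002.gguf);
# llama.cpp picks up the remaining shards automatically.
```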
