πŸ“‹ Model Description


license: mit tags: - uncensored - glm4 - moe language: - en - zh

GLM-4.7-Flash-Uncensored-HauhauCS-Balanced

GLM-4.7 Flash uncensored by HauhauCS.

About

No changes to datasets or capabilities. Fully functional, 100% of what the original authors intended - just without the refusals.

These are meant to be the best lossless uncensored models out there.

Agentic Coding

If you're doing agentic coding, use the Balanced variants. Good balance between capability and not refusing everything.

Downloads

| File | Quant | Size |
|------|-------|------|
| GLM-4.7-Flash-Uncensored-HauhauCS-Balanced-FP16.gguf | FP16 | 56 GB |
| GLM-4.7-Flash-Uncensored-HauhauCS-Balanced-Q80.gguf | Q80 | 30 GB |
| GLM-4.7-Flash-Uncensored-HauhauCS-Balanced-Q6K.gguf | Q6K | 23 GB |
| GLM-4.7-Flash-Uncensored-HauhauCS-Balanced-Q4KM.gguf | Q4KM | 17 GB |

Specs

Recommended Settings

From the official Z.ai authors:

General use:

  • --temp 1.0 --top-p 0.95

Tool-calling / agentic:

  • --temp 0.7 --top-p 1.0

Important:

  • Disable repeat penalty (or --repeat-penalty 1.0)
  • For llama.cpp: use --min-p 0.01 (default 0.05 is too high)
  • Use --jinja flag for llama.cpp

Note: Not recommended for Ollama due to chat template issues. Works well with llama.cpp, LM Studio, Jan.

Usage

Works with llama.cpp, LM Studio, Jan, koboldcpp, etc.

πŸ“‚ GGUF File List

πŸ“ Filename πŸ“¦ Size ⚑ Download
GLM-4.7-Flash-Uncensored-HauhauCS-Balanced-FP16.gguf
LFS FP16
55.79 GB Download
GLM-4.7-Flash-Uncensored-HauhauCS-Balanced-Q4_K_M.gguf
Recommended LFS Q4
16.89 GB Download
GLM-4.7-Flash-Uncensored-HauhauCS-Balanced-Q6_K.gguf
LFS Q6
22.92 GB Download
GLM-4.7-Flash-Uncensored-HauhauCS-Balanced-Q8_0.gguf
LFS Q8
29.66 GB Download