πŸ“‹ Model Description


pipeline_tag: text-generation base_model:
  • zai-org/GLM-4.7-Flash

This is a MXFP4MOE quantization of the model GLM-4.7-Flash.

The suggested parameters from the official docs for general chat are:

--temp 1.0
--top-p 0.95
--min-p 0.01
--repeat-penalty 1.0

And for tool-calling:
--temp 0.7
--top-p 1.0
--min-p 0.01
--repeat-penalty 1.0

πŸ“‚ GGUF File List

πŸ“ Filename πŸ“¦ Size ⚑ Download
GLM-4.7-Flash-MXFP4_MOE.gguf
Recommended LFS
16.08 GB Download