Model Description


pipeline_tag: text-generation
base_model:
  - moonshotai/Kimi-Linear-48B-A3B-Instruct

This is an MXFP4_MOE quantization of the model Kimi-Linear-48B-A3B-Instruct.

The mainline standard is to use MXFP4 for the MoE tensors and Q8 for the rest.
So I created two new variants where the remaining tensors are kept in BF16 or F16 instead of Q8.
The order of preference is BF16, then F16.
On some architectures BF16 will be slower, but it is the highest quality: those tensors are copied over from the original model unquantized.
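If you want to verify how the tensors were quantized, a minimal sketch using the `gguf` Python package that ships with llama.cpp (install with `pip install gguf`) is shown below; the file path is a placeholder for whichever variant you downloaded:

```python
# Count how many tensors use each quantization type in the GGUF file.
from collections import Counter

from gguf import GGUFReader

# Placeholder path: point this at your local copy of the GGUF file.
reader = GGUFReader("Kimi-Linear-48B-A3B-Instruct-MXFP4_MOE_BF16.gguf")

counts = Counter(t.tensor_type.name for t in reader.tensors)
for quant_type, n in counts.most_common():
    print(f"{quant_type}: {n} tensors")

# Expected: the MoE expert tensors report MXFP4, the remaining tensors BF16
# (or F16 in the F16 variant).
```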

GGUF File List

| 📁 Filename | 📦 Size | Notes |
| --- | --- | --- |
| Kimi-Linear-48B-A3B-Instruct-MXFP4_MOE_BF16.gguf | 27.05 GB | Recommended |
| Kimi-Linear-48B-A3B-Instruct-MXFP4_MOE_F16.gguf | 27.05 GB | |
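A quick way to fetch and try a variant is with `huggingface_hub` and `llama-cpp-python`. This is a sketch only: the `repo_id` below is a placeholder for this repository, and it assumes your llama.cpp build is recent enough to support the Kimi-Linear architecture and MXFP4.

```python
# Download the recommended BF16 variant and run a short generation.
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

gguf_path = hf_hub_download(
    repo_id="your-username/Kimi-Linear-48B-A3B-Instruct-MXFP4_MOE",  # placeholder repo id
    filename="Kimi-Linear-48B-A3B-Instruct-MXFP4_MOE_BF16.gguf",
)

llm = Llama(model_path=gguf_path, n_ctx=4096, n_gpu_layers=-1)
out = llm("Explain MXFP4 quantization in one sentence.", max_tokens=128)
print(out["choices"][0]["text"])
```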