📋 Model Description


base_model:
  • Qwen/Qwen3-Coder-Next
tags:
  • text-generation-inference
license: apache-2.0


Qwen3-Coder-Next-REAP-60B-A3B has the following specifications:

  • Type: Causal Language Models
  • Number of Parameters: 60B in total and 3B activated
  • Hidden Dimension: 2048
  • Number of Layers: 48
  • Hybrid Layout: 12 * (3 * (Gated DeltaNet -> MoE) -> 1 * (Gated Attention -> MoE))
  • Gated Attention:
      • Number of Attention Heads: 16 for Q and 2 for KV
      • Head Dimension: 256
      • Rotary Position Embedding Dimension: 64
  • Gated DeltaNet:
      • Number of Linear Attention Heads: 32 for V and 16 for QK
      • Head Dimension: 128
  • Mixture of Experts:
      • Number of Experts: 384 (uniformly pruned from 512)
      • Number of Activated Experts: 10
      • Number of Shared Experts: 1
  • Context Length: 262,144 tokens natively
  • Compression Method: REAP (Router-weighted Expert Activation Pruning)
  • Compression Ratio: 25% expert pruning
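
The figures in the list above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it expands the hybrid layout into a per-layer list, recomputes the pruning ratio, and estimates KV-cache size. The KV-cache estimate rests on two assumptions that are not stated in this card: that only the Gated Attention layers hold a per-token KV cache (the Gated DeltaNet layers are treated as keeping a fixed-size state, which is not counted), and that cache values are stored in 16 bits. All function names are hypothetical.

```python
# Values copied from the specification list above.
NUM_LAYERS = 48
BLOCK_REPEATS = 12
DELTANET_PER_BLOCK = 3
ATTENTION_PER_BLOCK = 1
KV_HEADS = 2
HEAD_DIM = 256
CONTEXT_LENGTH = 262_144
EXPERTS_ORIGINAL = 512
EXPERTS_KEPT = 384


def layer_types() -> list:
    """Expand 12 * (3 * DeltaNet -> 1 * Attention) into one entry per layer."""
    block = (["gated_deltanet"] * DELTANET_PER_BLOCK
             + ["gated_attention"] * ATTENTION_PER_BLOCK)
    return block * BLOCK_REPEATS


def pruned_fraction() -> float:
    """Fraction of routed experts removed by pruning."""
    return 1 - EXPERTS_KEPT / EXPERTS_ORIGINAL


def kv_cache_bytes(tokens: int, bytes_per_value: int = 2) -> int:
    """Estimate KV-cache size for a given number of cached tokens.

    Assumption: only Gated Attention layers store per-token keys and values,
    and each value takes `bytes_per_value` bytes (2 for a 16-bit cache).
    """
    attention_layers = layer_types().count("gated_attention")
    # Factor 2 covers keys plus values.
    per_token = 2 * KV_HEADS * HEAD_DIM * bytes_per_value * attention_layers
    return per_token * tokens


if __name__ == "__main__":
    print(len(layer_types()))                      # 48
    print(pruned_fraction())                       # 0.25
    print(kv_cache_bytes(CONTEXT_LENGTH) / 2**30)  # 6.0
```

Under these assumptions the layout expands to 48 layers, 12 of them Gated Attention, and a full 262,144-token context needs about 6 GiB of KV cache on top of the model weights.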

📂 GGUF File List

| Filename | Quantization | Size | Notes |
|---|---|---|---|
| Qwen3-Coder-Next-REAP-60B-A3B-Q3_K_XL.gguf | Q3 | 31.71 GB | Recommended |
| Qwen3-Coder-Next-REAP-60B-A3B-Q4_K_XL.gguf | Q4 | 38.13 GB | |
| Qwen3-Coder-Next-REAP-60B-A3B-Q6_K_XL.gguf | Q6 | 48.76 GB | |
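
One way to choose among the files above is to take the largest quantization that fits in available memory. The sketch below is a hypothetical helper, not part of any official tooling: `pick_gguf` and its `headroom_gb` parameter are names introduced here, and the default 8 GB of headroom (for KV cache, activations, and runtime overhead) is an assumed placeholder rather than a measured requirement.

```python
from typing import Optional

# (filename, size in GB) pairs copied from the file list above.
GGUF_FILES = [
    ("Qwen3-Coder-Next-REAP-60B-A3B-Q3_K_XL.gguf", 31.71),
    ("Qwen3-Coder-Next-REAP-60B-A3B-Q4_K_XL.gguf", 38.13),
    ("Qwen3-Coder-Next-REAP-60B-A3B-Q6_K_XL.gguf", 48.76),
]


def pick_gguf(memory_gb: float, headroom_gb: float = 8.0) -> Optional[str]:
    """Return the largest listed file that fits in memory, or None.

    `headroom_gb` is an assumed reserve for everything other than the
    weights; tune it for the actual runtime and context length.
    """
    budget = memory_gb - headroom_gb
    fitting = [(size, name) for name, size in GGUF_FILES if size <= budget]
    if not fitting:
        return None
    # max() compares by size first, so this yields the largest fitting file.
    return max(fitting)[1]


if __name__ == "__main__":
    print(pick_gguf(48.0))  # Qwen3-Coder-Next-REAP-60B-A3B-Q4_K_XL.gguf
```

With 48 GB of memory and the assumed 8 GB reserve, the budget is 40 GB, so the Q4 file is selected. A machine with 32 GB gets `None` because even the Q3 file exceeds the 24 GB budget.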