πŸ“‹ Model Description


pipeline_tag: text-generation base_model:
  • nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

This is a MXFP4MOE imatrix quantization of the model NVIDIA-Nemotron-3-Nano-30B-A3B, based on the imatrix from unsloth.

Get the latest llama.cpp in order to run it.

Also see the instructions here: Unsloth NVIDIA Nemotron 3 Nano - How To Run Guide

πŸ“‚ GGUF File List

πŸ“ Filename πŸ“¦ Size ⚑ Download
NVIDIA-Nemotron-3-Nano-30B-A3B-MXFP4_MOE.gguf
Recommended LFS
16.75 GB Download