π Model Description
No documentation available
π GGUF File List
π Filename | π¦ Size | β‘ Download |
---|---|---|
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-IQ2_M.gguf
LFS
Q2
|
2.75 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-IQ3_M.gguf
LFS
Q3
|
3.52 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-IQ3_XS.gguf
LFS
Q3
|
3.28 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-IQ3_XXS.gguf
LFS
Q3
|
3.05 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-IQ4_NL.gguf
LFS
Q4
|
4.36 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-IQ4_XS.gguf
LFS
Q4
|
4.14 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q2_K.gguf
LFS
Q2
|
2.96 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q2_K_L.gguf
LFS
Q2
|
3.44 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q3_K_L.gguf
LFS
Q3
|
4.03 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q3_K_M.gguf
LFS
Q3
|
3.74 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q3_K_S.gguf
LFS
Q3
|
3.41 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q3_K_XL.gguf
LFS
Q3
|
4.45 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q4_0.gguf
Recommended
LFS
Q4
|
4.35 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q4_1.gguf
LFS
Q4
|
4.78 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q4_K_L.gguf
LFS
Q4
|
4.95 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q4_K_M.gguf
LFS
Q4
|
4.58 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q4_K_S.gguf
LFS
Q4
|
4.37 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q5_K_L.gguf
LFS
Q5
|
5.64 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q5_K_M.gguf
LFS
Q5
|
5.34 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q5_K_S.gguf
LFS
Q5
|
5.21 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q6_K.gguf
LFS
Q6
|
6.14 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q6_K_L.gguf
LFS
Q6
|
6.38 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-Q8_0.gguf
LFS
Q8
|
7.95 GB | Download |
nvidia_Llama-3.1-Nemotron-Nano-8B-v1-bf16.gguf
LFS
FP16
|
14.97 GB | Download |