πŸ“‹ Model Description


datasets:
  • IlyaGusev/saiga_scored
  • IlyaGusev/saiga_preferences
language:
  • ru
inference: false
license: apache-2.0

Llama.cpp-compatible GGUF versions of the original 12B model, saiga_nemo_12b.

Download one of the versions, for example saiga_nemo_12b.Q4_K_M.gguf.

wget https://huggingface.co/IlyaGusev/saiga_nemo_12b_gguf/resolve/main/saiga_nemo_12b.Q4_K_M.gguf
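
Alternatively, if you prefer the Hugging Face Hub client to wget, a minimal Python sketch is below; the repo id and filename are taken from the URL above, and the huggingface-hub package must be installed (pip install huggingface-hub):

# Minimal sketch: fetch the chosen quantization via huggingface_hub instead of wget.
from huggingface_hub import hf_hub_download

model_path = hf_hub_download(
    repo_id="IlyaGusev/saiga_nemo_12b_gguf",   # repo id from the wget URL above
    filename="saiga_nemo_12b.Q4_K_M.gguf",     # pick any file from the list below
)
print(model_path)  # path of the downloaded file in the local Hugging Face cache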

Download interact_llama3_llamacpp.py

wget https://raw.githubusercontent.com/IlyaGusev/rulm/master/self_instruct/src/interact_llama3_llamacpp.py

How to run:

pip install llama-cpp-python fire

python3 interact_llama3_llamacpp.py saiga_nemo_12b.Q4_K_M.gguf

System requirements:

  • 15 GB RAM for Q8_0; smaller quantizations need less
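
If you would rather call llama-cpp-python directly instead of the interactive script, a minimal sketch follows; the context size, sampling parameters, and prompts are illustrative choices, not the script's defaults:

# Minimal sketch: chat with the GGUF model through llama-cpp-python directly.
from llama_cpp import Llama

llm = Llama(
    model_path="saiga_nemo_12b.Q4_K_M.gguf",  # any quantization from the list below
    n_ctx=8192,                               # illustrative context window
    verbose=False,
)

response = llm.create_chat_completion(
    messages=[
        # Illustrative system prompt; replace with your own instructions.
        {"role": "system", "content": "You are Saiga, a helpful Russian-speaking assistant."},
        # "Tell me briefly about yourself."
        {"role": "user", "content": "РасскаТи ΠΊΠΎΡ€ΠΎΡ‚ΠΊΠΎ ΠΎ сСбС."},
    ],
    temperature=0.3,
    max_tokens=512,
)
print(response["choices"][0]["message"]["content"])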

πŸ“‚ GGUF File List

πŸ“ Filename πŸ“¦ Size ⚑ Download
saiga_nemo_12b.BF16.gguf
LFS FP16
22.82 GB Download
saiga_nemo_12b.Q2_K.gguf
LFS Q2
4.46 GB Download
saiga_nemo_12b.Q3_K_M.gguf
LFS Q3
5.67 GB Download
saiga_nemo_12b.Q3_K_S.gguf
LFS Q3
5.15 GB Download
saiga_nemo_12b.Q4_0.gguf
Recommended LFS Q4
6.59 GB Download
saiga_nemo_12b.Q4_K_M.gguf
LFS Q4
6.96 GB Download
saiga_nemo_12b.Q4_K_S.gguf
LFS Q4
6.63 GB Download
saiga_nemo_12b.Q5_K_M.gguf
LFS Q5
8.13 GB Download
saiga_nemo_12b.Q5_K_S.gguf
LFS Q5
7.93 GB Download
saiga_nemo_12b.Q6_K.gguf
LFS Q6
9.37 GB Download
saiga_nemo_12b.Q8_0.gguf
LFS Q8
12.13 GB Download