RichardErkhov/ruslandev_-_llama-3-8b-samantha-gguf

Name: RichardErkhov/ruslandev_-_llama-3-8b-samantha-gguf
Author: RichardErkhov

High-quality GGUF model

5.3K 📥 Downloads

0 ❤️ Likes

22 📁 GGUF Files

100.19 GB 💾 Total Size

2 years ago 🔄 Last Updated

📋 Model Description

Quantization made by Richard Erkhov.

Github

Discord

Request more models

llama-3-8b-samantha - GGUF

Model creator: https://huggingface.co/ruslandev/
Original model: https://huggingface.co/ruslandev/llama-3-8b-samantha/

Name	Quant method	Size
llama-3-8b-samantha.Q2K.gguf	Q2K	2.96GB
llama-3-8b-samantha.IQ3XS.gguf	IQ3XS	3.28GB
llama-3-8b-samantha.IQ3S.gguf	IQ3S	3.43GB
llama-3-8b-samantha.Q3KS.gguf	Q3K_S	3.41GB
llama-3-8b-samantha.IQ3M.gguf	IQ3M	3.52GB
llama-3-8b-samantha.Q3K.gguf	Q3K	3.74GB
llama-3-8b-samantha.Q3KM.gguf	Q3K_M	3.74GB
llama-3-8b-samantha.Q3KL.gguf	Q3K_L	4.03GB
llama-3-8b-samantha.IQ4XS.gguf	IQ4XS	4.18GB
llama-3-8b-samantha.Q40.gguf	Q40	4.34GB
llama-3-8b-samantha.IQ4NL.gguf	IQ4NL	4.38GB
llama-3-8b-samantha.Q4KS.gguf	Q4K_S	4.37GB
llama-3-8b-samantha.Q4K.gguf	Q4K	4.58GB
llama-3-8b-samantha.Q4KM.gguf	Q4K_M	4.58GB
llama-3-8b-samantha.Q41.gguf	Q41	4.78GB
llama-3-8b-samantha.Q50.gguf	Q50	5.21GB
llama-3-8b-samantha.Q5KS.gguf	Q5K_S	5.21GB
llama-3-8b-samantha.Q5K.gguf	Q5K	5.34GB
llama-3-8b-samantha.Q5KM.gguf	Q5K_M	5.34GB
llama-3-8b-samantha.Q51.gguf	Q51	5.65GB
llama-3-8b-samantha.Q6K.gguf	Q6K	6.14GB
llama-3-8b-samantha.Q80.gguf	Q80	7.95GB

Original model description:

language:

license: apache-2.0
tags:

text-generation-inference
transformers
unsloth
llama
trl

base_model: unsloth/llama-3-8b-bnb-4bit
datasets:

cognitivecomputations/samantha-data

Uploaded model

Developed by: ruslandev
License: apache-2.0
Finetuned from model : unsloth/llama-3-8b-bnb-4bit

This model is finetuned on the data of Samantha.
Prompt format is Alpaca. I used the same system prompt as the original Samantha.

"""Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.

Instruction:
{SYSTEM_PROMPT}

Input:
{QUESTION}

Response:
"""

Training

gptchain framework has been used for training.

Training hyperparameters

learningrate: 2e-4
seed: 3407
gradientaccumulationsteps: 4
perdevicetrainbatchsize: 2
optimizer: adamw8bit
lrschedulertype: linear
warmupsteps: 5
numepochs: 2
weight_decay: 0.01

Training results

Training Loss	Epoch	Step
2.0778	0.0	1
0.6255	0.18	120
0.6208	0.94	620
0.6244	2.0	1306

2 epoch finetuning from llama-3-8b took 1 hour on a single A100 with Unsloth and Huggingface's TRL library.

📂 GGUF File List

📁 Filename	📦 Size	⚡ Download
llama-3-8b-samantha.IQ3_M.gguf LFS Q3	3.52 GB	Download
llama-3-8b-samantha.IQ3_S.gguf LFS Q3	3.43 GB	Download
llama-3-8b-samantha.IQ3_XS.gguf LFS Q3	3.28 GB	Download
llama-3-8b-samantha.IQ4_NL.gguf LFS Q4	4.38 GB	Download
llama-3-8b-samantha.IQ4_XS.gguf LFS Q4	4.18 GB	Download
llama-3-8b-samantha.Q2_K.gguf LFS Q2	2.96 GB	Download
llama-3-8b-samantha.Q3_K.gguf LFS Q3	3.74 GB	Download
llama-3-8b-samantha.Q3_K_L.gguf LFS Q3	4.03 GB	Download
llama-3-8b-samantha.Q3_K_M.gguf LFS Q3	3.74 GB	Download
llama-3-8b-samantha.Q3_K_S.gguf LFS Q3	3.41 GB	Download
llama-3-8b-samantha.Q4_0.gguf Recommended LFS Q4	4.34 GB	Download
llama-3-8b-samantha.Q4_1.gguf LFS Q4	4.78 GB	Download
llama-3-8b-samantha.Q4_K.gguf LFS Q4	4.58 GB	Download
llama-3-8b-samantha.Q4_K_M.gguf LFS Q4	4.58 GB	Download
llama-3-8b-samantha.Q4_K_S.gguf LFS Q4	4.37 GB	Download
llama-3-8b-samantha.Q5_0.gguf LFS Q5	5.21 GB	Download
llama-3-8b-samantha.Q5_1.gguf LFS Q5	5.65 GB	Download
llama-3-8b-samantha.Q5_K.gguf LFS Q5	5.34 GB	Download
llama-3-8b-samantha.Q5_K_M.gguf LFS Q5	5.34 GB	Download
llama-3-8b-samantha.Q5_K_S.gguf LFS Q5	5.21 GB	Download
llama-3-8b-samantha.Q6_K.gguf LFS Q6	6.14 GB	Download
llama-3-8b-samantha.Q8_0.gguf LFS Q8	7.95 GB	Download

📊 Model Information

🆔 Model ID: RichardErkhov/ruslandev_-_llama-3-8b-samantha-gguf

📅 Created: 2 years ago

🔄 Last Updated: 2 years ago

📥 Downloads: 5.3K

❤️ Likes: 0

🎯 Difficulty: Advanced

⚙️ Quantization: Q3, Q4, Q2, Q5, Q6, Q8

🏷️ Tags

ggufendpoints_compatibleregion:us

🔗 Related Links

🤗 Visit HuggingFace ⚡ Quick Download