📋 Model Description
Quantization made by Richard Erkhov.
SELM-Llama-3-8B-Instruct-iter-3 - GGUF
- Model creator: https://huggingface.co/ZhangShenao/
- Original model: https://huggingface.co/ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3/
Original model description:
license: mit
base_model: ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2
tags:
- alignment-handbook
- dpo
- trl
- selm
datasets:
- HuggingFaceH4/ultrafeedback_binarized
model-index:
- name: SELM-Llama-3-8B-Instruct-iter-3
results: []
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment.
SELM-Llama-3-8B-Instruct-iter-3
This model is a fine-tuned version of ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2, trained on synthetic data based on the HuggingFaceH4/ultrafeedback_binarized dataset.
Model description
- Model type: An 8B-parameter, Llama3-instruct-based Self-Exploring Language Model (SELM).
- License: MIT
Results
| Model | AlpacaEval 2.0 (LC WR) | MT-Bench (Average) |
|---|---|---|
| SELM-Llama-3-8B-Instruct-iter-3 | 33.47 | 8.29 |
| SELM-Llama-3-8B-Instruct-iter-2 | 35.65 | 8.09 |
| SELM-Llama-3-8B-Instruct-iter-1 | 32.02 | 7.92 |
| Meta-Llama-3-8B-Instruct | 24.31 | 7.93 |
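To make the table easier to read, the gain of each SELM iteration over the Meta-Llama-3-8B-Instruct baseline can be computed directly from the reported scores. The sketch below only restates the numbers in the table; the `scores` dictionary and the `gains_over` helper are illustrative names, not part of the original release.

```python
# Scores copied from the results table: (AlpacaEval 2.0 LC WR, MT-Bench average).
scores = {
    "SELM-Llama-3-8B-Instruct-iter-3": (33.47, 8.29),
    "SELM-Llama-3-8B-Instruct-iter-2": (35.65, 8.09),
    "SELM-Llama-3-8B-Instruct-iter-1": (32.02, 7.92),
    "Meta-Llama-3-8B-Instruct": (24.31, 7.93),
}


def gains_over(baseline: str) -> dict:
    """Return (LC WR delta, MT-Bench delta) of every other model vs. the baseline."""
    base_wr, base_mt = scores[baseline]
    return {
        name: (round(wr - base_wr, 2), round(mt - base_mt, 2))
        for name, (wr, mt) in scores.items()
        if name != baseline
    }


gains = gains_over("Meta-Llama-3-8B-Instruct")
for name, (d_wr, d_mt) in gains.items():
    print(f"{name}: {d_wr:+.2f} LC WR, {d_mt:+.2f} MT-Bench")
```

On these numbers, iter-3 has the highest MT-Bench average, while iter-2 has the highest AlpacaEval 2.0 length-controlled win rate.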
Training hyperparameters
The following hyperparameters were used during training:
- alpha: 0.0001
- beta: 0.01
- train_batch_size: 4
- seed: 42
- distributed_type: multi-GPU
- num_devices: 8
- gradient_accumulation_steps: 4
- total_train_batch_size: 128
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- num_epochs: 1
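The total batch size of 128 follows from the per-device batch size, the number of devices, and the gradient accumulation steps listed above. A minimal check, using variable names chosen here for illustration:

```python
# Values taken from the hyperparameter list above.
train_batch_size = 4              # samples per device per forward/backward pass
num_devices = 8                   # GPUs in the multi-GPU run
gradient_accumulation_steps = 4   # passes accumulated before each optimizer step

# Effective number of samples contributing to one optimizer step.
total_train_batch_size = (
    train_batch_size * num_devices * gradient_accumulation_steps
)
print(total_train_batch_size)  # 128
```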
Framework versions
- Transformers 4.40.2
- Pytorch 2.1.2+cu121
- Datasets 2.14.6
- Tokenizers 0.19.1
📂 GGUF File List
| 📁 Filename | Quant | 📦 Size |
|---|---|---|
| SELM-Llama-3-8B-Instruct-iter-3.IQ3_M.gguf | Q3 | 3.52 GB |
| SELM-Llama-3-8B-Instruct-iter-3.IQ3_S.gguf | Q3 | 3.43 GB |
| SELM-Llama-3-8B-Instruct-iter-3.IQ3_XS.gguf | Q3 | 3.28 GB |
| SELM-Llama-3-8B-Instruct-iter-3.IQ4_NL.gguf | Q4 | 4.38 GB |
| SELM-Llama-3-8B-Instruct-iter-3.IQ4_XS.gguf | Q4 | 4.18 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q2_K.gguf | Q2 | 2.96 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q3_K.gguf | Q3 | 3.74 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q3_K_L.gguf | Q3 | 4.03 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q3_K_M.gguf | Q3 | 3.74 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q3_K_S.gguf | Q3 | 3.41 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q4_0.gguf (Recommended) | Q4 | 4.34 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q4_1.gguf | Q4 | 4.78 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q4_K.gguf | Q4 | 4.58 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q4_K_M.gguf | Q4 | 4.58 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q4_K_S.gguf | Q4 | 4.37 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q5_0.gguf | Q5 | 5.21 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q5_1.gguf | Q5 | 5.65 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q5_K.gguf | Q5 | 5.34 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q5_K_M.gguf | Q5 | 5.34 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q5_K_S.gguf | Q5 | 5.21 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q6_K.gguf | Q6 | 6.14 GB |
| SELM-Llama-3-8B-Instruct-iter-3.Q8_0.gguf | Q8 | 7.95 GB |
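When choosing among these files, one rough approach is to pick the largest quantization whose file size fits a given memory budget. The sketch below uses the sizes listed above. The `largest_fit` helper and its `headroom_gb` parameter are hypothetical, and the 1 GB default headroom is an assumption rather than a measured requirement: actual memory use also depends on context length and the runtime.

```python
# File sizes in GB, copied from the GGUF file list above.
GGUF_SIZES_GB = {
    "IQ3_M": 3.52, "IQ3_S": 3.43, "IQ3_XS": 3.28, "IQ4_NL": 4.38, "IQ4_XS": 4.18,
    "Q2_K": 2.96, "Q3_K": 3.74, "Q3_K_L": 4.03, "Q3_K_M": 3.74, "Q3_K_S": 3.41,
    "Q4_0": 4.34, "Q4_1": 4.78, "Q4_K": 4.58, "Q4_K_M": 4.58, "Q4_K_S": 4.37,
    "Q5_0": 5.21, "Q5_1": 5.65, "Q5_K": 5.34, "Q5_K_M": 5.34, "Q5_K_S": 5.21,
    "Q6_K": 6.14, "Q8_0": 7.95,
}


def largest_fit(budget_gb: float, headroom_gb: float = 1.0):
    """Return the filename of the largest file that fits, or None if none does.

    headroom_gb is a hypothetical allowance for everything other than the
    weights; tune it for your own setup.
    """
    limit = budget_gb - headroom_gb
    candidates = [
        (size, name) for name, size in GGUF_SIZES_GB.items() if size <= limit
    ]
    if not candidates:
        return None
    _, name = max(candidates)  # largest size; ties broken by name
    return f"SELM-Llama-3-8B-Instruct-iter-3.{name}.gguf"


print(largest_fit(6.0))  # SELM-Llama-3-8B-Instruct-iter-3.Q4_1.gguf
print(largest_fit(8.0))  # SELM-Llama-3-8B-Instruct-iter-3.Q6_K.gguf
print(largest_fit(3.0))  # None
```

File size is used here only as a proxy for quality. It does not replace the list's own recommendation of Q4_0.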