π Model Description
license: other license_link: LICENSE base_model:
- Nitral-AI/CaptainErisNebula-12B-Chimera-v1.1
- text-generation-inference
- testing
- mistral
- chatml
[!TIP]
# GGUF quants for Nitral-AI/CaptainErisNebula-12B-Chimera-v1.1's recipe.
[!IMPORTANT]
Author recommended initial SillyTavern presets:
!https://iili.io/KKtCMf2.md.jpg
[!NOTE]
## This is an improvement on the previous experimental version.
- Not "chaotic", and at a usable size for most people seeking to perform inference locally with good speeds.
- The model does not show excessive alignment, so it should be good for most scenarios/writing situations.
- Feel free to use some light system prompting to nudge it out of a blocker if needed.
- It does well in adhering to characters and instructions.
Thank you so much, "crazy chef" and "mad scientist", Nitral!
# Using the latest llama.cpp ...
release version at the time: b6258.
Imatrix was based on the full ...
FP16 precision GGUF.
START: BF16 HuggingFace Model
β
(1) Conversion to Full-Precision GGUF
β
FP16 GGUF (for Calibration Imatrix)
BF16 GGUF (for Quantization)
β
(2) Generate Imatrix (from FP16 GGUF)
β
imatrix.fp16.gguf
β
(3) Quantize with Imatrix (using BF16 GGUF)
β
Final Quantized GGUF Models
β
END
π GGUF File List
| π Filename | π¦ Size | β‘ Download |
|---|---|---|
|
ARM-CaptainErisNebula-12B-Chimera-v1.1-Q4_0-imat.gguf
Recommended
LFS
Q4
|
6.61 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-BF16.gguf
LFS
FP16
|
22.82 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-F16.gguf
LFS
FP16
|
22.82 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-IQ3_M-imat.gguf
LFS
Q3
|
5.33 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-IQ3_S-imat.gguf
LFS
Q3
|
5.18 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-IQ3_XS-imat.gguf
LFS
Q3
|
4.94 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-IQ3_XXS-imat.gguf
LFS
Q3
|
4.61 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-IQ4_XS-imat.gguf
LFS
Q4
|
6.28 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-Q3_K_L-imat.gguf
LFS
Q3
|
6.11 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-Q3_K_M-imat.gguf
LFS
Q3
|
5.67 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-Q4_K_M-imat.gguf
LFS
Q4
|
6.96 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-Q4_K_S-imat.gguf
LFS
Q4
|
6.63 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-Q5_K_M-imat.gguf
LFS
Q5
|
8.13 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-Q5_K_S-imat.gguf
LFS
Q5
|
7.93 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-Q6_K-imat.gguf
LFS
Q6
|
9.37 GB | Download |
|
CaptainErisNebula-12B-Chimera-v1.1-Q8_0-imat.gguf
LFS
Q8
|
12.13 GB | Download |
|
imatrix-fp16.gguf
LFS
FP16
|
6.76 MB | Download |