π Model Description
Quantization made by Richard Erkhov.
final-tc-deepseek-coder-1.3b-instruct - GGUF
- Model creator: https://huggingface.co/Yhhhhhhhhh/
- Original model: https://huggingface.co/Yhhhhhhhhh/final-tc-deepseek-coder-1.3b-instruct/
Original model description:
library_name: transformers
license: other
base_model: deepseek-ai/deepseek-coder-1.3b-instruct
tags:
- llama-factory
- full
- generatedfromtrainer
model-index:
- name: nopytcfinalsftdeepseek-coder-1.3b-instruct
results: []
nopytcfinalsftdeepseek-coder-1.3b-instruct
This model is a fine-tuned version of deepseek-ai/deepseek-coder-1.3b-instruct on the output dataset.
It achieves the following results on the evaluation set:
- Loss: 0.2609
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learningrate: 5e-06
- trainbatchsize: 8
- evalbatchsize: 1
- seed: 42
- gradientaccumulationsteps: 2
- totaltrainbatchsize: 16
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lrschedulertype: cosine
- lrschedulerwarmupratio: 0.03
- numepochs: 4.0
Training results
Framework versions
- Transformers 4.44.2
- Pytorch 2.5.0+cu121
- Datasets 2.21.0
- Tokenizers 0.19.1
π GGUF File List
π Filename | π¦ Size | β‘ Download |
---|---|---|
final-tc-deepseek-coder-1.3b-instruct.IQ3_M.gguf
LFS
Q3
|
641.62 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.IQ3_S.gguf
LFS
Q3
|
612.09 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.IQ3_XS.gguf
LFS
Q3
|
584.95 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.IQ4_NL.gguf
LFS
Q4
|
746.04 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.IQ4_XS.gguf
LFS
Q4
|
715.94 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q2_K.gguf
LFS
Q2
|
533.79 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q3_K.gguf
LFS
Q3
|
671.51 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q3_K_L.gguf
LFS
Q3
|
709.97 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q3_K_M.gguf
LFS
Q3
|
671.51 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q3_K_S.gguf
LFS
Q3
|
612.09 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q4_0.gguf
Recommended
LFS
Q4
|
739.99 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q4_1.gguf
LFS
Q4
|
816.3 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q4_K.gguf
LFS
Q4
|
832.99 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q4_K_M.gguf
LFS
Q4
|
832.99 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q4_K_S.gguf
LFS
Q4
|
776.26 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q5_0.gguf
LFS
Q5
|
892.62 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q5_1.gguf
LFS
Q5
|
968.93 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q5_K.gguf
LFS
Q5
|
955.43 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q5_K_M.gguf
LFS
Q5
|
955.43 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q5_K_S.gguf
LFS
Q5
|
908.74 MB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q6_K.gguf
LFS
Q6
|
1.09 GB | Download |
final-tc-deepseek-coder-1.3b-instruct.Q8_0.gguf
LFS
Q8
|
1.33 GB | Download |