π Model Description
base_model: sentence-transformers/all-MiniLM-L6-v2 license: apache-2.0 library_name: sentence-transformers model_creator: Sentence Transformers quantized_by: Second State Inc. language: en tags:
- sentence-transformers
- feature-extraction
- sentence-similarity
- transformers
All-MiniLM-L6-v2-GGUF
Original Model
sentence-transformers/all-MiniLM-L6-v2
Run with LlamaEdge
- LlamaEdge version: v0.8.2 and above
- Context size:
384 - Vector size:
256 - Run as LlamaEdge service
wasmedge --dir .:. --nn-preload default:GGML:AUTO:all-MiniLM-L6-v2-ggml-model-f16.gguf \
llama-api-server.wasm \
--prompt-template llama-2-chat \
--ctx-size 256 \
--model-name all-MiniLM-L6-v2
Quantized GGUF Models
| Name | Quant method | Bits | Size | Use case |
|---|---|---|---|---|
| all-MiniLM-L6-v2-Q2K.gguf | Q2K | 2 | 19.2 MB | smallest, significant quality loss - not recommended for most purposes |
| all-MiniLM-L6-v2-Q3KL.gguf | Q3K_L | 3 | 20.5 MB | small, substantial quality loss |
| all-MiniLM-L6-v2-Q3KM.gguf | Q3K_M | 3 | 19.9 MB | very small, high quality loss |
| all-MiniLM-L6-v2-Q3KS.gguf | Q3K_S | 3 | 19.2 MB | very small, high quality loss |
| all-MiniLM-L6-v2-Q40.gguf | Q40 | 4 | 19.7 MB | legacy; small, very high quality loss - prefer using Q3KM |
| all-MiniLM-L6-v2-Q4KM.gguf | Q4K_M | 4 | 21 MB | medium, balanced quality - recommended |
| all-MiniLM-L6-v2-Q4KS.gguf | Q4K_S | 4 | 20.7 MB | small, greater quality loss |
| all-MiniLM-L6-v2-Q50.gguf | Q50 | 5 | 21 MB | legacy; medium, balanced quality - prefer using Q4KM |
| all-MiniLM-L6-v2-Q5KM.gguf | Q5K_M | 5 | 21.7 MB | large, very low quality loss - recommended |
| all-MiniLM-L6-v2-Q5KS.gguf | Q5K_S | 5 | 21.5 MB | large, low quality loss - recommended |
| all-MiniLM-L6-v2-Q6K.gguf | Q6K | 6 | 24.2 MB | very large, extremely low quality loss |
| all-MiniLM-L6-v2-Q80.gguf | Q80 | 8 | 25 MB | very large, extremely low quality loss - not recommended |
| all-MiniLM-L6-v2-ggml-model-f16.gguf | Q80 | 8 | 45.9 MB | very large, extremely low quality loss - not recommended |
π GGUF File List
| π Filename | π¦ Size | β‘ Download |
|---|---|---|
|
all-MiniLM-L6-v2-Q2_K.gguf
LFS
Q2
|
18.34 MB | Download |
|
all-MiniLM-L6-v2-Q3_K_L.gguf
LFS
Q3
|
19.53 MB | Download |
|
all-MiniLM-L6-v2-Q3_K_M.gguf
LFS
Q3
|
19.02 MB | Download |
|
all-MiniLM-L6-v2-Q3_K_S.gguf
LFS
Q3
|
18.34 MB | Download |
|
all-MiniLM-L6-v2-Q4_0.gguf
Recommended
LFS
Q4
|
18.79 MB | Download |
|
all-MiniLM-L6-v2-Q4_K_M.gguf
LFS
Q4
|
20.03 MB | Download |
|
all-MiniLM-L6-v2-Q4_K_S.gguf
LFS
Q4
|
19.74 MB | Download |
|
all-MiniLM-L6-v2-Q5_0.gguf
LFS
Q5
|
20.05 MB | Download |
|
all-MiniLM-L6-v2-Q5_K_M.gguf
LFS
Q5
|
20.71 MB | Download |
|
all-MiniLM-L6-v2-Q5_K_S.gguf
LFS
Q5
|
20.47 MB | Download |
|
all-MiniLM-L6-v2-Q6_K.gguf
LFS
Q6
|
23.03 MB | Download |
|
all-MiniLM-L6-v2-Q8_0.gguf
LFS
Q8
|
23.85 MB | Download |
|
all-MiniLM-L6-v2-ggml-model-f16.gguf
LFS
FP16
|
43.82 MB | Download |