πŸ“‹ Model Description


base_model: sentence-transformers/all-MiniLM-L6-v2 license: apache-2.0 library_name: sentence-transformers model_creator: Sentence Transformers quantized_by: Second State Inc. language: en tags:
  • sentence-transformers
  • feature-extraction
  • sentence-similarity
  • transformers








All-MiniLM-L6-v2-GGUF

Original Model

sentence-transformers/all-MiniLM-L6-v2

Run with LlamaEdge

  • LlamaEdge version: v0.8.2 and above
  • Context size: 384
  • Vector size: 256
  • Run as LlamaEdge service
wasmedge --dir .:. --nn-preload default:GGML:AUTO:all-MiniLM-L6-v2-ggml-model-f16.gguf \
    llama-api-server.wasm \
    --prompt-template llama-2-chat \
    --ctx-size 256 \
    --model-name all-MiniLM-L6-v2

Quantized GGUF Models

NameQuant methodBitsSizeUse case
all-MiniLM-L6-v2-Q2K.ggufQ2K219.2 MBsmallest, significant quality loss - not recommended for most purposes
all-MiniLM-L6-v2-Q3KL.ggufQ3K_L320.5 MBsmall, substantial quality loss
all-MiniLM-L6-v2-Q3KM.ggufQ3K_M319.9 MBvery small, high quality loss
all-MiniLM-L6-v2-Q3KS.ggufQ3K_S319.2 MBvery small, high quality loss
all-MiniLM-L6-v2-Q40.ggufQ40419.7 MBlegacy; small, very high quality loss - prefer using Q3KM
all-MiniLM-L6-v2-Q4KM.ggufQ4K_M421 MBmedium, balanced quality - recommended
all-MiniLM-L6-v2-Q4KS.ggufQ4K_S420.7 MBsmall, greater quality loss
all-MiniLM-L6-v2-Q50.ggufQ50521 MBlegacy; medium, balanced quality - prefer using Q4KM
all-MiniLM-L6-v2-Q5KM.ggufQ5K_M521.7 MBlarge, very low quality loss - recommended
all-MiniLM-L6-v2-Q5KS.ggufQ5K_S521.5 MBlarge, low quality loss - recommended
all-MiniLM-L6-v2-Q6K.ggufQ6K624.2 MBvery large, extremely low quality loss
all-MiniLM-L6-v2-Q80.ggufQ80825 MBvery large, extremely low quality loss - not recommended
all-MiniLM-L6-v2-ggml-model-f16.ggufQ80845.9 MBvery large, extremely low quality loss - not recommended
Quantized with llama.cpp b2334

πŸ“‚ GGUF File List

πŸ“ Filename πŸ“¦ Size ⚑ Download
all-MiniLM-L6-v2-Q2_K.gguf
LFS Q2
18.34 MB Download
all-MiniLM-L6-v2-Q3_K_L.gguf
LFS Q3
19.53 MB Download
all-MiniLM-L6-v2-Q3_K_M.gguf
LFS Q3
19.02 MB Download
all-MiniLM-L6-v2-Q3_K_S.gguf
LFS Q3
18.34 MB Download
all-MiniLM-L6-v2-Q4_0.gguf
Recommended LFS Q4
18.79 MB Download
all-MiniLM-L6-v2-Q4_K_M.gguf
LFS Q4
20.03 MB Download
all-MiniLM-L6-v2-Q4_K_S.gguf
LFS Q4
19.74 MB Download
all-MiniLM-L6-v2-Q5_0.gguf
LFS Q5
20.05 MB Download
all-MiniLM-L6-v2-Q5_K_M.gguf
LFS Q5
20.71 MB Download
all-MiniLM-L6-v2-Q5_K_S.gguf
LFS Q5
20.47 MB Download
all-MiniLM-L6-v2-Q6_K.gguf
LFS Q6
23.03 MB Download
all-MiniLM-L6-v2-Q8_0.gguf
LFS Q8
23.85 MB Download
all-MiniLM-L6-v2-ggml-model-f16.gguf
LFS FP16
43.82 MB Download