📋 Model Description


license: apache-2.0
language:
  • ja
datasets:
  • TFMC/imatrix-dataset-for-japanese-llm
base_model:
  • tokyotech-llm/GPT-OSS-Swallow-20B-SFT-v0.1

GPT-OSS-Swallow-20B-SFT-v0.1-gguf

This is a GGUF-format conversion of GPT-OSS-Swallow-20B-SFT-v0.1, published by tokyotech-llm.

The imatrix data was created using TFMC/imatrix-dataset-for-japanese-llm.

Usage

git clone https://github.com/ggml-org/llama.cpp.git
cd llama.cpp
cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release
build/bin/llama-cli -m 'GPT-OSS-Swallow-20B-SFT-v0.1-Q4_0.gguf' -n 128 -c 128 -p 'あなたはプロの料理人です。レシピを教えて' -cnv
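
The last step above can be wrapped in a small script that checks the model file is present before launching. This is a minimal sketch, not part of the original instructions: `gguf_name` and `run_model` are hypothetical helper names, and it assumes the chosen GGUF file has already been downloaded into the current directory and that `build/bin/llama-cli` was built as shown. The flags are the ones from the command above.

```shell
#!/usr/bin/env bash

# Hypothetical helper: map a quantization label (e.g. Q4_0) to the
# file name pattern used in this repository's file list.
gguf_name() {
  printf 'GPT-OSS-Swallow-20B-SFT-v0.1-%s.gguf\n' "$1"
}

# Hypothetical helper: run llama-cli in conversation mode with the same
# flags as the Usage command, after checking that the model file exists.
#   $1: quantization label (defaults to Q4_0, the file marked Recommended)
#   $2: system prompt (defaults to the prompt from the Usage command)
run_model() {
  local model prompt
  model="$(gguf_name "${1:-Q4_0}")"
  prompt="${2:-あなたはプロの料理人です。レシピを教えて}"
  if [ ! -f "$model" ]; then
    echo "model file not found: $model" >&2
    return 1
  fi
  build/bin/llama-cli -m "$model" -n 128 -c 128 -p "$prompt" -cnv
}
```

After sourcing the script from the llama.cpp directory, `run_model Q4_0` starts a conversation with the default prompt, and `run_model Q5_K_M 'another prompt'` selects a different file and prompt.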

📂 GGUF File List

๐Ÿ“ Filename ๐Ÿ“ฆ Size โšก Download
GPT-OSS-Swallow-20B-SFT-v0.1-IQ3_M.gguf
LFS Q3
11.36 GB Download
GPT-OSS-Swallow-20B-SFT-v0.1-IQ4_NL.gguf
LFS Q4
11.27 GB Download
GPT-OSS-Swallow-20B-SFT-v0.1-IQ4_XS.gguf
LFS Q4
11.27 GB Download
GPT-OSS-Swallow-20B-SFT-v0.1-MXFP4_MOE.gguf
LFS
11.28 GB Download
GPT-OSS-Swallow-20B-SFT-v0.1-Q3_K_L.gguf
LFS Q3
12.42 GB Download
GPT-OSS-Swallow-20B-SFT-v0.1-Q3_K_M.gguf
LFS Q3
12.03 GB Download
GPT-OSS-Swallow-20B-SFT-v0.1-Q4_0.gguf
Recommended LFS Q4
11.27 GB Download
GPT-OSS-Swallow-20B-SFT-v0.1-Q4_K_M.gguf
LFS Q4
14.72 GB Download
GPT-OSS-Swallow-20B-SFT-v0.1-Q4_K_S.gguf
LFS Q4
13.65 GB Download
GPT-OSS-Swallow-20B-SFT-v0.1-Q5_0.gguf
LFS Q5
13.63 GB Download
GPT-OSS-Swallow-20B-SFT-v0.1-Q5_K_M.gguf
LFS Q5
15.73 GB Download
GPT-OSS-Swallow-20B-SFT-v0.1-Q5_K_S.gguf
LFS Q5
14.8 GB Download
GPT-OSS-Swallow-20B-SFT-v0.1-Q6_K.gguf
LFS Q6
20.67 GB Download
GPT-OSS-Swallow-20B-SFT-v0.1-Q8_0.gguf
LFS Q8
20.73 GB Download
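
To narrow the list down to files that fit a given size budget, the listed sizes can be filtered with a short script. This is a rough sketch: `quant_sizes` and `list_fitting` are hypothetical helper names, and the sizes are copied from the file list above. File size is only a lower bound on memory use, since the context cache and runtime buffers need additional space, so treat the result as a first filter rather than a guarantee.

```shell
#!/usr/bin/env bash

# Quantization label and file size in GB, copied from the file list.
quant_sizes() {
  cat <<'EOF'
IQ3_M 11.36
IQ4_NL 11.27
IQ4_XS 11.27
MXFP4_MOE 11.28
Q3_K_L 12.42
Q3_K_M 12.03
Q4_0 11.27
Q4_K_M 14.72
Q4_K_S 13.65
Q5_0 13.63
Q5_K_M 15.73
Q5_K_S 14.8
Q6_K 20.67
Q8_0 20.73
EOF
}

# Hypothetical helper: print every label whose file size is at most
# the budget given in GB as $1, in the order of the file list.
list_fitting() {
  quant_sizes | awk -v b="$1" '$2 + 0 <= b + 0 { print $1 }'
}
```

For example, `list_fitting 12.5` prints the labels of all files of 12.5 GB or less, one per line.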