πŸ“‹ Model Description


base_model: glogwa68/granite-4.0-h-tiny-DISTILL-OPUS-4.5-think library_name: gguf license: apache-2.0 language:
  • en
  • fr
tags:
  • granite
  • gguf
  • quantized
  • llama.cpp
  • ollama
quantized_by: llama.cpp pipeline_tag: text-generation

granite-4.0-h-tiny-DISTILL-OPUS-4.5-think-GGUF

GGUF quantized versions of granite-4.0-h-tiny-DISTILL-OPUS-4.5-think

Available Formats

FilenameSizeQuant TypeDescription
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-f16.gguf13.22 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-F16
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q2k.gguf2.46 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-Q2K
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q3kl.gguf3.42 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-Q3K_L
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q3km.gguf3.18 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-Q3K_M
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q3ks.gguf2.91 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-Q3K_S
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q40.gguf3.77 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-Q40
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q41.gguf4.18 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-Q41
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q4km.gguf4.02 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-Q4K_M
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q4ks.gguf3.80 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-Q4K_S
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q50.gguf4.58 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-Q50
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q51.gguf4.98 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-Q51
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q5km.gguf4.71 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-Q5K_M
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q5ks.gguf4.58 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-Q5K_S
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q6k.gguf5.44 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-Q6K
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q80.gguf7.04 GBGRANITE-4.0-H-TINY-DISTILL-4.5-OPUS-HIGH-THINK-Q80

Quick Start

Ollama

# Use Q4KM (recommended)
ollama run hf.co/glogwa68/granite-4.0-h-tiny-DISTILL-OPUS-4.5-think-GGUF:Q4KM

Or other quantizations

ollama run hf.co/glogwa68/granite-4.0-h-tiny-DISTILL-OPUS-4.5-think-GGUF:Q8_0 ollama run hf.co/glogwa68/granite-4.0-h-tiny-DISTILL-OPUS-4.5-think-GGUF:Q2_K

llama.cpp

# Download and run
llama-cli --hf-repo glogwa68/granite-4.0-h-tiny-DISTILL-OPUS-4.5-think-GGUF --hf-file granite-4.0-h-tiny-distill-opus-4.5-think-q4km.gguf -p "Hello, how are you?"

With server

llama-server --hf-repo glogwa68/granite-4.0-h-tiny-DISTILL-OPUS-4.5-think-GGUF --hf-file granite-4.0-h-tiny-distill-opus-4.5-think-q4km.gguf -c 2048

LM Studio / GPT4All

Download the .gguf file of your choice and load it in your application.

Quantization Details

TypeBitsUse Case
Q2_K2Extreme compression, low quality
Q3KM3Very compressed
Q4KM4Recommended - Best size/quality
Q5KM5High quality
Q6_K6Very high quality
Q8_08Near lossless
F1616Original precision

Original Model

This is the quantized version of granite-4.0-h-tiny-DISTILL-OPUS-4.5-think

  • Base Model: ibm-granite/granite-4.0-h-tiny
  • Fine-tuning Dataset: TeichAI/claude-4.5-opus-high-reasoning-250x
  • Special Feature: Thinking/Reasoning with tags

πŸ“‚ GGUF File List

πŸ“ Filename πŸ“¦ Size ⚑ Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-f16.gguf
LFS FP16
13.22 GB Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q2_k.gguf
LFS Q2
2.46 GB Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q3_k_l.gguf
LFS Q3
3.42 GB Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q3_k_m.gguf
LFS Q3
3.18 GB Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q3_k_s.gguf
LFS Q3
2.91 GB Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q4_0.gguf
Recommended LFS Q4
3.77 GB Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q4_1.gguf
LFS Q4
4.18 GB Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q4_k_m.gguf
LFS Q4
4.02 GB Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q4_k_s.gguf
LFS Q4
3.8 GB Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q5_0.gguf
LFS Q5
4.58 GB Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q5_1.gguf
LFS Q5
4.98 GB Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q5_k_m.gguf
LFS Q5
4.71 GB Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q5_k_s.gguf
LFS Q5
4.58 GB Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q6_k.gguf
LFS Q6
5.44 GB Download
granite-4.0-h-tiny-DISTILL-4.5-opus-high-think-q8_0.gguf
LFS Q8
7.04 GB Download