πŸ“‹ Model Description


library_name: transformers tags:
  • falcon-h1
  • edge
license: other license_name: falcon-llm-license license_link: https://falconllm.tii.ae/falcon-terms-and-conditions.html

drawing

Table of Contents

  1. TL;DR
  2. Model Details
  3. Training Details
  4. Usage
  5. Evaluation
  6. Citation

TL;DR

Model Details

Model Description

  • Developed by: https://www.tii.ae
  • Model type: Causal decoder-only
  • Architecture: Hybrid Transformers + Mamba architecture
  • Language(s) (NLP): English
  • Number of Parameters: 90M
  • License: Falcon-LLM License

Training details

For more details about the training protocol of this model, please refer to the Falcon-H1-Tiny technical blogpost.

Usage

Currently to use this model you can either rely on Hugging Face transformers, vLLM, sglang, llama.cpp, ollama or mlx library.

Inference

πŸ€— transformers

Refer to the snippet below to run H1 models using πŸ€— transformers:

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "tiiuae/Falcon-H1-Tiny-90M-Instruct-pre-DPO"

model = AutoModelForCausalLM.from_pretrained(
model_id,
torch_dtype=torch.bfloat16,
device_map="auto"
)

Perform text generation

or

transformers serve tiiuae/Falcon-H1-Tiny-90M-Instruct-pre-DPO

llama.cpp

You can find all GGUF files compatible with llama.cpp under [our official collection]() - an example setup could be:

brew install llama.cpp 
pip install huggingface_hub 
hf download tiiuae/Falcon-H1-Tiny-90M-Instruct-pre-DPO Falcon-H1-Tiny-90M-Instruct-pre-DPO-Q8_0.gguf --local-dir ./ 
llama-cli ./ Falcon-H1-Tiny-90M-Instruct-pre-DPO-Q8_0.gguf -cnv

ollama

ollama run hf.co/tiiuae/Falcon-H1-Tiny-90M-Instruct-GGUF:Q8_0

Apple mlx

mlx_lm.chat --model tiiuae/Tiny-H1-SF

vLLM

For vLLM, simply start a server by executing the command below:

# pip install vllm>=0.9.0
vllm serve tiiuae/Falcon-H1-Tiny-90M-Instruct-pre-DPO --tensor-parallel-size 2 --data-parallel-size 1

sglang

python -m sglang.launch_server \
  --model ttiiuae/Falcon-H1-Tiny-90M-Instruct-pre-DPO \
  --tensor-parallel-size 1

Evaluation

For detailed evaluation of Tiny-H1 series, please refer to our technical blogpost

Useful links

Citation

If the Falcon-H1-Tiny family of models were helpful to your work, feel free to give us a cite.

@misc{falconh1tiny,
  title={Falcon-H1-Tiny: A series of extremely small, yet powerful language models redefining capabilities at small scale},
  author={Falcon-LLM Team},
  year={2026}, 
}

πŸ“‚ GGUF File List

πŸ“ Filename πŸ“¦ Size ⚑ Download
Falcon-H1-Tiny-90M-Instruct-BF16.gguf
LFS FP16
175.15 MB Download
Falcon-H1-Tiny-90M-Instruct-IQ2_M.gguf
LFS Q2
12.08 MB Download
Falcon-H1-Tiny-90M-Instruct-IQ3_M.gguf
LFS Q3
46.28 MB Download
Falcon-H1-Tiny-90M-Instruct-IQ3_S.gguf
LFS Q3
45.33 MB Download
Falcon-H1-Tiny-90M-Instruct-IQ3_XS.gguf
LFS Q3
44.35 MB Download
Falcon-H1-Tiny-90M-Instruct-IQ3_XXS.gguf
LFS Q3
12.08 MB Download
Falcon-H1-Tiny-90M-Instruct-IQ4_NL.gguf
LFS Q4
54.86 MB Download
Falcon-H1-Tiny-90M-Instruct-IQ4_XS.gguf
LFS Q4
52.73 MB Download
Falcon-H1-Tiny-90M-Instruct-Q2_K.gguf
LFS Q2
39.82 MB Download
Falcon-H1-Tiny-90M-Instruct-Q3_K.gguf
LFS Q3
47.39 MB Download
Falcon-H1-Tiny-90M-Instruct-Q3_K_L.gguf
LFS Q3
49.39 MB Download
Falcon-H1-Tiny-90M-Instruct-Q3_K_M.gguf
LFS Q3
47.39 MB Download
Falcon-H1-Tiny-90M-Instruct-Q3_K_S.gguf
LFS Q3
45.13 MB Download
Falcon-H1-Tiny-90M-Instruct-Q4_0.gguf
Recommended LFS Q4
54.53 MB Download
Falcon-H1-Tiny-90M-Instruct-Q4_1.gguf
LFS Q4
58.96 MB Download
Falcon-H1-Tiny-90M-Instruct-Q4_K.gguf
LFS Q4
55.89 MB Download
Falcon-H1-Tiny-90M-Instruct-Q4_K_M.gguf
LFS Q4
55.89 MB Download
Falcon-H1-Tiny-90M-Instruct-Q4_K_S.gguf
LFS Q4
54.7 MB Download
Falcon-H1-Tiny-90M-Instruct-Q5_0.gguf
LFS Q5
63.38 MB Download
Falcon-H1-Tiny-90M-Instruct-Q5_1.gguf
LFS Q5
67.8 MB Download
Falcon-H1-Tiny-90M-Instruct-Q5_K.gguf
LFS Q5
64.08 MB Download
Falcon-H1-Tiny-90M-Instruct-Q5_K_M.gguf
LFS Q5
64.08 MB Download
Falcon-H1-Tiny-90M-Instruct-Q5_K_S.gguf
LFS Q5
63.38 MB Download
Falcon-H1-Tiny-90M-Instruct-Q6_K.gguf
LFS Q6
72.78 MB Download
Falcon-H1-Tiny-90M-Instruct-Q8_0.gguf
LFS Q8
93.8 MB Download
Falcon-H1-Tiny-90M-Instruct-TQ1_0.gguf
LFS
29.65 MB Download
Falcon-H1-Tiny-90M-Instruct-TQ2_0.gguf
LFS Q2
32.97 MB Download