πŸ“‹ Model Description


base_model: HuggingFaceTB/SmolLM2-135M-Instruct language:
  • en
library_name: transformers license: apache-2.0 tags:
  • llama
  • unsloth
  • transformers

Finetune SmolLM2, Llama 3.2, Gemma 2, Mistral 2-5x faster with 70% less memory via Unsloth!

We have a free Google Colab Tesla T4 notebook for Llama 3.2 (3B) here: https://colab.research.google.com/drive/1Ys44kVvmeZtnICzWz0xgpRnrIOjZAuxp?usp=sharing


unsloth/SmolLM2-135M-Instruct-GGUF

For more details on the model, please go to Hugging Face's original model card

✨ Finetune for Free

All notebooks are beginner friendly! Add your dataset, click "Run All", and you'll get a 2x faster finetuned model which can be exported to GGUF, vLLM or uploaded to Hugging Face.

Unsloth supportsFree NotebooksPerformanceMemory use
Llama-3.2 (3B)▢️ Start on Colab2.4x faster58% less
Llama-3.2 (11B vision)▢️ Start on Colab2.4x faster58% less
Llama-3.1 (8B)▢️ Start on Colab2.4x faster58% less
Phi-3.5 (mini)▢️ Start on Colab2x faster50% less
Gemma 2 (9B)▢️ Start on Colab2.4x faster58% less
Mistral (7B)▢️ Start on Colab2.2x faster62% less
DPO - Zephyr▢️ Start on Colab1.9x faster19% less

Special Thanks

A huge thank you to the Hugging Face team for creating and releasing these models.

Model Summary

SmolLM2 is a family of compact language models available in three size: 135M, 360M, and 1.7B parameters. They are capable of solving a wide range of tasks while being lightweight enough to run on-device.

The 1.7B variant demonstrates significant advances over its predecessor SmolLM1-1.7B, particularly in instruction following, knowledge, reasoning, and mathematics. It was trained on 11 trillion tokens using a diverse dataset combination: FineWeb-Edu, DCLM, The Stack, along with new mathematics and coding datasets that we curated and will release soon. We developed the instruct version through supervised fine-tuning (SFT) using a combination of public datasets and our own curated datasets. We then applied Direct Preference Optimization (DPO) using UltraFeedback.

The instruct model additionally supports tasks such as text rewriting, summarization and function calling thanks to datasets developed by Argilla such as Synth-APIGen-v0.1.

SmolLM2

!image/png

πŸ“‚ GGUF File List

πŸ“ Filename πŸ“¦ Size ⚑ Download
SmolLM2-135M-Instruct-F16.gguf
LFS FP16
258.34 MB Download
SmolLM2-135M-Instruct-Q2_K.gguf
LFS Q2
84.12 MB Download
SmolLM2-135M-Instruct-Q3_K_M.gguf
LFS Q3
89.18 MB Download
SmolLM2-135M-Instruct-Q4_K_M.gguf
Recommended LFS Q4
100.57 MB Download
SmolLM2-135M-Instruct-Q5_K_M.gguf
LFS Q5
106.91 MB Download
SmolLM2-135M-Instruct-Q6_K.gguf
LFS Q6
131.97 MB Download
SmolLM2-135M-Instruct-Q8_0.gguf
LFS Q8
138.1 MB Download