AaryanK/Youtu-LLM-2B-GGUF

Name: AaryanK/Youtu-LLM-2B-GGUF
Author: AaryanK

High-quality GGUF model

2.3K 📥 Downloads

11 ❤️ Likes

15 📁 GGUF Files

20.61 GB 💾 Total Size

2 months ago 🔄 Last Updated

📋 Model Description

base_model: tencent/Youtu-LLM-2B basemodelrelation: quantized language: - en - zh library_name: gguf license: other license_name: youtu-llm license_link: https://huggingface.co/tencent/Youtu-LLM-2B/blob/main/LICENSE.txt pipeline_tag: text-generation tags: - text-generation-inference - agent - code - 128k - mla - gguf

Youtu-LLM-2B-GGUF

Description

This repository contains GGUF format model files for Tencent's Youtu-LLM-2B.

Youtu-LLM-2B is a highly efficient 1.96B parameter model featuring a Dense MLA architecture and a native 128K context window. Despite its small size, it supports Agentic capabilities and "Reasoning Mode" (Chain of Thought), outperforming many larger models in STEM, coding, and agentic benchmarks.

Evaluation Results

Comparison between Youtu-LLM-2B and baselines

Files & Quantization

To see the available files, please verify the Files and versions tab.

How to Run (llama.cpp)

Note: This model uses the Dense MLA architecture. Please ensure you are using the latest version of llama.cpp to support this architecture correctly.

Recommended Parameters:
This model supports two modes. Adjust your temperature accordingly:

Reasoning Mode (CoT): Temperature 1.0 (Recommended for complex logic/math).
Normal Mode: Temperature 0.7 (Recommended for chat/stability).
Context: -c (Supports up to 131072).

CLI Example

./llama-cli -m Youtu-LLM-2B.Q4KM.gguf \
  -c 8192 \
  --temp 1.0 \
  --top-p 0.95 \
  -p "User: Explain the theory of relativity.\nAssistant:" \
  -cnv

Server Example

./llama-server -m Youtu-LLM-2B.Q4KM.gguf \
  --port 8080 \
  --host 0.0.0.0 \
  -c 16384 \
  -ngl 99

📂 GGUF File List

📁 Filename	📦 Size	⚡ Download
Youtu-LLM-2B.fp16.gguf LFS FP16	3.66 GB	Download
Youtu-LLM-2B.q2_k.gguf LFS Q2	801.81 MB	Download
Youtu-LLM-2B.q3_k_l.gguf LFS Q3	1.02 GB	Download
Youtu-LLM-2B.q3_k_m.gguf LFS Q3	982.09 MB	Download
Youtu-LLM-2B.q3_k_s.gguf LFS Q3	911.09 MB	Download
Youtu-LLM-2B.q4_0.gguf Recommended LFS Q4	1.1 GB	Download
Youtu-LLM-2B.q4_1.gguf LFS Q4	1.19 GB	Download
Youtu-LLM-2B.q4_k_m.gguf LFS Q4	1.15 GB	Download
Youtu-LLM-2B.q4_k_s.gguf LFS Q4	1.11 GB	Download
Youtu-LLM-2B.q5_0.gguf LFS Q5	1.29 GB	Download
Youtu-LLM-2B.q5_1.gguf LFS Q5	1.39 GB	Download
Youtu-LLM-2B.q5_k_m.gguf LFS Q5	1.32 GB	Download
Youtu-LLM-2B.q5_k_s.gguf LFS Q5	1.3 GB	Download
Youtu-LLM-2B.q6_k.gguf LFS Q6	1.51 GB	Download
Youtu-LLM-2B.q8_0.gguf LFS Q8	1.95 GB	Download

📊 Model Information

🆔 Model ID: AaryanK/Youtu-LLM-2B-GGUF

📅 Created: 2 months ago

🔄 Last Updated: 2 months ago

📥 Downloads: 2.3K

❤️ Likes: 11

🎯 Difficulty: Intermediate

⚙️ Quantization: FP16, Q2, Q3, Q4, Q5, Q6, Q8

🏷️ Tags

gguftext-generation-inferenceagentcode128kmlatext-generationenzhbase_model:tencent/Youtu-LLM-2Bbase_model:quantized:tencent/Youtu-LLM-2Blicense:otherendpoints_compatibleregion:usconversational

🔗 Related Links

🤗 Visit HuggingFace ⚡ Quick Download