πŸ“‹ Model Description

Quantization made by Richard Erkhov.

  • Github
  • Discord
  • Request more models

llama-3.1-minitron-6b-width-base-chatml - GGUF

  • Model creator: https://huggingface.co/dumping-grounds/
  • Original model: https://huggingface.co/dumping-grounds/llama-3.1-minitron-6b-width-base-chatml/

| Name | Quant method | Size |
| ---- | ------------ | ---- |
| llama-3.1-minitron-6b-width-base-chatml.Q2_K.gguf | Q2_K | 2.36GB |
| llama-3.1-minitron-6b-width-base-chatml.IQ3_XS.gguf | IQ3_XS | 2.6GB |
| llama-3.1-minitron-6b-width-base-chatml.IQ3_S.gguf | IQ3_S | 2.72GB |
| llama-3.1-minitron-6b-width-base-chatml.Q3_K_S.gguf | Q3_K_S | 2.7GB |
| llama-3.1-minitron-6b-width-base-chatml.IQ3_M.gguf | IQ3_M | 2.82GB |
| llama-3.1-minitron-6b-width-base-chatml.Q3_K.gguf | Q3_K | 2.97GB |
| llama-3.1-minitron-6b-width-base-chatml.Q3_K_M.gguf | Q3_K_M | 2.97GB |
| llama-3.1-minitron-6b-width-base-chatml.Q3_K_L.gguf | Q3_K_L | 3.21GB |
| llama-3.1-minitron-6b-width-base-chatml.IQ4_XS.gguf | IQ4_XS | 3.32GB |
| llama-3.1-minitron-6b-width-base-chatml.Q4_0.gguf (recommended) | Q4_0 | 3.44GB |
| llama-3.1-minitron-6b-width-base-chatml.IQ4_NL.gguf | IQ4_NL | 3.48GB |
| llama-3.1-minitron-6b-width-base-chatml.Q4_K_S.gguf | Q4_K_S | 3.46GB |
| llama-3.1-minitron-6b-width-base-chatml.Q4_K.gguf | Q4_K | 3.62GB |
| llama-3.1-minitron-6b-width-base-chatml.Q4_K_M.gguf | Q4_K_M | 3.62GB |
| llama-3.1-minitron-6b-width-base-chatml.Q4_1.gguf | Q4_1 | 3.79GB |
| llama-3.1-minitron-6b-width-base-chatml.Q5_0.gguf | Q5_0 | 4.14GB |
| llama-3.1-minitron-6b-width-base-chatml.Q5_K_S.gguf | Q5_K_S | 4.14GB |
| llama-3.1-minitron-6b-width-base-chatml.Q5_K.gguf | Q5_K | 4.23GB |
| llama-3.1-minitron-6b-width-base-chatml.Q5_K_M.gguf | Q5_K_M | 4.23GB |
| llama-3.1-minitron-6b-width-base-chatml.Q5_1.gguf | Q5_1 | 4.49GB |
| llama-3.1-minitron-6b-width-base-chatml.Q6_K.gguf | Q6_K | 4.88GB |
| llama-3.1-minitron-6b-width-base-chatml.Q8_0.gguf | Q8_0 | 6.32GB |

Original model description:



```yaml
base_model:
  - IntervitensInc/Llama-3.1-Minitron-4B-Width-Base-chatml
library_name: transformers
tags:
  - mergekit
  - merge
```

merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the passthrough merge method.

Models Merged

The following models were included in the merge:

  • IntervitensInc/Llama-3.1-Minitron-4B-Width-Base-chatml

Configuration

The following YAML configuration was used to produce this model:

```yaml
slices:
  - sources:
      - model: IntervitensInc/Llama-3.1-Minitron-4B-Width-Base-chatml
        layer_range: [0, 24]
  - sources: # add middle layers with residuals scaled to zero
      - model: IntervitensInc/Llama-3.1-Minitron-4B-Width-Base-chatml
        layer_range: [8, 24]
        parameters:
          scale:
            - filter: o_proj
              value: 0.0
            - filter: down_proj
              value: 0.0
            - value: 1.0
  - sources:
      - model: IntervitensInc/Llama-3.1-Minitron-4B-Width-Base-chatml
        layer_range: [24, 32]
merge_method: passthrough
dtype: bfloat16
```
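The comment in the config above explains the trick behind this kind of depth-upscaling: the duplicated middle layers have their o_proj and down_proj outputs scaled to 0.0, so at initialization each duplicated block contributes nothing to the residual stream and the merged 6B model behaves like the original 4B base. A minimal NumPy sketch of that idea (schematic only, not mergekit code; `W_attn` and `W_mlp` are stand-ins for the attention and MLP sublayers):

```python
import numpy as np

rng = np.random.default_rng(0)
W_attn = rng.normal(size=(8, 8))  # stand-in for attention + o_proj
W_mlp = rng.normal(size=(8, 8))   # stand-in for MLP + down_proj

def layer(x, o_scale=1.0, down_scale=1.0):
    # Schematic pre-norm residual block: each sublayer adds its
    # (scaled) output back onto the residual stream.
    x = x + o_scale * (W_attn @ x)
    x = x + down_scale * (W_mlp @ x)
    return x

x = rng.normal(size=8)

# With both scales at 0.0, the duplicated layer reduces to an identity
# map, so stacking extra copies leaves the model's function unchanged.
assert np.allclose(layer(x, o_scale=0.0, down_scale=0.0), x)
```

The duplicated layers are therefore "free" extra capacity that only starts to matter once the scales are trained away from zero.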
