πŸ“‹ Model Description

Quantization made by Richard Erkhov.

  • Github
  • Discord
  • Request more models

Qwen2.5-32B-Instruct-abliterated - GGUF

  • Model creator: https://huggingface.co/huihui-ai/
  • Original model: https://huggingface.co/huihui-ai/Qwen2.5-32B-Instruct-abliterated/

| Name | Quant method | Size |
|------|--------------|------|
| Qwen2.5-32B-Instruct-abliterated.Q2_K.gguf | Q2_K | 11.47GB |
| Qwen2.5-32B-Instruct-abliterated.IQ3_XS.gguf | IQ3_XS | 12.76GB |
| Qwen2.5-32B-Instruct-abliterated.IQ3_S.gguf | IQ3_S | 13.45GB |
| Qwen2.5-32B-Instruct-abliterated.Q3_K_S.gguf | Q3_K_S | 13.4GB |
| Qwen2.5-32B-Instruct-abliterated.IQ3_M.gguf | IQ3_M | 13.79GB |
| Qwen2.5-32B-Instruct-abliterated.Q3_K.gguf | Q3_K | 14.84GB |
| Qwen2.5-32B-Instruct-abliterated.Q3_K_M.gguf | Q3_K_M | 14.84GB |
| Qwen2.5-32B-Instruct-abliterated.Q3_K_L.gguf | Q3_K_L | 16.06GB |
| Qwen2.5-32B-Instruct-abliterated.IQ4_XS.gguf | IQ4_XS | 16.64GB |
| Qwen2.5-32B-Instruct-abliterated.Q4_0.gguf | Q4_0 (recommended) | 17.36GB |
| Qwen2.5-32B-Instruct-abliterated.IQ4_NL.gguf | IQ4_NL | 17.53GB |
| Qwen2.5-32B-Instruct-abliterated.Q4_K_S.gguf | Q4_K_S | 17.49GB |
| Qwen2.5-32B-Instruct-abliterated.Q4_K.gguf | Q4_K | 18.49GB |
| Qwen2.5-32B-Instruct-abliterated.Q4_K_M.gguf | Q4_K_M | 18.49GB |
| Qwen2.5-32B-Instruct-abliterated.Q4_1.gguf | Q4_1 | 19.22GB |
| Qwen2.5-32B-Instruct-abliterated.Q5_0.gguf | Q5_0 | 21.08GB |
| Qwen2.5-32B-Instruct-abliterated.Q5_K_S.gguf | Q5_K_S | 21.08GB |
| Qwen2.5-32B-Instruct-abliterated.Q5_K.gguf | Q5_K | 21.66GB |
| Qwen2.5-32B-Instruct-abliterated.Q5_K_M.gguf | Q5_K_M | 21.66GB |
| Qwen2.5-32B-Instruct-abliterated.Q5_1.gguf | Q5_1 | 22.95GB |
| Qwen2.5-32B-Instruct-abliterated.Q6_K.gguf | Q6_K | 25.04GB |
| Qwen2.5-32B-Instruct-abliterated.Q8_0.gguf | Q8_0 | 32.43GB |
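
As a rough guide for choosing among these files, the sketch below (a hypothetical helper, not part of this repo) picks the largest quant whose file fits a given memory budget, using the sizes from the table above. Actual memory use is somewhat higher than the file size because of the KV cache and runtime overhead, so the helper reserves configurable headroom.

```python
# Hypothetical helper: pick the largest quant that fits a memory budget.
# Sizes (GB) are copied from the table above; duplicate-size aliases
# (e.g. Q4_K vs. Q4_K_M) are collapsed to one entry.
QUANT_SIZES_GB = {
    "Q2_K": 11.47, "IQ3_XS": 12.76, "IQ3_S": 13.45, "Q3_K_S": 13.4,
    "IQ3_M": 13.79, "Q3_K_M": 14.84, "Q3_K_L": 16.06, "IQ4_XS": 16.64,
    "Q4_0": 17.36, "Q4_K_S": 17.49, "IQ4_NL": 17.53, "Q4_K_M": 18.49,
    "Q4_1": 19.22, "Q5_0": 21.08, "Q5_K_S": 21.08, "Q5_K_M": 21.66,
    "Q5_1": 22.95, "Q6_K": 25.04, "Q8_0": 32.43,
}

def pick_quant(budget_gb: float, headroom_gb: float = 2.0):
    """Return the largest quant whose file fits in budget_gb minus headroom,
    or None if nothing fits."""
    usable = budget_gb - headroom_gb
    candidates = [(size, name) for name, size in QUANT_SIZES_GB.items()
                  if size <= usable]
    return max(candidates)[1] if candidates else None

print(pick_quant(24.0))  # e.g. a 24 GB GPU -> Q5_K_M
print(pick_quant(16.0))  # e.g. 16 GB of RAM -> IQ3_M
```

With 2 GB of headroom, a 24 GB card lands on Q5_K_M (21.66 GB) and a 16 GB machine on IQ3_M (13.79 GB); tune the headroom to your context length and runtime.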

Original model description:



library_name: transformers
license: apache-2.0
license_link: https://huggingface.co/huihui-ai/Qwen2.5-32B-Instruct-abliterated/blob/main/LICENSE
language:
- en
pipeline_tag: text-generation
base_model: Qwen/Qwen2.5-32B-Instruct
tags:
- chat
- abliterated
- uncensored


huihui-ai/Qwen2.5-32B-Instruct-abliterated

This is an uncensored version of Qwen2.5-32B-Instruct created with abliteration (see this article to learn more about it).

Special thanks to @FailSpy for the original code and technique. Please follow him if you're interested in abliterated models.

Usage

You can use this model in your applications by loading it with Hugging Face's transformers library:
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the model and tokenizer
model_name = "huihui-ai/Qwen2.5-32B-Instruct-abliterated"
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Initialize conversation context
initial_messages = [
    {"role": "system", "content": "You are Qwen, created by Alibaba Cloud. You are a helpful assistant."}
]
messages = initial_messages.copy()  # Copy the initial conversation context

# Enter conversation loop
while True:
    # Get user input, stripping leading and trailing spaces
    user_input = input("User: ").strip()

    # If the user types '/exit', end the conversation
    if user_input.lower() == "/exit":
        print("Exiting chat.")
        break

    # If the user types '/clean', reset the conversation context
    if user_input.lower() == "/clean":
        messages = initial_messages.copy()
        print("Chat history cleared. Starting a new conversation.")
        continue

    # If input is empty, prompt the user and continue
    if not user_input:
        print("Input cannot be empty. Please enter something.")
        continue

    # Add user input to the conversation
    messages.append({"role": "user", "content": user_input})

    # Build the chat template
    text = tokenizer.apply_chat_template(
        messages,
        tokenize=False,
        add_generation_prompt=True
    )

    # Tokenize input and prepare it for the model
    model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

    # Generate a response from the model
    generated_ids = model.generate(
        **model_inputs,
        max_new_tokens=8192
    )

    # Keep only the newly generated tokens, dropping the prompt
    generated_ids = [
        output_ids[len(input_ids):]
        for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
    ]
    response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]

    # Add the model's response to the conversation
    messages.append({"role": "assistant", "content": response})

    # Print the model's response
    print(f"Qwen: {response}")
```
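
The loop above delegates prompt formatting to `tokenizer.apply_chat_template`, whose authoritative definition ships with the tokenizer. For Qwen2.5 the packaged template is ChatML-style; the sketch below only illustrates, under that assumption, roughly what text the call produces with `add_generation_prompt=True`.

```python
# Illustration only: approximates the ChatML-style text that Qwen2.5's
# chat template produces. The real template comes from the tokenizer.
def chatml_format(messages):
    # Each message becomes <|im_start|>role\ncontent<|im_end|>
    parts = [f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n"
             for m in messages]
    # add_generation_prompt=True appends an open assistant turn,
    # so generation continues as the assistant
    parts.append("<|im_start|>assistant\n")
    return "".join(parts)

prompt = chatml_format([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
])
print(prompt)
```

After generation, everything the model emits up to its end-of-turn token is the assistant reply, which is why the loop slices off the prompt tokens before decoding.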
