📋 Model Description

Quantization made by Richard Erkhov.

Phi-Delthanar - GGUF

  • Model creator: https://huggingface.co/Replete-AI/
  • Original model: https://huggingface.co/Replete-AI/Phi-Delthanar/

| Name | Quant method | Size |
|---|---|---|
| Phi-Delthanar.Q2_K.gguf | Q2_K | 2.82GB |
| Phi-Delthanar.IQ3_XS.gguf | IQ3_XS | 3.05GB |
| Phi-Delthanar.IQ3_S.gguf | IQ3_S | 3.19GB |
| Phi-Delthanar.Q3_K_S.gguf | Q3_K_S | 3.19GB |
| Phi-Delthanar.IQ3_M.gguf | IQ3_M | 3.38GB |
| Phi-Delthanar.Q3_K.gguf | Q3_K | 3.67GB |
| Phi-Delthanar.Q3_K_M.gguf | Q3_K_M | 3.67GB |
| Phi-Delthanar.Q3_K_L.gguf | Q3_K_L | 4.09GB |
| Phi-Delthanar.IQ4_XS.gguf | IQ4_XS | 3.96GB |
| Phi-Delthanar.Q4_0.gguf | Q4_0 | 4.14GB |
| Phi-Delthanar.IQ4_NL.gguf | IQ4_NL | 4.17GB |
| Phi-Delthanar.Q4_K_S.gguf | Q4_K_S | 4.18GB |
| Phi-Delthanar.Q4_K.gguf | Q4_K | 4.51GB |
| Phi-Delthanar.Q4_K_M.gguf | Q4_K_M | 4.51GB |
| Phi-Delthanar.Q4_1.gguf | Q4_1 | 4.58GB |
| Phi-Delthanar.Q5_0.gguf | Q5_0 | 5.03GB |
| Phi-Delthanar.Q5_K_S.gguf | Q5_K_S | 5.03GB |
| Phi-Delthanar.Q5_K.gguf | Q5_K | 5.22GB |
| Phi-Delthanar.Q5_K_M.gguf | Q5_K_M | 5.22GB |
| Phi-Delthanar.Q5_1.gguf | Q5_1 | 5.48GB |
| Phi-Delthanar.Q6_K.gguf | Q6_K | 5.98GB |
| Phi-Delthanar.Q8_0.gguf | Q8_0 | 7.74GB |
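
One way to use the sizes listed above is to pick the largest file that fits a memory budget. The sketch below is only an illustration: the `largest_fitting` helper and the 1 GB default headroom are hypothetical, and file size is just a lower bound on actual memory use, since context and runtime buffers add to it.

```python
from typing import Optional

# File sizes in GB, copied from the quantization list above.
QUANT_SIZES_GB = {
    "Q2_K": 2.82, "IQ3_XS": 3.05, "IQ3_S": 3.19, "Q3_K_S": 3.19,
    "IQ3_M": 3.38, "Q3_K": 3.67, "Q3_K_M": 3.67, "Q3_K_L": 4.09,
    "IQ4_XS": 3.96, "Q4_0": 4.14, "IQ4_NL": 4.17, "Q4_K_S": 4.18,
    "Q4_K": 4.51, "Q4_K_M": 4.51, "Q4_1": 4.58, "Q5_0": 5.03,
    "Q5_K_S": 5.03, "Q5_K": 5.22, "Q5_K_M": 5.22, "Q5_1": 5.48,
    "Q6_K": 5.98, "Q8_0": 7.74,
}


def largest_fitting(budget_gb: float, headroom_gb: float = 1.0) -> Optional[str]:
    """Return the largest quant whose file fits in budget_gb minus headroom_gb.

    headroom_gb is an assumed allowance for context and runtime buffers,
    not a measured figure. Returns None when nothing fits.
    """
    usable = budget_gb - headroom_gb
    candidates = {name: size for name, size in QUANT_SIZES_GB.items() if size <= usable}
    if not candidates:
        return None
    # Break size ties by name so the result is deterministic.
    return max(candidates, key=lambda name: (candidates[name], name))


print(largest_fitting(6.0))  # Q4_1
```

A larger file generally means less quantization loss, so choosing by size is a reasonable first pass, but it says nothing about speed or quality on a given backend.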

Original model description:



license: mit
language: en
thumbnail: "https://cdn-uploads.huggingface.co/production/uploads/6589d7e6586088fd2784a12c/iYImJKf2HZZZJ9IwDSN00.png"


The forest is with you.


Phi-Delthanar is named after the method used to create it: interleaving the layers of its predecessor to produce a far larger model with much more potential.

Del'thanar is said to be an ancient treant, and I couldn't think of a better name for a model that was created using the passthrough method.

By concatenating layers from different LLMs, the passthrough method can produce models with an exotic number of parameters (e.g., 9B from two 7B-parameter models). These models are often referred to as "frankenmerges" or "Frankenstein models" by the community.
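
To make the layer concatenation concrete, here is a minimal sketch of how a passthrough layer plan can be expanded. It is not the script used for this model: the `passthrough_plan` helper, the 32-layer source, and the two overlapping ranges are all hypothetical, chosen only to show how stacking slices yields a deeper model.

```python
from typing import List, Tuple

# One slice is (source_name, start_layer, end_layer), with end exclusive.
Slice = Tuple[str, int, int]


def passthrough_plan(slices: List[Slice]) -> List[Tuple[str, int]]:
    """Expand slices into the ordered (source, layer_index) list of the merged stack.

    Layers are copied unchanged and stacked in the order given; no weights
    are averaged, which is what distinguishes passthrough from other merges.
    """
    plan: List[Tuple[str, int]] = []
    for source, start, end in slices:
        if not 0 <= start < end:
            raise ValueError(f"invalid layer range for {source}: [{start}, {end})")
        plan.extend((source, index) for index in range(start, end))
    return plan


# Hypothetical self-merge of a 32-layer model using two overlapping ranges.
slices = [("base", 0, 24), ("base", 8, 32)]
plan = passthrough_plan(slices)

print(len(plan))  # 48
```

Because layers 8 to 23 appear twice, the merged stack is deeper than the 32-layer source, which is where the unusual parameter counts come from.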

Many thanks to Abacaj for providing the fine-tuned weights that were used in the creation of this base model. You can find the full script for how the model was merged here. Thanks also to KatyTheCutie for inspiring me to test out this script.

This idea was brought to me by The Face of Goonery, also known as Caleb Morgan. I have him to thank if fine-tuning this model turns out to be a success. He also helped me make this model even larger than the prior one.

How to run inference:

    import torch
    import transformers

    if __name__ == "__main__":
        model_name = "Replete-AI/Phi-Delthanar"
        tokenizer = transformers.AutoTokenizer.from_pretrained(model_name)

        # Load the model onto the first GPU and switch to inference mode.
        model = (
            transformers.AutoModelForCausalLM.from_pretrained(
                model_name,
            )
            .to("cuda:0")
            .eval()
        )

        messages = [
            {"role": "user", "content": "Hello, who are you?"}
        ]
        # Render the chat template and tokenize the prompt in one step.
        inputs = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device)
        # Prompt length, used below to slice the prompt off the output.
        input_ids_cutoff = inputs.size(dim=1)

        with torch.no_grad():
            generated_ids = model.generate(
                input_ids=inputs,
                use_cache=True,
                max_new_tokens=512,
                temperature=0.2,
                top_p=0.95,
                do_sample=True,
                eos_token_id=tokenizer.eos_token_id,
                pad_token_id=tokenizer.pad_token_id,
            )

        # Decode only the newly generated tokens.
        completion = tokenizer.decode(
            generated_ids[0][input_ids_cutoff:],
            skip_special_tokens=True,
        )

        print(completion)

Chat template

The model uses the same chat template as found in Mistral instruct models.
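
As a rough illustration of the Mistral instruct layout, the sketch below renders a conversation by hand. The `format_mistral_style` helper is hypothetical, and the `<s>` and `</s>` defaults are assumptions that may not match this model's tokenizer; exact whitespace also varies between template versions. The authoritative rendering is whatever `tokenizer.apply_chat_template(messages, tokenize=False)` returns for this model.

```python
from typing import Dict, List


def format_mistral_style(
    messages: List[Dict[str, str]],
    bos: str = "<s>",   # assumed BOS token; check the model's tokenizer
    eos: str = "</s>",  # assumed EOS token; check the model's tokenizer
) -> str:
    """Render alternating user/assistant turns in a Mistral-instruct-style layout."""
    parts = [bos]
    for index, message in enumerate(messages):
        expected_role = "user" if index % 2 == 0 else "assistant"
        if message["role"] != expected_role:
            raise ValueError("roles must alternate user/assistant, starting with user")
        if message["role"] == "user":
            # User turns are wrapped in instruction markers.
            parts.append(f"[INST] {message['content']} [/INST]")
        else:
            # Assistant turns are closed with the end-of-sequence token.
            parts.append(f"{message['content']}{eos}")
    return "".join(parts)


prompt = format_mistral_style([{"role": "user", "content": "Hello, who are you?"}])
print(prompt)  # <s>[INST] Hello, who are you? [/INST]
```

Comparing this string with the tokenizer's own output is a quick way to confirm which special tokens the model actually expects.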

📂 GGUF File List

| Filename | Type | Size |
|---|---|---|
| Phi-Delthanar.IQ3_M.gguf | Q3 | 3.38 GB |
| Phi-Delthanar.IQ3_S.gguf | Q3 | 3.19 GB |
| Phi-Delthanar.IQ3_XS.gguf | Q3 | 3.05 GB |
| Phi-Delthanar.IQ4_NL.gguf | Q4 | 4.17 GB |
| Phi-Delthanar.IQ4_XS.gguf | Q4 | 3.96 GB |
| Phi-Delthanar.Q2_K.gguf | Q2 | 2.82 GB |
| Phi-Delthanar.Q3_K.gguf | Q3 | 3.67 GB |
| Phi-Delthanar.Q3_K_L.gguf | Q3 | 4.09 GB |
| Phi-Delthanar.Q3_K_M.gguf | Q3 | 3.67 GB |
| Phi-Delthanar.Q3_K_S.gguf | Q3 | 3.19 GB |
| Phi-Delthanar.Q4_0.gguf | Q4 (Recommended) | 4.14 GB |
| Phi-Delthanar.Q4_1.gguf | Q4 | 4.58 GB |
| Phi-Delthanar.Q4_K.gguf | Q4 | 4.51 GB |
| Phi-Delthanar.Q4_K_M.gguf | Q4 | 4.51 GB |
| Phi-Delthanar.Q4_K_S.gguf | Q4 | 4.18 GB |
| Phi-Delthanar.Q5_0.gguf | Q5 | 5.03 GB |
| Phi-Delthanar.Q5_1.gguf | Q5 | 5.48 GB |
| Phi-Delthanar.Q5_K.gguf | Q5 | 5.22 GB |
| Phi-Delthanar.Q5_K_M.gguf | Q5 | 5.22 GB |
| Phi-Delthanar.Q5_K_S.gguf | Q5 | 5.03 GB |
| Phi-Delthanar.Q6_K.gguf | Q6 | 5.98 GB |
| Phi-Delthanar.Q8_0.gguf | Q8 | 7.74 GB |