---
license: other
language:
  - en
tags:
  - causal-lm
  - code
metrics:
  - code_eval
library_name: transformers
model-index:
  - name: stabilityai/stable-code-instruct-3b
    results:
      - task:
          type: text-generation
        dataset:
          type: nuprl/MultiPL-E
          name: MultiPL-HumanEval (Python)
        metrics:
          - name: pass@1
            type: pass@1
            value: 32.4
            verified: false
      - task:
          type: text-generation
        dataset:
          type: nuprl/MultiPL-E
          name: MultiPL-HumanEval (C++)
        metrics:
          - name: pass@1
            type: pass@1
            value: 30.9
            verified: false
      - task:
          type: text-generation
        dataset:
          type: nuprl/MultiPL-E
          name: MultiPL-HumanEval (Java)
        metrics:
          - name: pass@1
            type: pass@1
            value: 32.1
            verified: false
      - task:
          type: text-generation
        dataset:
          type: nuprl/MultiPL-E
          name: MultiPL-HumanEval (JavaScript)
        metrics:
          - name: pass@1
            type: pass@1
            value: 32.1
            verified: false
      - task:
          type: text-generation
        dataset:
          type: nuprl/MultiPL-E
          name: MultiPL-HumanEval (PHP)
        metrics:
          - name: pass@1
            type: pass@1
            value: 24.2
            verified: false
      - task:
          type: text-generation
        dataset:
          type: nuprl/MultiPL-E
          name: MultiPL-HumanEval (Rust)
        metrics:
          - name: pass@1
            type: pass@1
            value: 23.0
            verified: false
---

# Stable Code Instruct 3B

Try it out here: https://huggingface.co/spaces/stabilityai/stable-code-instruct-3b


## Model Description

stable-code-instruct-3b is a 2.7 billion parameter decoder-only language model tuned from stable-code-3b. The model was trained on a mix of publicly available and synthetic datasets using Direct Preference Optimization (DPO).

This instruct tune demonstrates state-of-the-art performance (compared to models of similar size) on the MultiPL-E metrics across multiple programming languages, tested using BigCode's Evaluation Harness, and on the code portions of MT-Bench.

The model is finetuned to make it usable in tasks like:
- General purpose code/software engineering conversations.
- SQL-related generation and conversation.
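For the SQL use case, a request can be framed as an ordinary chat message list, in the same shape the Usage section below feeds to the tokenizer. A minimal sketch (the schema and question here are hypothetical, not from the model card):

```python
# Hypothetical example: framing a SQL generation request as chat messages.
# The schema and question are illustrative only.
schema = """CREATE TABLE orders (
    id INTEGER PRIMARY KEY,
    customer_id INTEGER,
    total REAL,
    created_at TEXT
);"""

messages = [
    {"role": "system", "content": "You are a helpful SQL assistant."},
    {
        "role": "user",
        "content": (
            f"Given this schema:\n{schema}\n"
            "Write a query returning each customer's total spend, highest first."
        ),
    },
]
```

The resulting list plugs directly into `tokenizer.apply_chat_template(...)` as in the Usage section.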

Please note: For commercial use, please refer to https://stability.ai/license.

## Usage

Here's how you can run the model:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("stabilityai/stable-code-instruct-3b", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("stabilityai/stable-code-instruct-3b", torch_dtype=torch.bfloat16, trust_remote_code=True)
model.eval()
model = model.cuda()

messages = [
    {
        "role": "system",
        "content": "You are a helpful and polite assistant",
    },
    {
        "role": "user",
        "content": "Write a simple website in HTML. When a user clicks the button, it shows a random joke from a list of 4 jokes."
    },
]

prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True, tokenize=False)

inputs = tokenizer([prompt], return_tensors="pt").to(model.device)

tokens = model.generate(
    **inputs,
    max_new_tokens=1024,
    temperature=0.5,
    top_p=0.95,
    top_k=100,
    do_sample=True,
    use_cache=True,
)

output = tokenizer.batch_decode(tokens[:, inputs.input_ids.shape[-1]:], skip_special_tokens=False)[0]
```
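If you want to see what `apply_chat_template` produces without loading the tokenizer, the prompt appears to follow the ChatML-style `<|im_start|>`/`<|im_end|>` convention; the helper below is a sketch under that assumption (verify against the model's `tokenizer_config.json` before relying on it):

```python
# Sketch of a ChatML-style prompt builder. Assumption: the model's chat
# template uses <|im_start|>/<|im_end|> delimiters; confirm against the
# tokenizer config shipped with the model.
def build_chatml_prompt(messages, add_generation_prompt=True):
    parts = []
    for m in messages:
        parts.append(f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n")
    if add_generation_prompt:
        # Open the assistant turn so generation continues from here.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)

prompt = build_chatml_prompt([
    {"role": "system", "content": "You are a helpful and polite assistant"},
    {"role": "user", "content": "Say hello."},
])
```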

## Model Details

### Performance

Multi-PL Benchmark:

| Model | Size | Avg | Python | C++ | JavaScript | Java | PHP | Rust |
|---|---|---|---|---|---|---|---|---|
| Codellama Instruct | 7B | 0.30 | 0.33 | 0.31 | 0.31 | 0.29 | 0.31 | 0.25 |
| Deepseek Instruct | 1.3B | 0.44 | 0.52 | 0.52 | 0.41 | 0.46 | 0.45 | 0.28 |
| Stable Code Instruct (SFT) | 3B | 0.44 | 0.55 | 0.45 | 0.42 | 0.42 | 0.44 | 0.32 |
| Stable Code Instruct (DPO) | 3B | 0.47 | 0.59 | 0.49 | 0.49 | 0.44 | 0.45 | 0.37 |
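The MultiPL-E scores above are pass@1 values. As general background (not code from this model card), the standard unbiased pass@k estimator from the HumanEval evaluation can be computed like this:

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator: n generated samples, c of them correct.

    Returns the probability that at least one of k randomly drawn
    samples (without replacement) passes the tests.
    """
    if n - c < k:
        # Fewer failures than draws: at least one success is guaranteed.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)

# With one sample per problem (n=1, k=1), pass@1 reduces to the plain
# fraction of problems solved.
```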

MT-Bench Coding:

| Model | Size | Score |
|---|---|---|
| DeepSeek Coder | 1.3B | 4.6 |
| Stable Code Instruct (DPO) | 3B | 5.8 (ours) |
| Stable Code Instruct (SFT) | 3B | 5.5 |
| DeepSeek Coder | 6.7B | 6.9 |
| CodeLlama Instruct | 7B | 3.55 |
| StarChat2 | 15B | 5.7 |

### SQL Performance

| Model | Size | Date | Group By | Order By | Ratio | Join | Where |
|---|---|---|---|---|---|---|---|
| Stable Code Instruct (DPO) | 3B | 24.0% | 54.2% | 68.5% | 40.0% | 54.2% | 42.8% |
| DeepSeek-Coder Instruct | 1.3B | 24.0% | 37.1% | 51.4% | 34.3% | 45.7% | 45.7% |
| SQLCoder | 7B | 64.0% | 82.9% | 74.3% | 54.3% | 74.3% | 74.3% |

## How to Cite

```
@misc{stable-code-instruct-3b,
      url={https://huggingface.co/stabilityai/stable-code-3b},
      title={Stable Code 3B},
      author={Phung, Duy and Pinnaparaju, Nikhil and Adithyan, Reshinth and Zhuravinskyi, Maksym and Tow, Jonathan and Cooper, Nathan}
}
```

## GGUF File List

| Filename | Quantization | Size |
|---|---|---|
| stable-code-3b-q4_k_m.gguf | Q4_K_M (recommended) | 1.59 GB |
| stable-code-3b-q5_k_m.gguf | Q5_K_M | 1.86 GB |
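As a rough sanity check, the listed sizes are consistent with a ~2.7B-parameter model at each quantization's approximate effective bits per weight. The bits-per-weight figures below are ballpark values for llama.cpp K-quants (an assumption, not from this card):

```python
# Rough GGUF size estimate: parameters * effective bits-per-weight / 8.
# The bpw values are approximate for llama.cpp K-quants (assumption);
# real files also carry metadata and some higher-precision tensors.
def estimated_gguf_gb(n_params: float, bits_per_weight: float) -> float:
    return n_params * bits_per_weight / 8 / 1e9

q4_est = estimated_gguf_gb(2.7e9, 4.85)  # Q4_K_M ~ 4.85 bpw
q5_est = estimated_gguf_gb(2.7e9, 5.7)   # Q5_K_M ~ 5.7 bpw
# Both estimates land in the neighborhood of the listed 1.59 GB and 1.86 GB.
```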