NexaAI/Qwen3-VL-4B-Instruct-GGUF

Name: NexaAI/Qwen3-VL-4B-Instruct-GGUF
Author: NexaAI

High-quality GGUF model

3.1K 📥 Downloads

28 ❤️ Likes

9 📁 GGUF Files

24.08 GB 💾 Total Size

2 months ago 🔄 Last Updated

📋 Model Description

pipeline_tag: image-text-to-text base_model:

Qwen/Qwen3-VL-4B-Instruct

Qwen3-VL-4B-Instruct

Run Qwen3-VL-4B-Instruct optimized for CPU/GPU with NexaSDK.

Quickstart

Install NexaSDK
Run the model locally with one line of code:

nexa infer NexaAI/Qwen3-VL-4B-Instruct-GGUF

Model Description

Qwen3-VL-4B-Instruct is a 4-billion-parameter instruction-tuned multimodal large language model from Alibaba Cloud’s Qwen team. As part of the Qwen3-VL series, it fuses powerful vision-language understanding with conversational fine-tuning, optimized for real-world applications such as chat-based reasoning, document analysis, and visual dialogue.

The Instruct variant is tuned for following user prompts naturally and safely — producing concise, relevant, and user-aligned responses across text, image, and video contexts.

Features

Instruction-Following: Optimized for dialogue, explanation, and user-friendly task completion.
Vision-Language Fusion: Understands and reasons across text, images, and video frames.
Multilingual Capability: Handles multiple languages for diverse global use cases.
Contextual Coherence: Balances reasoning ability with natural, grounded conversational tone.
Lightweight & Deployable: 4B parameters make it efficient for edge and device-level inference.

Use Cases

Visual chatbots and assistants
Image captioning and scene understanding
Chart, document, or screenshot analysis
Educational or tutoring systems with visual inputs
Multilingual, multimodal question answering

Inputs and Outputs

Input:

Text prompts, image(s), or mixed multimodal instructions.

Output:

Natural-language responses or visual reasoning explanations.
Can return structured text (summaries, captions, answers, etc.) depending on the prompt.

License

Refer to the official Qwen license for terms of use and redistribution.

📂 GGUF File List

📁 Filename	📦 Size	⚡ Download
Qwen3-VL-4B-Instruct.F16.gguf LFS FP16	7.49 GB	Download
Qwen3-VL-4B-Instruct.Q4_0.gguf Recommended LFS Q4	2.11 GB	Download
Qwen3-VL-4B-Instruct.Q4_K.gguf LFS Q4	2.11 GB	Download
Qwen3-VL-4B-Instruct.Q5_K.gguf LFS Q5	2.58 GB	Download
Qwen3-VL-4B-Instruct.Q6_K.gguf LFS Q6	3.07 GB	Download
Qwen3-VL-4B-Instruct.Q8_0.gguf LFS Q8	3.98 GB	Download
mmproj.F16.gguf LFS FP16	797.45 MB	Download
mmproj.F32.gguf LFS	1.55 GB	Download
mmproj.Q8_0.gguf LFS Q8	428.54 MB	Download

📊 Model Information

🆔 Model ID: NexaAI/Qwen3-VL-4B-Instruct-GGUF

📅 Created: 3 months ago

🔄 Last Updated: 2 months ago

📥 Downloads: 3.1K

❤️ Likes: 28

🎯 Difficulty: Intermediate

⚙️ Quantization: FP16, Q4, Q5, Q6, Q8

🏷️ Tags

ggufimage-text-to-textbase_model:Qwen/Qwen3-VL-4B-Instructbase_model:quantized:Qwen/Qwen3-VL-4B-Instructregion:us

🔗 Related Links

🤗 Visit HuggingFace ⚡ Quick Download