π Model Description
pipeline_tag: image-text-to-text base_model:
- Qwen/Qwen3-VL-4B-Instruct
Qwen3-VL-4B-Instruct
Run Qwen3-VL-4B-Instruct optimized for CPU/GPU with NexaSDK.
Quickstart
- Install NexaSDK
- Run the model locally with one line of code:
nexa infer NexaAI/Qwen3-VL-4B-Instruct-GGUF
Model Description
Qwen3-VL-4B-Instruct is a 4-billion-parameter instruction-tuned multimodal large language model from Alibaba Cloudβs Qwen team. As part of the Qwen3-VL series, it fuses powerful vision-language understanding with conversational fine-tuning, optimized for real-world applications such as chat-based reasoning, document analysis, and visual dialogue.The Instruct variant is tuned for following user prompts naturally and safely β producing concise, relevant, and user-aligned responses across text, image, and video contexts.
Features
- Instruction-Following: Optimized for dialogue, explanation, and user-friendly task completion.
- Vision-Language Fusion: Understands and reasons across text, images, and video frames.
- Multilingual Capability: Handles multiple languages for diverse global use cases.
- Contextual Coherence: Balances reasoning ability with natural, grounded conversational tone.
- Lightweight & Deployable: 4B parameters make it efficient for edge and device-level inference.
Use Cases
- Visual chatbots and assistants
- Image captioning and scene understanding
- Chart, document, or screenshot analysis
- Educational or tutoring systems with visual inputs
- Multilingual, multimodal question answering
Inputs and Outputs
Input:- Text prompts, image(s), or mixed multimodal instructions.
Output:
- Natural-language responses or visual reasoning explanations.
- Can return structured text (summaries, captions, answers, etc.) depending on the prompt.
License
Refer to the official Qwen license for terms of use and redistribution.π GGUF File List
| π Filename | π¦ Size | β‘ Download |
|---|---|---|
|
Qwen3-VL-4B-Instruct.F16.gguf
LFS
FP16
|
7.49 GB | Download |
|
Qwen3-VL-4B-Instruct.Q4_0.gguf
Recommended
LFS
Q4
|
2.11 GB | Download |
|
Qwen3-VL-4B-Instruct.Q4_K.gguf
LFS
Q4
|
2.11 GB | Download |
|
Qwen3-VL-4B-Instruct.Q5_K.gguf
LFS
Q5
|
2.58 GB | Download |
|
Qwen3-VL-4B-Instruct.Q6_K.gguf
LFS
Q6
|
3.07 GB | Download |
|
Qwen3-VL-4B-Instruct.Q8_0.gguf
LFS
Q8
|
3.98 GB | Download |
|
mmproj.F16.gguf
LFS
FP16
|
797.45 MB | Download |
|
mmproj.F32.gguf
LFS
|
1.55 GB | Download |
|
mmproj.Q8_0.gguf
LFS
Q8
|
428.54 MB | Download |