πŸ“‹ Model Description


pipeline_tag: image-text-to-text base_model:
  • Qwen/Qwen3-VL-4B-Instruct

Qwen3-VL-4B-Instruct

Run Qwen3-VL-4B-Instruct optimized for CPU/GPU with NexaSDK.

Quickstart

  1. Install NexaSDK
  2. Run the model locally with one line of code:
nexa infer NexaAI/Qwen3-VL-4B-Instruct-GGUF

Model Description

Qwen3-VL-4B-Instruct is a 4-billion-parameter instruction-tuned multimodal large language model from Alibaba Cloud’s Qwen team. As part of the Qwen3-VL series, it fuses powerful vision-language understanding with conversational fine-tuning, optimized for real-world applications such as chat-based reasoning, document analysis, and visual dialogue.

The Instruct variant is tuned for following user prompts naturally and safely β€” producing concise, relevant, and user-aligned responses across text, image, and video contexts.

Features

  • Instruction-Following: Optimized for dialogue, explanation, and user-friendly task completion.
  • Vision-Language Fusion: Understands and reasons across text, images, and video frames.
  • Multilingual Capability: Handles multiple languages for diverse global use cases.
  • Contextual Coherence: Balances reasoning ability with natural, grounded conversational tone.
  • Lightweight & Deployable: 4B parameters make it efficient for edge and device-level inference.

Use Cases

  • Visual chatbots and assistants
  • Image captioning and scene understanding
  • Chart, document, or screenshot analysis
  • Educational or tutoring systems with visual inputs
  • Multilingual, multimodal question answering

Inputs and Outputs

Input:
  • Text prompts, image(s), or mixed multimodal instructions.

Output:

  • Natural-language responses or visual reasoning explanations.
  • Can return structured text (summaries, captions, answers, etc.) depending on the prompt.

License

Refer to the official Qwen license for terms of use and redistribution.

πŸ“‚ GGUF File List

πŸ“ Filename πŸ“¦ Size ⚑ Download
Qwen3-VL-4B-Instruct.F16.gguf
LFS FP16
7.49 GB Download
Qwen3-VL-4B-Instruct.Q4_0.gguf
Recommended LFS Q4
2.11 GB Download
Qwen3-VL-4B-Instruct.Q4_K.gguf
LFS Q4
2.11 GB Download
Qwen3-VL-4B-Instruct.Q5_K.gguf
LFS Q5
2.58 GB Download
Qwen3-VL-4B-Instruct.Q6_K.gguf
LFS Q6
3.07 GB Download
Qwen3-VL-4B-Instruct.Q8_0.gguf
LFS Q8
3.98 GB Download
mmproj.F16.gguf
LFS FP16
797.45 MB Download
mmproj.F32.gguf
LFS
1.55 GB Download
mmproj.Q8_0.gguf
LFS Q8
428.54 MB Download