πŸ“‹ Model Description


license: apache-2.0 language:
  • en
  • zh
base_model:
  • ByteDance/Dolphin-v2
pipeline_tag: image-text-to-text library_name: transformers tags:
  • text-generation-inference
  • document-parsing
  • document-understanding
  • document-intelligence
  • ocr
  • layout-analysis
  • table-extraction
  • formula-recognition
  • code-extraction
  • vision-language-model
  • multimodal

Dolphin-v2-f32-GGUF

ByteDance Dolphin-v2 is a 3B-parameter vision-language model built on Qwen2.5-VL-3B with Native Resolution Vision Transformer (NaViT) encoder and autoregressive decoder, designed as a universal document parsing solution via a document-type-aware two-stage architecture that classifies digital-born vs. photographed documents before applying hybrid strategiesβ€”element-wise parallel parsing for clean PDFs and holistic parsing for distorted scans. It supports 21 element categories (headings sec0-5, paragraphs, formulas in LaTeX, HTML tables, indented code blocks, figures, lists, etc.) with absolute pixel coordinates for precise localization, achieving state-of-the-art OmniDocBench v1.5 scores of 89.45 overall (+14.78 over original Dolphin), 0.054 edit distance for text/reading order, 86.72% CDM for formulas, and 87.02/90.48 TEDS/TEDS-S for tables at 0.1729 FPS on 8-12GB VRAM GPUs. Specialized modules (Pformula, Pcode, Ptable, P_paragraph) enable structured JSON/Markdown/HTML outputs for privacy-focused local inference in healthcare/legal/finance, outperforming general VLMs in speed (2x faster) and accuracy across distortions, skews, and perspectives.

Dolphin-v2 [GGUF]

File NameQuant TypeFile SizeFile Link
Dolphin-v2.BF16.ggufBF166.18 GBDownload
Dolphin-v2.F32.ggufF3212.3 GBDownload
Dolphin-v2.IQ4XS.ggufIQ4XS1.75 GBDownload
Dolphin-v2.Q2K.ggufQ2K1.27 GBDownload
Dolphin-v2.Q3KL.ggufQ3KL1.71 GBDownload
Dolphin-v2.Q3KM.ggufQ3KM1.59 GBDownload
Dolphin-v2.Q3KS.ggufQ3KS1.45 GBDownload
Dolphin-v2.Q4KM.ggufQ4KM1.93 GBDownload
Dolphin-v2.Q4KS.ggufQ4KS1.83 GBDownload
Dolphin-v2.Q5KM.ggufQ5KM2.22 GBDownload
Dolphin-v2.Q5KS.ggufQ5KS2.17 GBDownload
Dolphin-v2.Q6K.ggufQ6K2.54 GBDownload
Dolphin-v2.Q80.ggufQ803.29 GBDownload
Dolphin-v2.f16.ggufF166.18 GBDownload
Dolphin-v2.i1-IQ1M.ggufi1-IQ1M850 MBDownload
Dolphin-v2.i1-IQ1S.ggufi1-IQ1S791 MBDownload
Dolphin-v2.i1-IQ2M.ggufi1-IQ2M1.14 GBDownload
Dolphin-v2.i1-IQ2S.ggufi1-IQ2S1.06 GBDownload
Dolphin-v2.i1-IQ2XS.ggufi1-IQ2XS1.03 GBDownload
Dolphin-v2.i1-IQ2XXS.ggufi1-IQ2XXS948 MBDownload
Dolphin-v2.i1-IQ3M.ggufi1-IQ3M1.49 GBDownload
Dolphin-v2.i1-IQ3S.ggufi1-IQ3S1.46 GBDownload
Dolphin-v2.i1-IQ3XS.ggufi1-IQ3XS1.39 GBDownload
Dolphin-v2.i1-IQ3XXS.ggufi1-IQ3XXS1.28 GBDownload
Dolphin-v2.i1-IQ4NL.ggufi1-IQ4NL1.83 GBDownload
Dolphin-v2.i1-IQ4XS.ggufi1-IQ4XS1.74 GBDownload
Dolphin-v2.i1-Q2K.ggufi1-Q2K1.27 GBDownload
Dolphin-v2.i1-Q2KS.ggufi1-Q2KS1.2 GBDownload
Dolphin-v2.i1-Q3KL.ggufi1-Q3KL1.71 GBDownload
Dolphin-v2.i1-Q3KM.ggufi1-Q3KM1.59 GBDownload
Dolphin-v2.i1-Q3KS.ggufi1-Q3KS1.45 GBDownload
Dolphin-v2.i1-Q40.ggufi1-Q401.83 GBDownload
Dolphin-v2.i1-Q41.ggufi1-Q412 GBDownload
Dolphin-v2.i1-Q4KM.ggufi1-Q4KM1.93 GBDownload
Dolphin-v2.i1-Q4KS.ggufi1-Q4KS1.83 GBDownload
Dolphin-v2.i1-Q5KM.ggufi1-Q5KM2.22 GBDownload
Dolphin-v2.i1-Q5KS.ggufi1-Q5KS2.17 GBDownload
Dolphin-v2.i1-Q6K.ggufi1-Q6K2.54 GBDownload
Dolphin-v2.imatrix.ggufimatrix3.39 MBDownload
Dolphin-v2.mmproj-Q80.ggufmmproj-Q80848 MBDownload
Dolphin-v2.mmproj-bf16.ggufmmproj-bf161.34 GBDownload
Dolphin-v2.mmproj-f16.ggufmmproj-f161.34 GBDownload
Dolphin-v2.mmproj-f32.ggufmmproj-f322.67 GBDownload

Quants Usage

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Here is a handy graph by ikawrakow comparing some lower-quality quant
types (lower is better):

!image.png

πŸ“‚ GGUF File List

πŸ“ Filename πŸ“¦ Size ⚑ Download
Dolphin-v2.BF16.gguf
LFS FP16
5.75 GB Download
Dolphin-v2.F32.gguf
LFS
11.5 GB Download
Dolphin-v2.IQ4_XS.gguf
LFS Q4
1.63 GB Download
Dolphin-v2.Q2_K.gguf
LFS Q2
1.19 GB Download
Dolphin-v2.Q3_K_L.gguf
LFS Q3
1.59 GB Download
Dolphin-v2.Q3_K_M.gguf
LFS Q3
1.48 GB Download
Dolphin-v2.Q3_K_S.gguf
LFS Q3
1.35 GB Download
Dolphin-v2.Q4_K_M.gguf
Recommended LFS Q4
1.8 GB Download
Dolphin-v2.Q4_K_S.gguf
LFS Q4
1.71 GB Download
Dolphin-v2.Q5_K_M.gguf
LFS Q5
2.07 GB Download
Dolphin-v2.Q5_K_S.gguf
LFS Q5
2.02 GB Download
Dolphin-v2.Q6_K.gguf
LFS Q6
2.36 GB Download
Dolphin-v2.Q8_0.gguf
LFS Q8
3.06 GB Download
Dolphin-v2.f16.gguf
LFS FP16
5.75 GB Download
Dolphin-v2.i1-IQ1_M.gguf
LFS
810.65 MB Download
Dolphin-v2.i1-IQ1_S.gguf
LFS
754.45 MB Download
Dolphin-v2.i1-IQ2_M.gguf
LFS Q2
1.06 GB Download
Dolphin-v2.i1-IQ2_S.gguf
LFS Q2
1012.74 MB Download
Dolphin-v2.i1-IQ2_XS.gguf
LFS Q2
983.76 MB Download
Dolphin-v2.i1-IQ2_XXS.gguf
LFS Q2
904.32 MB Download
Dolphin-v2.i1-IQ3_M.gguf
LFS Q3
1.39 GB Download
Dolphin-v2.i1-IQ3_S.gguf
LFS Q3
1.36 GB Download
Dolphin-v2.i1-IQ3_XS.gguf
LFS Q3
1.3 GB Download
Dolphin-v2.i1-IQ3_XXS.gguf
LFS Q3
1.19 GB Download
Dolphin-v2.i1-IQ4_NL.gguf
LFS Q4
1.7 GB Download
Dolphin-v2.i1-IQ4_XS.gguf
LFS Q4
1.62 GB Download
Dolphin-v2.i1-Q2_K.gguf
LFS Q2
1.19 GB Download
Dolphin-v2.i1-Q2_K_S.gguf
LFS Q2
1.12 GB Download
Dolphin-v2.i1-Q3_K_L.gguf
LFS Q3
1.59 GB Download
Dolphin-v2.i1-Q3_K_M.gguf
LFS Q3
1.48 GB Download
Dolphin-v2.i1-Q3_K_S.gguf
LFS Q3
1.35 GB Download
Dolphin-v2.i1-Q4_0.gguf
LFS Q4
1.7 GB Download
Dolphin-v2.i1-Q4_1.gguf
LFS Q4
1.86 GB Download
Dolphin-v2.i1-Q4_K_M.gguf
LFS Q4
1.8 GB Download
Dolphin-v2.i1-Q4_K_S.gguf
LFS Q4
1.71 GB Download
Dolphin-v2.i1-Q5_K_M.gguf
LFS Q5
2.07 GB Download
Dolphin-v2.i1-Q5_K_S.gguf
LFS Q5
2.02 GB Download
Dolphin-v2.i1-Q6_K.gguf
LFS Q6
2.36 GB Download
Dolphin-v2.imatrix.gguf
LFS
3.24 MB Download
Dolphin-v2.mmproj-Q8_0.gguf
LFS Q8
808.5 MB Download
Dolphin-v2.mmproj-bf16.gguf
LFS FP16
1.25 GB Download
Dolphin-v2.mmproj-f16.gguf
LFS FP16
1.25 GB Download
Dolphin-v2.mmproj-f32.gguf
LFS
2.49 GB Download