πŸ“‹ Model Description


base_model:
  • black-forest-labs/FLUX.1-dev
pipeline_tag: text-to-image library_name: gguf license: other license_name: flux-1-dev-non-commercial-license tags:
  • gguf
  • flux
  • text-to-image
  • imatrix

Supported?

Expect broken or faulty items for the time being. Use at your own discretion.

  • ComfyUI-GGUF: all? (CPU/CUDA)
- Fast dequant: BF16, Q80, Q51, Q50, Q41, Q40, Q6K, Q5K, Q4K, Q3K, Q2K - Slow dequant: others via GGUF/NumPy - CPU: all - Cuda: all? - Vulkan: >= Q3KS, > IQ4S; PR IQ1S, IQ1M PR IQ4XS - other: ?

Disco

Dynamic quantization:

  • timein.inlayer: Q80/Q6K
  • finallayer, vectorin.inlayer, guidancein: Q80
  • vectorin.outlayer, timein.outlayer, txtin, imgin: F16
  • singleblocks.[> 10 && < 37].modulation.lin: one down?

FilenameQuant typeFile SizeDescription / L2 Loss Step 25Example Image

Caesar

Combined imatrix multiple images 512x512 and 768x768, 25, 30 and 50 steps city96/flux1-dev-Q8_0 euler

data: loadimatrix: loaded 314 importance matrix entries from imatrixcaesar.dat computed on 475 chunks

Using llama.cpp quantize cae9fb4 with modified lcpp.patch.

Dynamic quantization:

  • imgin, guidancein.inlayer, finallayer.linear: f32/bf16/f16
  • guidancein, finallayer: bf16/f16
  • imgattn.qkv, linear1: some layers two bits up
  • txtmod.lin, txtmlp, txtattn.proj: some layers one bit down

Experimental from f16

FilenameQuant typeFile SizeDescription / L2 Loss Step 25Example Image
flux1-dev-IQ1S.ggufIQ1S2.41GBworst / 173Example
flux1-dev-TQ10.ggufTQ102.64GBworst / 195Example
flux1-dev-IQ1M.ggufIQ1M2.72GBworst / 171Example
flux1-dev-IQ2XXS.ggufIQ2XXS3.10GBworst * / 126Example
flux1-dev-TQ20.ggufTQ203.12GBworst / 202Example
flux1-dev-IQ2XS.ggufIQ2XS3.48GBworst / 140Example
flux1-dev-IQ2S.ggufIQ2S3.51GBworst / 142Example
flux1-dev-IQ2M.ggufIQ2M3.84GBbad / 120Example
flux1-dev-Q2KS.ggufQ2KS4.00GBok * / 52Example
flux1-dev-Q2K.ggufQ2K4.03GBok / 55Example
flux1-dev-IQ3XXS.ggufIQ3XXS4.56GBok / 92Example
flux1-dev-IQ3XS.ggufIQ3XS5.05GBbad / 125Example
flux1-dev-Q3KS.ggufQ3KS5.10GBok / 48Example
flux1-dev-IQ3S.ggufIQ3S5.11GBbad / 123Example
flux1-dev-Q3KM.ggufQ3KM5.13GBok / 50Example
flux1-dev-IQ3M.ggufIQ3M5.14GBbad / 123Example
flux1-dev-Q3KL.ggufQ3KL5.17GBok / 61Example
flux1-dev-IQ4XS.ggufIQ4XS6.33GBgood / 33Example
flux1-dev-Q4KS.ggufQ4KS6.66GBgood / 22Example
flux1-dev-Q4KM.ggufQ4KM6.69GBgood / 21Example
flux1-dev-IQ4NL.ggufIQ4NL6.69GBgood / 24Example
flux1-dev-Q40.ggufQ406.81GBgood / 30Example
flux1-dev-Q41.ggufQ417.55GBgood / 27Example
flux1-dev-Q5KS.ggufQ5KS8.26GBnice / 21Example
flux1-dev-Q50.ggufQ508.27GBgood / 30Example
flux1-dev-Q5KM.ggufQ5KM8.30GBnice / 23Example
flux1-dev-Q51.ggufQ518.99GBnice * / 14Example
flux1-dev-Q6K.ggufQ6K9.80GBnice / 20Example
flux1-dev-Q80.ggufQ8012.3GBnear perfect * / 8Example
-F1623.8GBreferenceExample
FilenameBits img_attn.qkv & linear1
flux1-dev-IQ1_S.gguf333M MMMM M111 ... 11MM MM11
flux1-dev-TQ1_0.gguf3332 2222 2111 ... 1122 2211
flux1-dev-IQ1_M.gguf3332 2222 2111 ... 1122 2211
flux1-dev-IQ2_XXS.gguf4433 3333 3222 ... 2222
flux1-dev-TQ2_0.gguf3332 2222 2111 ... 1122 2211
flux1-dev-IQ2_XS.gguf4443 3333 3222 ... 2233 3322
flux1-dev-IQ2_S.gguf4444 4444 4444 4444 4433 3222 ... 2233 3322
flux1-dev-IQ2_M.gguf4444 4444 4444 4444 4433 3222 ... 2223 3333 3322
flux1-dev-Q2K_S.gguf4443 3333 3222 ... 2222
flux1-dev-Q2_K.gguf4443 3333 3222 ... 2233 3322
flux1-dev-IQ3_XXS.gguf444S SSSS S333 ... 3333
flux1-dev-IQ3_XS.gguf444S SSSS S333 ... 33SS SS33
flux1-dev-Q3K_S.gguf5554 4444 4333 ... 3333
flux1-dev-IQ3_S.gguf5554 4444 4333 ... 3344 4433
flux1-dev-Q3K_M.gguf5554 4444 4333 ... 3344 4433
flux1-dev-IQ3_M.gguf5554 4444 4444 4444 4433 ... 3344 4433
flux1-dev-Q3K_L.gguf5554 4444 4444 4444 4433 ... 3344 4433
flux1-dev-IQ4_XS.gguf8885 5555 5444 ... 4444
flux1-dev-Q4K_S.gguf8885 5555 5444 ... 4444
flux1-dev-Q4K_M.gguf8885 5555 5555 5555 5544 ... 4444
flux1-dev-IQ4_NL.gguf8885 5555 5555 5555 5544 ... 4444
flux1-dev-Q4_0.gguf8885 5555 5444 ... 4444
flux1-dev-Q4_1.gguf8885 5555 5444 ... 4444
flux1-dev-Q5K_S.ggufFFF6 6666 6666 6666 6655 ... 5555
flux1-dev-Q5_0.ggufFFF8 8888 8555 ... 5555
flux1-dev-Q5K_M.ggufFFF8 8888 8666 6666 6655 ... 5555
flux1-dev-Q5_1.ggufFFF8 8888 8555 ... 5555
flux1-dev-Q6_K.ggufFFF8 8888 8666 .. 6666
flux1-dev-Q8_0.ggufFFF8 8888 .. 8888

Observations

  • More imatrix data doesn't necessarily result in better quants
  • I-quants worse than same bits k-quants?
  • Quant-dequant loss

Bravo

Combined imatrix multiple images 512x512 25 and 50 steps city96/flux1-dev-Q8_0 euler

Using llama.cpp quantize cae9fb4 with modified lcpp.patch.

Experimental from f16

FilenameQuant typeFile SizeDescription / L2 Loss Step 25Example Image
flux1-dev-IQ1S.ggufIQ1S2.45GBworst / 156Example
flux1-dev-IQ1M.ggufIQ1M2.72GBworst / 141Example
flux1-dev-IQ2XXS.ggufIQ2XXS3.19GBworst / 131Example
flux1-dev-IQ2XS.ggufIQ2XS3.56GBworst / 125-
flux1-dev-IQ2S.ggufIQ2S3.56GBworst / 125-
flux1-dev-IQ2M.ggufIQ2M3.93GBworst / 120Example
flux1-dev-Q2KS.ggufQ2KS4.02GBok / 56Example
flux1-dev-IQ3XXS.ggufIQ3XXS4.66GBTBC / 68Example
flux1-dev-IQ3XS.ggufIQ3XS5.22GBworse than IQ3XXS / 115Example
flux1-dev-IQ3S.ggufIQ3STBCTBC-
flux1-dev-IQ3M.ggufIQ3MTBCTBC-
flux1-dev-Q3KS.ggufQ3KS5.22GBTBC / 34Example
flux1-dev-IQ4XS.ggufIQ4XS6.42GBTBC / 25-
flux1-dev-Q40.ggufQ406.79GBTBC / 31-
flux1-dev-IQ4NL.ggufIQ4NL6.79GBTBC / 21Example
flux1-dev-Q4KS.ggufQ4KS6.79GBTBC / 29Example
flux1-dev-Q41.ggufQ417.53GBTBC / 24-
flux1-dev-Q50.ggufQ508.27GBTBC / 25-
flux1-dev-Q51.ggufQ51TBCTBC / 24-
flux1-dev-Q5KS.ggufQ5KS8.27GBTBC / 20Example
flux1-dev-Q6K.ggufQ6K9.84GBTBC / 19Example
flux1-dev-Q80.ggufQ80-TBC / 10-
-F1623.8GBreferenceExample

Observations

Alpha

Simple imatrix: 512x512 single image 8/20 steps city96/flux1-dev-Q3K_S euler

data: load_imatrix: loaded 314 importance matrix entries from imatrix.dat computed on 7 chunks.

Using llama.cpp quantize cae9fb4 with modified lcpp.patch.

Experimental from q8

FilenameQuant typeFile SizeDescription / L2 Loss Step 25Example Image
flux1-dev-IQ1S.ggufIQ1S2.45GBworst / 152Example
-IQ1_M-broken-
flux1-dev-TQ10.ggufTQ102.63GBTBC / 220-
flux1-dev-TQ20.ggufTQ203.19GBTBC / 220-
flux1-dev-IQ2XXS.ggufIQ2XXS3.19GBworst / 130Example
flux1-dev-IQ2XS.ggufIQ2XS3.56GBworst / 129Example
flux1-dev-IQ2S.ggufIQ2S3.56GBworst / 129-
flux1-dev-IQ2M.ggufIQ2M3.93GBworst / 121-
flux1-dev-Q2K.ggufQ2K4.02GBTBC / 77-
flux1-dev-Q2KS.ggufQ2KS4.02GBok / 77Example
flux1-dev-IQ3XXS.ggufIQ3XXS4.66GBTBC / 130Example
flux1-dev-IQ3XS.ggufIQ3XS5.22GBTBC / 114-
flux1-dev-IQ3S.ggufIQ3S5.22GBTBC / 114-
flux1-dev-IQ3M.ggufIQ3M5.22GBTBC / 114-
flux1-dev-Q3KS.ggufQ3KS5.22GBTBC / 36Example
flux1-dev-Q3KM.ggufQ3K_M5.36GBTBC / 42-
flux1-dev-Q3KL.ggufQ3K_L5.36GBTBC / 42-
flux1-dev-IQ4XS.ggufIQ4XS6.42GBTBC / 30Example
flux1-dev-IQ4NL.ggufIQ4NL6.79GBTBC / 23Example
flux1-dev-Q40.ggufQ406.79GBTBC / 27-
-Q4_KTBCTBC / 27-
flux1-dev-Q4KS.ggufQ4KS6.79GBTBC / 26Example
flux1-dev-Q4KM.ggufQ4K_M6.93GBTBC / 27-
flux1-dev-Q41.ggufQ417.53GBTBC / 23-
flux1-dev-Q5KS.ggufQ5KS8.27GBTBC / 19Example
flux1-dev-Q5K.ggufQ5K8.41GBTBC / 20-
-Q5KMTBCTBC-
flux1-dev-Q6K.ggufQ6K9.84GBTBC / 22-
-Q8012.7GBnear perfect / 10Example
-F1623.8GBreferenceExample

Observations

Sub-quants not diferentiated as expected: IQ2XS == IQ2S, IQ3XS == IQ3S == IQ3M, Q3KM == Q3K_L.

  • Check if lcpp_sd3.patch includes more specific quant level logic
  • Extrapolate the existing level logic

πŸ“‚ GGUF File List

No GGUF files available