πŸ“‹ Model Description


license: apache-2.0 base_model:
  • ACE-Step/ACE-Step-v1-3.5B
pipeline_tag: text-to-audio tags:
  • gguf-node

gguf quantized ace-step-v1-3.5b

  • base model from ace-step
  • full set gguf (model+encoder+vae) works right away

setup (once)

  • drag ace-step to > ./ComfyUI/models/diffusionmodels
  • drag umt5-base to > ./ComfyUI/models/textencoders
  • drag pig to > ./ComfyUI/models/vae

!screenshot

workflow

  • drag json or demo audio below to browser for workflow
PromptAudio Sample
female singing pop music electronic beats fennec core
cute fennec girl
massive fennec ears
big fluffy tail
long blonde wavy hair
large blue eyes
I love fennec girl
🎧 ace-step
female singing pop music electronic beats fennec core
cute pinky pig
massive pinky ears
big fluffy tail
long cutie wavy hair
large blue eyes
I love pinky pig
🎧 ace-audio

review

  • note: as need to keep some key tensors (in f32 status) to make it works; file size might not decrease that much; but load faster than safetensors checkpoint in general (no last minute bottle neck problem)
  • rebuilding umt5-base tokenizer logic applied successfully; upgrade your node to the latest version for umt5-base encoder support; hence, safetensors checkpoint is no longer needed (removed here; if you want it still, you could get it from comfyui-org)
  • get more umt5-base encoder here

bonus: fp8/16/32 scaled stable-audio-open-1.0 with gguf quantized t5_base encoder

  • base model from stabilityai
  • note: this is a different model; don't mix it up; also powerful and lite weight

setup (once)

  • drag t5-base to > ./ComfyUI/models/text_encoders
  • drag safetensors to > ./ComfyUI/models/checkpoints
  • drag pig to > ./ComfyUI/models/vae

!screenshot

PromptAudio Sample
heaven church electronic dance music🎧 stable-audio

review

  • note: the safetensors checkpoint in this repo is an extracted version; only contains model and condition switch tensors (extremely lite weighted); no clip and vae inside; should use it along with separate clip (text encoder) and vae
  • opt to get fp8/16/32 scaled checkpoint with model and vae embedded here
  • get more t5-base encoder here

reference

πŸ“‚ GGUF File List

πŸ“ Filename πŸ“¦ Size ⚑ Download
ace-step-v1-3.5b-f16.gguf
LFS FP16
6.15 GB Download
ace-step-v1-3.5b-f32.gguf
LFS
12.31 GB Download
ace-step-v1-3.5b-fp32-iq4_nl.gguf
LFS Q4
6.27 GB Download
ace-step-v1-3.5b-fp32-iq4_xs.gguf
LFS Q4
6.22 GB Download
ace-step-v1-3.5b-fp32-q4_0.gguf
Recommended LFS Q4
6.27 GB Download
ace-step-v1-3.5b-fp32-q5_0.gguf
LFS Q5
6.49 GB Download
ace-step-v1-3.5b-fp32-q8_0.gguf
LFS Q8
7.15 GB Download
ace-step-v1-3.5b-q2_k.gguf
LFS Q2
5.1 GB Download
ace-step-v1-3.5b-q3_k_l.gguf
LFS Q3
5.35 GB Download
ace-step-v1-3.5b-q3_k_m.gguf
LFS Q3
5.31 GB Download
ace-step-v1-3.5b-q3_k_s.gguf
LFS Q3
5.27 GB Download
ace-step-v1-3.5b-q4_0.gguf
LFS Q4
5.53 GB Download
ace-step-v1-3.5b-q4_1.gguf
LFS Q4
5.66 GB Download
ace-step-v1-3.5b-q4_k_m.gguf
LFS Q4
5.61 GB Download
ace-step-v1-3.5b-q4_k_s.gguf
LFS Q4
5.54 GB Download
ace-step-v1-3.5b-q5_0.gguf
LFS Q5
5.78 GB Download
ace-step-v1-3.5b-q5_1.gguf
LFS Q5
5.9 GB Download
ace-step-v1-3.5b-q5_k_m.gguf
LFS Q5
5.82 GB Download
ace-step-v1-3.5b-q5_k_s.gguf
LFS Q5
5.78 GB Download
ace-step-v1-3.5b-q6_k.gguf
LFS Q6
6.04 GB Download
ace-step-v1-3.5b-q8_0.gguf
LFS Q8
6.52 GB Download
pig_ace_vae_fp32-f16.gguf
LFS FP16
496.42 MB Download
pig_sd_audio_vae_fp32-f16.gguf
LFS FP16
298.03 MB Download
t5_base_fp32-bf16.gguf
LFS FP16
425.27 MB Download
t5_base_fp32-f16.gguf
LFS FP16
425.27 MB Download
t5_base_fp32-q4_k_m.gguf
LFS Q4
119.75 MB Download
umt5base-q4_0.gguf
LFS Q4
205.76 MB Download