An fp8_e4m3fn conversion of the text encoder at https://huggingface.co/Comfy-Org/ltx-2/blob/main/split_files/text_encoders/gemma_3_12B_it.safetensors, which is used by LTX-2.

gemma_3_12B_it_fp8_e4m3fn.safetensors - the fp8-converted text encoder (from the Comfy-Org release above); goes in the CLIP folder
ltx-2-19b-dev-fp4_projections_only.safetensors - projections extracted from the LTX-2 model so it can be loaded with the DualClipLoader node; goes in the CLIP folder
ltx-2-19b-dev-fp4_video_vae.safetensors - the video VAE; can be loaded with the VaeLoader node; goes in the VAE folder
ltx-2-19b-dev-fp4_vocoder.safetensors - the vocoder model; not currently useful on its own

The video VAE and vocoder currently seem to have issues; both are optional.
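
The files can be fetched with huggingface_hub; a minimal sketch, assuming a placeholder repo id (replace it with this repository's actual id) and the default ComfyUI folder layout:

```python
from huggingface_hub import hf_hub_download

# Placeholder repo id -- substitute the id of this repository.
REPO_ID = "user/ltx-2-text-encoder-fp8"

# Text encoder and projections go in ComfyUI's CLIP folder.
hf_hub_download(REPO_ID, "gemma_3_12B_it_fp8_e4m3fn.safetensors",
                local_dir="ComfyUI/models/clip")
hf_hub_download(REPO_ID, "ltx-2-19b-dev-fp4_projections_only.safetensors",
                local_dir="ComfyUI/models/clip")

# Optional: the video VAE goes in the VAE folder.
hf_hub_download(REPO_ID, "ltx-2-19b-dev-fp4_video_vae.safetensors",
                local_dir="ComfyUI/models/vae")
```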

Usage

If your ComfyUI workflow uses the original fp16 gemma_3_12B_it model, simply select the fp8 text encoder from this repo instead.

ComfyUI's memory offloading currently seems to have issues with the text encoder when it is loaded by the LTX-2 text encoder loader node. As a workaround (if you're getting an OOM error), launch ComfyUI with the --novram flag. This slightly slows down generation, so I recommend reverting it once a fix has been released.
Neither --novram nor --lowvram is needed if you use the DualClipLoader, as it can be set to run on the CPU only.

Usage with DualClipLoader (with projections)

Use the vanilla ComfyUI DualClipLoader node and select gemma_3_12B_it_fp8_e4m3fn.safetensors and ltx-2-19b-dev-fp4_projections_only.safetensors from this repo as the two clip models. Then replace the LTXV Audio Text Encoder Loader node with the DualClipLoader node, as in the sketch below.
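
For reference, a minimal sketch of the node in ComfyUI's API (JSON) workflow format, written here as a Python dict. The input names follow the vanilla DualCLIPLoader node; the exact "type" value and the availability of a "device" input depend on your ComfyUI version, so treat both as assumptions:

```python
# One node of an API-format workflow (node id and graph wiring omitted).
dual_clip_loader = {
    "class_type": "DualCLIPLoader",
    "inputs": {
        "clip_name1": "gemma_3_12B_it_fp8_e4m3fn.safetensors",
        "clip_name2": "ltx-2-19b-dev-fp4_projections_only.safetensors",
        "type": "ltxv",   # assumption: pick the LTX option your build lists
        "device": "cpu",  # keeps the text encoder off the GPU (see note above)
    },
}
```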
