Running the LTX 2.3 22B transformer-only model in ComfyUI produces completely garbled output (pure noise, no recognizable content) regardless of prompt.
Setup:
- Model: ltx-2.3-22b-dev (transformer only)
- VAE: Video VAE + Audio VAE loaded separately
- Text encoder: gemma_3_12B_it_fp8_e4m3fn.safetensors (loaded via GGUF CLIP loader node)
- Text projection: ltx-2.3_text_projection_bf16.safetensors
- ComfyUI version: 0.19.1
- GPU / VRAM: RTX 4060 Ti, 16 GB
- OS: Windows 11
What I expected
A generated video matching the prompt
What I got
Completely garbled frames: uniform noise across the entire output, every time
What I have tried
- Multiple different workflows from YouTube tutorials
- Switching the Gemma text encoder from the fp4_mixed variant to fp8_e4m3fn (this resolved a shape mismatch error, but the output is still garbage)
- Multiple different prompts
I would appreciate any help.
Thank you all.