runpod

Author	SHA1	Message	Date
Sebastian Krüger	91f6e9bd59	fix: patch DiffRhythm DIT to add missing LlamaConfig attention head parameters All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 15s Details Adds monkey-patch for DiT.__init__() to properly configure LlamaConfig with num_attention_heads and num_key_value_heads parameters, which are missing in the upstream DiffRhythm code. Root cause: transformers 4.49.0+ requires these parameters but DiffRhythm's dit.py only specifies hidden_size, causing the library to incorrectly infer head_dim as 32 instead of 64, leading to tensor dimension mismatches. Solution: - Sets num_attention_heads = hidden_size // 64 (standard Llama architecture) - Sets num_key_value_heads = num_attention_heads // 4 (GQA configuration) - Ensures head_dim = 64, fixing the "tensor a (32) vs tensor b (64)" error This is a proper fix rather than just downgrading transformers version. References: - https://github.com/billwuhao/ComfyUI_DiffRhythm/issues/44 - https://github.com/billwuhao/ComfyUI_DiffRhythm/issues/48 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 18:53:18 +01:00
Sebastian Krüger	8c4eb8c3f1	fix: pin transformers to 4.49.0 for DiffRhythm compatibility All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 13s Details Resolves tensor dimension mismatch error in rotary position embeddings. DiffRhythm requires transformers 4.49.0 - newer versions (4.50+) cause "The size of tensor a (32) must match the size of tensor b (64)" error due to transformer block initialization changes. Updated pivoine_diffrhythm.py documentation to reflect actual root cause and link to upstream GitHub issues #44 and #48. References: - https://github.com/billwuhao/ComfyUI_DiffRhythm/issues/44 - https://github.com/billwuhao/ComfyUI_DiffRhythm/issues/48 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 18:14:40 +01:00
Sebastian Krüger	67d41c3923	fix: patch infer_utils.decode_audio instead of DiffRhythmNode.infer All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 13s Details The correct function to patch is decode_audio from infer_utils module, which is where chunked VAE decoding actually happens. This intercepts the call at the right level to force chunked=False. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 17:28:30 +01:00
Sebastian Krüger	1981b7b256	fix: monkey-patch DiffRhythm infer function to force chunked=False All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s Details The previous approach of overriding diffrhythmgen wasn't working because ComfyUI doesn't pass the chunked parameter when it's not in INPUT_TYPES. This fix monkey-patches the infer() function at module level to always force chunked=False, preventing the tensor dimension mismatch error. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 17:24:22 +01:00
Sebastian Krüger	5096e3ffb5	feat: add Pivoine custom ComfyUI nodes for DiffRhythm All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s Details Add custom node wrapper PivoineDiffRhythmRun that fixes tensor dimension mismatch error by disabling chunked VAE decoding. The original DiffRhythm node's overlap=32 parameter conflicts with the VAE's 64-channel architecture. Changes: - Add comfyui/nodes/pivoine_diffrhythm.py: Custom node wrapper - Add comfyui/nodes/__init__.py: Package initialization - Add arty.yml setup/pivoine-nodes: Deployment script for symlink - Update all 4 DiffRhythm workflows to use PivoineDiffRhythmRun Technical details: - Inherits from DiffRhythmRun to avoid upstream patching - Forces chunked=False in diffrhythmgen() override - Requires more VRAM (~12-16GB) but RTX 4090 has 24GB - Category: 🌸Pivoine/Audio for easy identification 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 16:28:54 +01:00
Sebastian Krüger	073711c017	fix: use correct DiffRhythm parameter order from UI testing All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s Details Correct widgets_values order (11 parameters): 0: model (string) 1: prompt/style_prompt (text) 2: unload_model (boolean) 3: odeint_method (enum) 4: steps (int) 5: cfg (int) 6: quality_or_speed (enum) 7: seed (int) 8: control_after_generate (string) 9: edit (boolean) 10: segments/edit_segments (text) Updated all four workflows: - diffrhythm-simple-t2m-v1.json - diffrhythm-random-generation-v1.json - diffrhythm-reference-based-v1.json - diffrhythm-full-length-t2m-v1.json 🤖 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 15:57:25 +01:00
Sebastian Krüger	279f703591	fix: correct DiffRhythm workflow parameter order to match function signature All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s Details The parameters must match the diffrhythmgen() function signature order, not the INPUT_TYPES order. The function has 'edit' as the first parameter. Correct widgets_values order (11 parameters): 0: edit (boolean) 1: model (string) 2: style_prompt (string) 3: lyrics_or_edit_lyrics (string) 4: edit_segments (string) 5: odeint_method (enum) 6: steps (int) 7: cfg (int) 8: quality_or_speed (enum) 9: unload_model (boolean) 10: seed (int) Note: style_audio_or_edit_song comes from input connection (not in widgets) Note: chunked parameter is hidden (not in widgets) Updated workflows: - diffrhythm-simple-t2m-v1.json - diffrhythm-random-generation-v1.json - diffrhythm-reference-based-v1.json - diffrhythm-full-length-t2m-v1.json 🤖 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 15:53:15 +01:00
Sebastian Krüger	64db634ab5	fix: correct DiffRhythm workflow parameter order for all three workflows All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 15s Details Changed edit_segments from "[-1, 20], [60, -1]" to empty string "" at position 11. This fixes validation errors where parameters were being interpreted as wrong types. The correct 12-parameter structure is: 0: model (string) 1: style_prompt (string) 2: unload_model (boolean) 3: odeint_method (enum) 4: steps (int) 5: cfg (int) 6: quality_or_speed (enum) 7: seed (int) 8: edit (boolean) 9: edit_lyrics (string, empty) 10: edit_song (string, empty) 11: edit_segments (string, empty) Updated workflows: - diffrhythm-random-generation-v1.json - diffrhythm-reference-based-v1.json - diffrhythm-full-length-t2m-v1.json 🤖 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 15:48:56 +01:00
Sebastian Krüger	56476f4230	fix: add missing edit_song and edit_lyrics parameters to DiffRhythm workflows All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s Details Fix "edit song, edit lyrics, edit segments must be provided" error by adding the two missing parameters to all three DiffRhythm workflow files: - diffrhythm-random-generation-v1.json - diffrhythm-reference-based-v1.json - diffrhythm-full-length-t2m-v1.json Added empty string parameters at positions 9 and 10 in widgets_values array: - edit_song: "" (empty when edit=false) - edit_lyrics: "" (empty when edit=false) The DiffRhythmRun node requires 12 parameters total, not 10. These workflows use edit=false (no editing), so the edit parameters should be empty strings. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 12:55:58 +01:00
Sebastian Krüger	a249dfc941	feat: add torchcodec dependency for DiffRhythm audio caching All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s Details Add torchcodec to ComfyUI requirements.txt to fix audio tensor caching error in DiffRhythm. This package is required for save_with_torchcodec function used by DiffRhythm audio nodes. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 12:44:05 +01:00
Sebastian Krüger	cf3fcafbae	feat: add DiffRhythm music generation support All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 15s Details - Add DiffRhythm dependencies to requirements.txt (19 packages) - Add reference audio placeholder for style transfer workflow - DiffRhythm nodes now loading in ComfyUI - All four workflows ready for music generation 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 12:17:46 +01:00
Sebastian Krüger	8fe87064f8	feat: add DiffRhythm dependencies to ComfyUI requirements All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s Details Added all required packages for ComfyUI_DiffRhythm extension: - torchdiffeq: ODE solvers for diffusion models - x-transformers: Transformer architecture components - librosa: Audio analysis and feature extraction - pandas, pyarrow: Data handling - ema-pytorch, prefigure: Training utilities - muq: Music quality model - mutagen: Audio metadata handling - pykakasi, jieba, cn2an, pypinyin: Chinese/Japanese text processing - Unidecode, phonemizer, inflect: Text normalization and phonetic conversion - py3langid: Language identification These dependencies enable the DiffRhythm node to load and function properly in ComfyUI, fixing the "ModuleNotFoundError: No module named 'infer_utils'" error. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 10:51:50 +01:00
Sebastian Krüger	44762a063c	fix: update DiffRhythm workflows with correct node names and parameters All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s Details Updated all 4 DiffRhythm workflow JSON files to use actual node class names from ComfyUI_DiffRhythm: Node Name Changes: - DiffRhythmTextToMusic → DiffRhythmRun - DiffRhythmRandomGeneration → DiffRhythmRun (with empty style_prompt) - DiffRhythmReferenceBasedGeneration → DiffRhythmRun (with audio input) Corrected Parameter Structure: All workflows now use proper widgets_values array matching DiffRhythmRun INPUT_TYPES: 1. model (string: "cfm_model_v1_2.pt", "cfm_model.pt", or "cfm_full_model.pt") 2. style_prompt (string: multiline text or empty for random) 3. unload_model (boolean: default true) 4. odeint_method (string: "euler", "midpoint", "rk4", "implicit_adams") 5. steps (int: 1-100, default 30) 6. cfg (int: 1-10, default 4) 7. quality_or_speed (string: "quality" or "speed") 8. seed (int: -1 for random, or specific number) 9. edit (boolean: default false) 10. edit_segments (string: "[-1, 20], [60, -1]") Workflow-Specific Updates: diffrhythm-simple-t2m-v1.json: - Text-to-music workflow for 95s generation - Uses cfm_model_v1_2.pt with text prompt guidance - Default settings: steps=30, cfg=4, speed mode, seed=42 diffrhythm-full-length-t2m-v1.json: - Full-length 4m45s (285s) generation - Uses cfm_full_model.pt for extended compositions - Quality mode enabled for better results - Default seed=123 diffrhythm-reference-based-v1.json: - Reference audio + text prompt workflow - Uses LoadAudio node connected to style_audio_or_edit_song input - Higher cfg=5 for stronger prompt adherence - Demonstrates optional audio input connection diffrhythm-random-generation-v1.json: - Pure random generation (no prompt/guidance) - Empty style_prompt string - Minimal cfg=1 for maximum randomness - Random seed=-1 for unique output each time Documentation Updates: - Removed PLACEHOLDER notes - Updated usage sections with correct parameter descriptions - Added notes about optional MultiLineLyricsDR node for lyrics - Clarified parameter behavior and recommendations These workflows are now ready to use in ComfyUI with the installed DiffRhythm extension. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 10:46:31 +01:00
Sebastian Krüger	f2186db78e	feat: integrate ComfyUI_DiffRhythm extension with 7 models and 4 workflows All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 15s Details - Add DiffRhythm to arty.yml references and setup/comfyui-nodes - Install espeak-ng system dependency for phoneme processing - Add 7 DiffRhythm models to models_huggingface.yaml with file mappings: * ASLP-lab/DiffRhythm-1_2 (95s generation) * ASLP-lab/DiffRhythm-full (4m45s generation) * ASLP-lab/DiffRhythm-base * ASLP-lab/DiffRhythm-vae * OpenMuQ/MuQ-MuLan-large * OpenMuQ/MuQ-large-msd-iter * FacebookAI/xlm-roberta-base - Create 4 comprehensive workflows: * diffrhythm-simple-t2m-v1.json (basic 95s text-to-music) * diffrhythm-full-length-t2m-v1.json (4m45s full-length) * diffrhythm-reference-based-v1.json (style transfer with reference audio) * diffrhythm-random-generation-v1.json (no-prompt random generation) - Update storage requirements: 90GB essential, 149GB total 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 09:50:45 +01:00
Sebastian Krüger	0e3150e26c	fix: correct Pony Diffusion workflow checkpoint reference - Changed checkpoint from waiIllustriousSDXL_v150.safetensors to ponyDiffusionV6XL_v6StartWithThisOne.safetensors - Fixed metadata model reference (was incorrectly referencing LoRA) - Added files field to models_civitai.yaml for explicit filename mapping - Aligns workflow with actual Pony Diffusion V6 XL model	2025-11-23 19:57:45 +01:00
Sebastian Krüger	5770563d9a	feat: add comprehensive negative embeddings support (SD 1.5, SDXL, Pony) - Add 3 new embedding categories to models_civitai.yaml: - embeddings_sd15: 6 embeddings (BadDream, UnrealisticDream, badhandv4, EasyNegative, FastNegativeV2, BadNegAnatomyV1-neg) - embeddings_sdxl: 1 embedding (BadX v1.1) - embeddings_pony: 2 embeddings (zPDXL3, zPDXLxxx) - Total storage: ~1.1 MB (9 embeddings) - Add comprehensive embeddings documentation to NSFW README - Include usage examples, compatibility notes, and syntax guide - Document embedding weights and recommended combinations	2025-11-23 19:39:18 +01:00
Sebastian Krüger	68d3606cab	fix: use WAI-NSFW-Illustrious checkpoint instead of non-existent Pony model Changed checkpoint from 'add-detail-xl.safetensors' (which is a LoRA) to 'waiIllustriousSDXL_v150.safetensors' which is the downloaded anime NSFW model	2025-11-23 19:13:22 +01:00
Sebastian Krüger	1d851bb11c	feat: add NSFW ComfyUI workflow suite with LoRA fusion and upscaling Added 5 production-ready workflows to leverage downloaded CivitAI NSFW models: NSFW Text-to-Image Workflows (3): - lustify-realistic-t2i-production-v1.json - Photorealistic NSFW with LUSTIFY v7.0 - DPM++ 2M SDE, Exponential scheduler, 30 steps, CFG 6.0 - Optimized for women in realistic scenarios with professional photography quality - pony-anime-t2i-production-v1.json - Anime/cartoon/furry with Pony Diffusion V6 XL - Euler Ancestral, Normal scheduler, 35 steps, CFG 7.5 - Danbooru tag support, balanced safe/questionable/explicit content - realvisxl-lightning-t2i-production-v1.json - Ultra-fast photorealistic with RealVisXL V5.0 Lightning - DPM++ SDE Karras, 6 steps (vs 30+), CFG 2.0 - 4-6 step generation for rapid high-quality output Enhancement Workflows (2): - lora-fusion-t2i-production-v1.json - Multi-LoRA stacking (text-to-image directory) - Stack up to 3 LoRAs with adjustable weights (0.2-1.0) - Compatible with all SDXL checkpoints including NSFW models - Hierarchical strength control for style mixing and enhancement - nsfw-ultimate-upscale-production-v1.json - Professional 2x upscaling with LUSTIFY - RealESRGAN_x2 + diffusion refinement via Ultimate SD Upscale - Tiled processing, optimized for detailed skin texture - Denoise 0.25 preserves original composition Documentation: - Comprehensive README.md with usage examples, API integration, model comparison - Optimized settings for each workflow based on model recommendations - Advanced usage guide for LoRA stacking and upscaling pipelines - Version history tracking Total additions: 1,768 lines across 6 files These workflows complement the 27GB of CivitAI NSFW models downloaded in previous commit. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 18:46:22 +01:00
Sebastian Krüger	61fd0e9265	fix: correct widgets_values - remove upscale_model/custom params, fix seam_fix order (width before mask_blur)	2025-11-23 12:34:28 +01:00
Sebastian Krüger	b9afd68ddd	fix: add control_after_generate parameter at position 2 (23 total params)	2025-11-23 12:26:27 +01:00
Sebastian Krüger	2f53f542e7	fix: add custom_sampler and custom_sigmas null placeholders (22 total parameters)	2025-11-23 12:21:40 +01:00
Sebastian Krüger	14a1fcf4a7	fix: add null placeholder for upscale_model in widgets_values (20th parameter)	2025-11-23 12:20:48 +01:00
Sebastian Krüger	626dab6f65	fix: back to function signature order for seam_fix params	2025-11-23 12:18:42 +01:00
Sebastian Krüger	abbd89981e	fix: use USDU_base_inputs order (seam_fix_width before mask_blur)	2025-11-23 12:15:49 +01:00
Sebastian Krüger	f976dc2c74	fix: correct seam_fix parameter order - mask_blur comes before width in function signature	2025-11-23 12:14:19 +01:00
Sebastian Krüger	75c6c77391	fix: correct widgets_values array to match actual parameter order (19 widget values for unconnected parameters)	2025-11-23 12:11:54 +01:00
Sebastian Krüger	6f4ac14032	fix: correct seam_fix parameter order in widgets_values (seam_fix_denoise was 1.0, should be 0.3)	2025-11-23 12:10:23 +01:00
Sebastian Krüger	21efd3b86d	fix: remove widget parameters from inputs array - they belong in widgets_values only	2025-11-23 12:09:11 +01:00
Sebastian Krüger	8b8a29a47e	fix: add missing type fields to sampler_name and scheduler inputs	2025-11-23 12:07:43 +01:00
Sebastian Krüger	d6fbda38f1	fix: correct UltimateSDUpscale input indices in workflow The upscale_model input was at index 5 instead of index 12, causing all widget parameters to be misaligned. Fixed by: - Updating link target index from 5 to 12 for upscale_model - Adding explicit entries for widget parameters in inputs array - Maintaining correct parameter order per custom node definition 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 12:06:25 +01:00
Sebastian Krüger	096d565f3d	chore: reorganize workflow assets and remove unused files - Move example images to their respective workflow directories - Remove unused COMFYUI_MODELS.md (content consolidated elsewhere) - Remove fix_workflows.py script (no longer needed) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 12:01:38 +01:00
Sebastian Krüger	d12c868e65	fix: add UpscaleModelLoader and correct widget order in UltimateSDUpscale workflow - Added UpscaleModelLoader node (node 8) for RealESRGAN model - Connected upscale_model input to UltimateSDUpscale - Fixed widgets_values array to match correct parameter order: upscale_by, seed, steps, cfg, sampler_name, scheduler, denoise, mode_type, tile_width, tile_height, mask_blur, tile_padding, seam_fix_mode, seam_fix_denoise, seam_fix_width, seam_fix_mask_blur, seam_fix_padding, force_uniform_tiles, tiled_decode - Updated version to 1.1.0	2025-11-23 11:45:28 +01:00
Sebastian Krüger	c114569309	feat: add placeholder input images for workflows Added example images for testing workflows: - input_image.png (512x512) - for general upscaling workflows - input_portrait.png (512x768) - for portrait/face upscaling workflows	2025-11-23 11:33:00 +01:00
Sebastian Krüger	0df4c63412	fix: add missing links and rebuild upscaling workflows - simple-upscale: Added proper node connections, changed ImageScale to ImageScaleBy - ultimate-sd-upscale: Added CLIP text encoders, removed incorrect VAEDecode and UpscaleModelLoader nodes - face-upscale: Simplified to basic upscaling workflow (FaceDetailer requires complex bbox detector setup) All workflows now have proper inputs, outputs, and links arrays.	2025-11-23 11:30:29 +01:00
Sebastian Krüger	f1788f88ca	fix: replace PreviewAudio with AudioPlay in MusicGen workflows Sound Lab's Musicgen_ node outputs AUDIO format that is only compatible with Sound Lab nodes like AudioPlay, not the built-in ComfyUI audio nodes (SaveAudio/PreviewAudio).	2025-11-23 11:20:15 +01:00
Sebastian Krüger	b6ab524b79	fix: replace SaveAudio with PreviewAudio in MusicGen workflows SaveAudio was erroring on 'waveform' key - the AUDIO output from Musicgen_ node has a different internal structure than what SaveAudio expects. PreviewAudio is more compatible with Sound Lab's AUDIO format. Files are still saved to ComfyUI output directory, just through PreviewAudio instead of SaveAudio. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 11:14:17 +01:00
Sebastian Krüger	c787b40311	fix: rebuild all MusicGen workflows with correct nodes and links Fixed medium, small, and melody workflows: - Replaced non-existent nodes with Musicgen_ from Sound Lab - Added missing links arrays to connect nodes properly - Updated all metadata and performance specs Note: Melody workflow simplified to text-only as Sound Lab doesn't currently support melody conditioning via audio input. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 11:09:33 +01:00
Sebastian Krüger	85b1831876	fix: rebuild MusicGen workflow with correct node types and links Changed from non-existent nodes to actual Sound Lab nodes: - Replaced MusicGenLoader/MusicGenTextEncode/MusicGenSampler with Musicgen_ - Replaced custom SaveAudio with standard SaveAudio node - Added missing links array to connect nodes - All parameters: prompt, duration, guidance_scale, seed, device Node is called "Musicgen_" (with underscore) from comfyui-sound-lab. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 11:06:42 +01:00
Sebastian Krüger	5c1e9d092b	fix: rebuild SD3.5 workflow with TripleCLIPLoader SD3.5 checkpoint doesn't contain CLIP encoders. Now using: - CheckpointLoaderSimple for MODEL and VAE - TripleCLIPLoader for CLIP-L, CLIP-G, and T5-XXL - Standard CLIPTextEncode for prompts This fixes the "clip input is invalid: None" error. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 10:56:09 +01:00
Sebastian Krüger	91ed1aa9e3	fix: correct model paths in SD3.5 and SDXL Refiner workflows Changed from diffusers paths to actual .safetensors filenames: - sd3.5: diffusers/stable-diffusion-3.5-large -> sd3.5_large.safetensors - sdxl-base: diffusers/stable-diffusion-xl-base-1.0 -> sd_xl_base_1.0.safetensors - sdxl-refiner: diffusers/stable-diffusion-xl-refiner-1.0 -> sd_xl_refiner_1.0.safetensors 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 10:51:55 +01:00
Sebastian Krüger	ac74730ee2	fix: rebuild FLUX Schnell workflow with correct node types Replaced CheckpointLoaderSimple with UNETLoader + DualCLIPLoader. Replaced CLIPTextEncode with CLIPTextEncodeFlux. Added proper VAELoader with ae.safetensors. Added ConditioningZeroOut for empty negative conditioning. Removed old negative prompt input (FLUX doesn't use it). Changes match FLUX Dev workflow structure. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 10:48:13 +01:00
Sebastian Krüger	7dd6739f5e	fix: add FLUX VAE autoencoder for proper image decoding Added FLUX VAE (ae.safetensors) to model configuration and updated workflow to use it instead of non-existent pixel_space VAE. This fixes the SaveImage data type error (1, 1, 16), \|u1. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 10:43:11 +01:00
Sebastian Krüger	3eced21d2a	fix: add link 8 to CLIPTextEncodeFlux output links array Node 3 (CLIPTextEncodeFlux) output feeds both KSampler (link 3) and ConditioningZeroOut (link 8), so the output links array must include both links. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 10:39:17 +01:00
Sebastian Krüger	30cc2513cb	fix: add ConditioningZeroOut for FLUX workflow negative input FLUX models require negative conditioning even though they don't use it. Added ConditioningZeroOut node to create empty negative conditioning from positive output, satisfying KSampler's required negative input. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 10:35:22 +01:00
Sebastian Krüger	a2455ae9ee	fix: rebuild FLUX Dev workflow with correct node types - Replace CheckpointLoaderSimple with UNETLoader - Replace CLIPTextEncode with DualCLIPLoader + CLIPTextEncodeFlux - Add VAELoader with pixel_space - Remove negative prompt (FLUX uses guidance differently) - Set CFG to 1.0, guidance in text encoder (3.5) - Add all node connections in links array 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 10:30:47 +01:00
Sebastian Krüger	8b4f141d82	fix: rebuild SVD-XT workflow with correct node types - Replace DiffusersLoader with ImageOnlyCheckpointLoader - Replace SVDSampler with SVD_img2vid_Conditioning + KSampler - Add VideoLinearCFGGuidance for temporal consistency - Add all node connections in links array - Configure VHS_VideoCombine with correct parameters (25 frames) - Increase steps to 30 for better quality with longer video 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 10:15:43 +01:00
Sebastian Krüger	d7bae9cde5	fix: correct VHS_VideoCombine parameters for SVD workflow Remove format-specific parameters from widgets_values array. Only base parameters should be in widgets_values: - frame_rate, loop_count, filename_prefix, format, pingpong, save_output Format-specific params (pix_fmt, crf) are added dynamically by ComfyUI. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 10:11:52 +01:00
Sebastian Krüger	764cb5d2d7	fix: rebuild SVD workflow with correct node types - Replace DiffusersLoader with ImageOnlyCheckpointLoader - Replace SVDSampler with SVD_img2vid_Conditioning + KSampler - Add VideoLinearCFGGuidance for temporal consistency - Add all node connections in links array - Configure VHS_VideoCombine with H.264 parameters 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 10:10:38 +01:00
Sebastian Krüger	22afe18957	fix: change input image to 720x480 for CogVideoX-5b-I2V - CogVideoX-5b-I2V requires specific resolution (720x480) - Cannot generate videos at different resolutions - Update placeholder image to match model requirements	2025-11-23 09:51:12 +01:00
Sebastian Krüger	385b36b062	feat: enable CPU offload for CogVideoX model to reduce VRAM usage - Add enable_sequential_cpu_offload=true to DownloadAndLoadCogVideoModel - Reduces VRAM from ~20GB to ~12GB at cost of slower inference - Widget values: [model, precision, quantization, cpu_offload] = ['THUDM/CogVideoX-5b-I2V', 'bf16', 'disabled', true] - Necessary for 24GB GPU with other services running	2025-11-23 09:47:02 +01:00

1 2

83 Commits