runpod

Author	SHA1	Message	Date
Sebastian Krüger	5096e3ffb5	feat: add Pivoine custom ComfyUI nodes for DiffRhythm All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s Details Add custom node wrapper PivoineDiffRhythmRun that fixes tensor dimension mismatch error by disabling chunked VAE decoding. The original DiffRhythm node's overlap=32 parameter conflicts with the VAE's 64-channel architecture. Changes: - Add comfyui/nodes/pivoine_diffrhythm.py: Custom node wrapper - Add comfyui/nodes/__init__.py: Package initialization - Add arty.yml setup/pivoine-nodes: Deployment script for symlink - Update all 4 DiffRhythm workflows to use PivoineDiffRhythmRun Technical details: - Inherits from DiffRhythmRun to avoid upstream patching - Forces chunked=False in diffrhythmgen() override - Requires more VRAM (~12-16GB) but RTX 4090 has 24GB - Category: 🌸Pivoine/Audio for easy identification 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 16:28:54 +01:00
Sebastian Krüger	073711c017	fix: use correct DiffRhythm parameter order from UI testing All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s Details Correct widgets_values order (11 parameters): 0: model (string) 1: prompt/style_prompt (text) 2: unload_model (boolean) 3: odeint_method (enum) 4: steps (int) 5: cfg (int) 6: quality_or_speed (enum) 7: seed (int) 8: control_after_generate (string) 9: edit (boolean) 10: segments/edit_segments (text) Updated all four workflows: - diffrhythm-simple-t2m-v1.json - diffrhythm-random-generation-v1.json - diffrhythm-reference-based-v1.json - diffrhythm-full-length-t2m-v1.json 🤖 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 15:57:25 +01:00
Sebastian Krüger	279f703591	fix: correct DiffRhythm workflow parameter order to match function signature All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s Details The parameters must match the diffrhythmgen() function signature order, not the INPUT_TYPES order. The function has 'edit' as the first parameter. Correct widgets_values order (11 parameters): 0: edit (boolean) 1: model (string) 2: style_prompt (string) 3: lyrics_or_edit_lyrics (string) 4: edit_segments (string) 5: odeint_method (enum) 6: steps (int) 7: cfg (int) 8: quality_or_speed (enum) 9: unload_model (boolean) 10: seed (int) Note: style_audio_or_edit_song comes from input connection (not in widgets) Note: chunked parameter is hidden (not in widgets) Updated workflows: - diffrhythm-simple-t2m-v1.json - diffrhythm-random-generation-v1.json - diffrhythm-reference-based-v1.json - diffrhythm-full-length-t2m-v1.json 🤖 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 15:53:15 +01:00
Sebastian Krüger	44762a063c	fix: update DiffRhythm workflows with correct node names and parameters All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s Details Updated all 4 DiffRhythm workflow JSON files to use actual node class names from ComfyUI_DiffRhythm: Node Name Changes: - DiffRhythmTextToMusic → DiffRhythmRun - DiffRhythmRandomGeneration → DiffRhythmRun (with empty style_prompt) - DiffRhythmReferenceBasedGeneration → DiffRhythmRun (with audio input) Corrected Parameter Structure: All workflows now use proper widgets_values array matching DiffRhythmRun INPUT_TYPES: 1. model (string: "cfm_model_v1_2.pt", "cfm_model.pt", or "cfm_full_model.pt") 2. style_prompt (string: multiline text or empty for random) 3. unload_model (boolean: default true) 4. odeint_method (string: "euler", "midpoint", "rk4", "implicit_adams") 5. steps (int: 1-100, default 30) 6. cfg (int: 1-10, default 4) 7. quality_or_speed (string: "quality" or "speed") 8. seed (int: -1 for random, or specific number) 9. edit (boolean: default false) 10. edit_segments (string: "[-1, 20], [60, -1]") Workflow-Specific Updates: diffrhythm-simple-t2m-v1.json: - Text-to-music workflow for 95s generation - Uses cfm_model_v1_2.pt with text prompt guidance - Default settings: steps=30, cfg=4, speed mode, seed=42 diffrhythm-full-length-t2m-v1.json: - Full-length 4m45s (285s) generation - Uses cfm_full_model.pt for extended compositions - Quality mode enabled for better results - Default seed=123 diffrhythm-reference-based-v1.json: - Reference audio + text prompt workflow - Uses LoadAudio node connected to style_audio_or_edit_song input - Higher cfg=5 for stronger prompt adherence - Demonstrates optional audio input connection diffrhythm-random-generation-v1.json: - Pure random generation (no prompt/guidance) - Empty style_prompt string - Minimal cfg=1 for maximum randomness - Random seed=-1 for unique output each time Documentation Updates: - Removed PLACEHOLDER notes - Updated usage sections with correct parameter descriptions - Added notes about optional MultiLineLyricsDR node for lyrics - Clarified parameter behavior and recommendations These workflows are now ready to use in ComfyUI with the installed DiffRhythm extension. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 10:46:31 +01:00
Sebastian Krüger	f2186db78e	feat: integrate ComfyUI_DiffRhythm extension with 7 models and 4 workflows All checks were successful Build and Push RunPod Docker Image / build-and-push (push) Successful in 15s Details - Add DiffRhythm to arty.yml references and setup/comfyui-nodes - Install espeak-ng system dependency for phoneme processing - Add 7 DiffRhythm models to models_huggingface.yaml with file mappings: * ASLP-lab/DiffRhythm-1_2 (95s generation) * ASLP-lab/DiffRhythm-full (4m45s generation) * ASLP-lab/DiffRhythm-base * ASLP-lab/DiffRhythm-vae * OpenMuQ/MuQ-MuLan-large * OpenMuQ/MuQ-large-msd-iter * FacebookAI/xlm-roberta-base - Create 4 comprehensive workflows: * diffrhythm-simple-t2m-v1.json (basic 95s text-to-music) * diffrhythm-full-length-t2m-v1.json (4m45s full-length) * diffrhythm-reference-based-v1.json (style transfer with reference audio) * diffrhythm-random-generation-v1.json (no-prompt random generation) - Update storage requirements: 90GB essential, 149GB total 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 09:50:45 +01:00

5 Commits