runpod

Author	SHA1	Message	Date
Sebastian Krüger	06b8ec0064	refactor: remove simple ACE Step workflow in favor of official workflows Removed acestep-simple-t2m-v1.json as the official Comfy-Org workflows provide better quality: - acestep-official-t2m-v1.json - Advanced T2M with specialized nodes - acestep-m2m-editing-v1.json - Music-to-music editing capability 🤖 Generated with Claude Code (https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-25 09:46:24 +01:00
Sebastian Krüger	e610330b91	feat: add official Comfy-Org ACE Step workflows and example assets Added 2 official workflows from Comfy-Org/example_workflows: - acestep-official-t2m-v1.json - Advanced T2M with specialized nodes (50 steps, multiple formats) - acestep-m2m-editing-v1.json - Music-to-music editing with denoise control Added 3 audio example assets: - acestep-m2m-input.mp3 (973 KB) - Example input for M2M editing - acestep-t2m-output.flac (3.4 MB) - T2M output reference - acestep-m2m-output.mp3 (998 KB) - M2M output reference Total: 3 workflows (simple + official T2M + M2M editing) with audio examples 🤖 Generated with Claude Code (https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-25 09:34:57 +01:00
Sebastian Krüger	55b37894b1	fix: remove empty ACE Step workflow placeholders Removed 3 empty placeholder workflows that only contained metadata: - acestep-multilang-t2m-v1.json - acestep-remix-m2m-v1.json - acestep-chinese-rap-v1.json Kept only the functional workflow: - acestep-simple-t2m-v1.json (6 nodes, fully operational) Users can use the simple workflow and modify the prompt for different use cases: - Multi-language: prefix lyrics with language tags like [zh], [ja], [ko] - Remixing: load audio input and adjust denoise strength (0.1-0.7) - Chinese RAP: use Chinese RAP LoRA with strength 0.8-1.0 🤖 Generated with Claude Code (https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-25 08:51:37 +01:00
Sebastian Krüger	513062623c	feat: integrate ACE Step music generation with 19-language support Added ACE Step v1 3.5B model for state-of-the-art music generation: - 15x faster than LLM baselines with superior structural coherence - Supports 19 languages (en, zh, ja, ko, fr, es, de, it, pt, ru + 9 more) - Voice cloning, lyric alignment, and multi-genre capabilities Changes: - Added ACE Step models to models_huggingface.yaml (checkpoint + Chinese RAP LoRA) - Added ComfyUI_ACE-Step custom node to arty.yml with installation script - Created 4 comprehensive workflows in comfyui/workflows/text-to-music/: * acestep-simple-t2m-v1.json - Basic 60s text-to-music generation * acestep-multilang-t2m-v1.json - 19-language music generation * acestep-remix-m2m-v1.json - Music-to-music remixing with style transfer * acestep-chinese-rap-v1.json - Chinese hip-hop with specialized LoRA 🤖 Generated with Claude Code (https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-25 08:40:17 +01:00
Sebastian Krüger	6ce989dd91	Remove unused diffrhythm-random-generation workflow Removed diffrhythm-random-generation-v1.json as it's no longer needed. Keeping only the essential DiffRhythm workflows: - simple text-to-music (95s) - full-length generation (4m45s) - reference-based style transfer 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 20:34:53 +01:00
Sebastian Krüger	d74a7cb7cb	fix: replace custom Pivoine node with direct DiffRhythm patch - Remove custom PivoineDiffRhythmRun wrapper node - Add git patch file for ComfyUI_DiffRhythm __init__.py - Patch adds LlamaConfig fix at import time - Add arty script 'fix/diffrhythm-patch' to apply patch - Revert all workflows to use original DiffRhythmRun - Remove startup_patch.py and revert start.sh This approach is cleaner and more maintainable than wrapping the node. The patch directly fixes the tensor dimension mismatch (32 vs 64) in DiffRhythm's rotary position embeddings by ensuring num_attention_heads and num_key_value_heads are properly set based on hidden_size. References: - https://github.com/billwuhao/ComfyUI_DiffRhythm/issues/44 - https://github.com/billwuhao/ComfyUI_DiffRhythm/issues/48	2025-11-24 19:27:18 +01:00
Sebastian Krüger	5096e3ffb5	feat: add Pivoine custom ComfyUI nodes for DiffRhythm Add custom node wrapper PivoineDiffRhythmRun that fixes tensor dimension mismatch error by disabling chunked VAE decoding. The original DiffRhythm node's overlap=32 parameter conflicts with the VAE's 64-channel architecture. Changes: - Add comfyui/nodes/pivoine_diffrhythm.py: Custom node wrapper - Add comfyui/nodes/__init__.py: Package initialization - Add arty.yml setup/pivoine-nodes: Deployment script for symlink - Update all 4 DiffRhythm workflows to use PivoineDiffRhythmRun Technical details: - Inherits from DiffRhythmRun to avoid upstream patching - Forces chunked=False in diffrhythmgen() override - Requires more VRAM (~12-16GB) but RTX 4090 has 24GB - Category: 🌸Pivoine/Audio for easy identification 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 16:28:54 +01:00
Sebastian Krüger	073711c017	fix: use correct DiffRhythm parameter order from UI testing Correct widgets_values order (11 parameters): 0: model (string) 1: prompt/style_prompt (text) 2: unload_model (boolean) 3: odeint_method (enum) 4: steps (int) 5: cfg (int) 6: quality_or_speed (enum) 7: seed (int) 8: control_after_generate (string) 9: edit (boolean) 10: segments/edit_segments (text) Updated all four workflows: - diffrhythm-simple-t2m-v1.json - diffrhythm-random-generation-v1.json - diffrhythm-reference-based-v1.json - diffrhythm-full-length-t2m-v1.json 🤖 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 15:57:25 +01:00
Sebastian Krüger	279f703591	fix: correct DiffRhythm workflow parameter order to match function signature The parameters must match the diffrhythmgen() function signature order, not the INPUT_TYPES order. The function has 'edit' as the first parameter. Correct widgets_values order (11 parameters): 0: edit (boolean) 1: model (string) 2: style_prompt (string) 3: lyrics_or_edit_lyrics (string) 4: edit_segments (string) 5: odeint_method (enum) 6: steps (int) 7: cfg (int) 8: quality_or_speed (enum) 9: unload_model (boolean) 10: seed (int) Note: style_audio_or_edit_song comes from input connection (not in widgets) Note: chunked parameter is hidden (not in widgets) Updated workflows: - diffrhythm-simple-t2m-v1.json - diffrhythm-random-generation-v1.json - diffrhythm-reference-based-v1.json - diffrhythm-full-length-t2m-v1.json 🤖 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 15:53:15 +01:00
Sebastian Krüger	64db634ab5	fix: correct DiffRhythm workflow parameter order for all three workflows Changed edit_segments from "[-1, 20], [60, -1]" to empty string "" at position 11. This fixes validation errors where parameters were being interpreted as wrong types. The correct 12-parameter structure is: 0: model (string) 1: style_prompt (string) 2: unload_model (boolean) 3: odeint_method (enum) 4: steps (int) 5: cfg (int) 6: quality_or_speed (enum) 7: seed (int) 8: edit (boolean) 9: edit_lyrics (string, empty) 10: edit_song (string, empty) 11: edit_segments (string, empty) Updated workflows: - diffrhythm-random-generation-v1.json - diffrhythm-reference-based-v1.json - diffrhythm-full-length-t2m-v1.json 🤖 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 15:48:56 +01:00
Sebastian Krüger	56476f4230	fix: add missing edit_song and edit_lyrics parameters to DiffRhythm workflows Fix "edit song, edit lyrics, edit segments must be provided" error by adding the two missing parameters to all three DiffRhythm workflow files: - diffrhythm-random-generation-v1.json - diffrhythm-reference-based-v1.json - diffrhythm-full-length-t2m-v1.json Added empty string parameters at positions 9 and 10 in widgets_values array: - edit_song: "" (empty when edit=false) - edit_lyrics: "" (empty when edit=false) The DiffRhythmRun node requires 12 parameters total, not 10. These workflows use edit=false (no editing), so the edit parameters should be empty strings. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 12:55:58 +01:00
Sebastian Krüger	cf3fcafbae	feat: add DiffRhythm music generation support - Add DiffRhythm dependencies to requirements.txt (19 packages) - Add reference audio placeholder for style transfer workflow - DiffRhythm nodes now loading in ComfyUI - All four workflows ready for music generation 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 12:17:46 +01:00
Sebastian Krüger	44762a063c	fix: update DiffRhythm workflows with correct node names and parameters Updated all 4 DiffRhythm workflow JSON files to use actual node class names from ComfyUI_DiffRhythm: Node Name Changes: - DiffRhythmTextToMusic → DiffRhythmRun - DiffRhythmRandomGeneration → DiffRhythmRun (with empty style_prompt) - DiffRhythmReferenceBasedGeneration → DiffRhythmRun (with audio input) Corrected Parameter Structure: All workflows now use proper widgets_values array matching DiffRhythmRun INPUT_TYPES: 1. model (string: "cfm_model_v1_2.pt", "cfm_model.pt", or "cfm_full_model.pt") 2. style_prompt (string: multiline text or empty for random) 3. unload_model (boolean: default true) 4. odeint_method (string: "euler", "midpoint", "rk4", "implicit_adams") 5. steps (int: 1-100, default 30) 6. cfg (int: 1-10, default 4) 7. quality_or_speed (string: "quality" or "speed") 8. seed (int: -1 for random, or specific number) 9. edit (boolean: default false) 10. edit_segments (string: "[-1, 20], [60, -1]") Workflow-Specific Updates: diffrhythm-simple-t2m-v1.json: - Text-to-music workflow for 95s generation - Uses cfm_model_v1_2.pt with text prompt guidance - Default settings: steps=30, cfg=4, speed mode, seed=42 diffrhythm-full-length-t2m-v1.json: - Full-length 4m45s (285s) generation - Uses cfm_full_model.pt for extended compositions - Quality mode enabled for better results - Default seed=123 diffrhythm-reference-based-v1.json: - Reference audio + text prompt workflow - Uses LoadAudio node connected to style_audio_or_edit_song input - Higher cfg=5 for stronger prompt adherence - Demonstrates optional audio input connection diffrhythm-random-generation-v1.json: - Pure random generation (no prompt/guidance) - Empty style_prompt string - Minimal cfg=1 for maximum randomness - Random seed=-1 for unique output each time Documentation Updates: - Removed PLACEHOLDER notes - Updated usage sections with correct parameter descriptions - Added notes about optional MultiLineLyricsDR node for lyrics - Clarified parameter behavior and recommendations These workflows are now ready to use in ComfyUI with the installed DiffRhythm extension. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 10:46:31 +01:00
Sebastian Krüger	f2186db78e	feat: integrate ComfyUI_DiffRhythm extension with 7 models and 4 workflows - Add DiffRhythm to arty.yml references and setup/comfyui-nodes - Install espeak-ng system dependency for phoneme processing - Add 7 DiffRhythm models to models_huggingface.yaml with file mappings: * ASLP-lab/DiffRhythm-1_2 (95s generation) * ASLP-lab/DiffRhythm-full (4m45s generation) * ASLP-lab/DiffRhythm-base * ASLP-lab/DiffRhythm-vae * OpenMuQ/MuQ-MuLan-large * OpenMuQ/MuQ-large-msd-iter * FacebookAI/xlm-roberta-base - Create 4 comprehensive workflows: * diffrhythm-simple-t2m-v1.json (basic 95s text-to-music) * diffrhythm-full-length-t2m-v1.json (4m45s full-length) * diffrhythm-reference-based-v1.json (style transfer with reference audio) * diffrhythm-random-generation-v1.json (no-prompt random generation) - Update storage requirements: 90GB essential, 149GB total 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 09:50:45 +01:00
Sebastian Krüger	f1788f88ca	fix: replace PreviewAudio with AudioPlay in MusicGen workflows Sound Lab's Musicgen_ node outputs AUDIO format that is only compatible with Sound Lab nodes like AudioPlay, not the built-in ComfyUI audio nodes (SaveAudio/PreviewAudio).	2025-11-23 11:20:15 +01:00
Sebastian Krüger	b6ab524b79	fix: replace SaveAudio with PreviewAudio in MusicGen workflows SaveAudio was erroring on 'waveform' key - the AUDIO output from Musicgen_ node has a different internal structure than what SaveAudio expects. PreviewAudio is more compatible with Sound Lab's AUDIO format. Files are still saved to ComfyUI output directory, just through PreviewAudio instead of SaveAudio. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 11:14:17 +01:00
Sebastian Krüger	c787b40311	fix: rebuild all MusicGen workflows with correct nodes and links Fixed medium, small, and melody workflows: - Replaced non-existent nodes with Musicgen_ from Sound Lab - Added missing links arrays to connect nodes properly - Updated all metadata and performance specs Note: Melody workflow simplified to text-only as Sound Lab doesn't currently support melody conditioning via audio input. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 11:09:33 +01:00
Sebastian Krüger	85b1831876	fix: rebuild MusicGen workflow with correct node types and links Changed from non-existent nodes to actual Sound Lab nodes: - Replaced MusicGenLoader/MusicGenTextEncode/MusicGenSampler with Musicgen_ - Replaced custom SaveAudio with standard SaveAudio node - Added missing links array to connect nodes - All parameters: prompt, duration, guidance_scale, seed, device Node is called "Musicgen_" (with underscore) from comfyui-sound-lab. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-23 11:06:42 +01:00
Sebastian Krüger	897dcb175a	refactor: reorganize directory structure and remove hardcoded paths Move comfyui and vllm out of models/ directory to top level for better organization. Replace all hardcoded /workspace paths with relative paths to make the configuration portable across different environments. Changes: - Move models/comfyui/ → comfyui/ - Move models/vllm/ → vllm/ - Remove models/ directory (empty) - Update arty.yml: replace /workspace with environment variables - Update supervisord.conf: use relative paths from /workspace/ai - Update all script references to use new paths - Maintain TQDM_DISABLE=1 to fix BrokenPipeError 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-22 20:49:27 +01:00

19 Commits
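
Commits 56476f4230 through 073711c017 all correct the positional `widgets_values` array of the DiffRhythmRun node. The following is a minimal Python sketch of one way to avoid such ordering mistakes: build the array from named parameters using the 11-entry order listed in commit 073711c017. The helper `build_widgets_values`, the constant `DIFFRHYTHM_WIDGET_ORDER`, the shortened key names `prompt` and `segments`, the sample prompt text, and the `"fixed"` value for `control_after_generate` are illustrative assumptions and do not come from the repository.

```python
from typing import Any

# Order as listed in commit 073711c017 (11 parameters).
# The key names are shortened for illustration (assumption).
DIFFRHYTHM_WIDGET_ORDER = (
    "model",
    "prompt",
    "unload_model",
    "odeint_method",
    "steps",
    "cfg",
    "quality_or_speed",
    "seed",
    "control_after_generate",
    "edit",
    "segments",
)


def build_widgets_values(params: dict[str, Any]) -> list[Any]:
    """Return the values in positional order, rejecting missing or unknown keys."""
    missing = [name for name in DIFFRHYTHM_WIDGET_ORDER if name not in params]
    unknown = [name for name in params if name not in DIFFRHYTHM_WIDGET_ORDER]
    if missing or unknown:
        raise ValueError(f"missing={missing}, unknown={unknown}")
    return [params[name] for name in DIFFRHYTHM_WIDGET_ORDER]


# Sample values follow the defaults mentioned in commit 44762a063c;
# the prompt text and "fixed" are placeholders (assumptions).
example_params = {
    "model": "cfm_model_v1_2.pt",
    "prompt": "placeholder style prompt",
    "unload_model": True,
    "odeint_method": "euler",
    "steps": 30,
    "cfg": 4,
    "quality_or_speed": "speed",
    "seed": 42,
    "control_after_generate": "fixed",
    "edit": False,
    "segments": "",
}

widgets_values = build_widgets_values(example_params)
```

Keying values by name means a missing or misspelled parameter raises an error up front, instead of silently shifting later values into the wrong slots as happened with the edit-related parameters.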