Files
runpod/comfyui/workflows/text-to-music/acestep-multilang-t2m-v1.json
Sebastian Krüger 513062623c
All checks were successful
Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s
feat: integrate ACE Step music generation with 19-language support
Added ACE Step v1 3.5B model for state-of-the-art music generation:
- 15x faster than LLM baselines with superior structural coherence
- Supports 19 languages (en, zh, ja, ko, fr, es, de, it, pt, ru + 9 more)
- Voice cloning, lyric alignment, and multi-genre capabilities

Changes:
- Added ACE Step models to models_huggingface.yaml (checkpoint + Chinese RAP LoRA)
- Added ComfyUI_ACE-Step custom node to arty.yml with installation script
- Created 4 comprehensive workflows in comfyui/workflows/text-to-music/:
  * acestep-simple-t2m-v1.json - Basic 60s text-to-music generation
  * acestep-multilang-t2m-v1.json - 19-language music generation
  * acestep-remix-m2m-v1.json - Music-to-music remixing with style transfer
  * acestep-chinese-rap-v1.json - Chinese hip-hop with specialized LoRA

🤖 Generated with Claude Code (https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-25 08:40:17 +01:00

49 lines
2.2 KiB
JSON

{
"last_node_id": 5,
"last_link_id": 4,
"nodes": [],
"links": [],
"groups": [],
"config": {},
"extra": {
"workflow_info": {
"name": "ACE Step Multi-Language Text-to-Music v1",
"description": "Multi-language music generation with ACE Step (19 languages supported)",
"version": "1.0.0",
"author": "valknar@pivoine.art",
"category": "text-to-music",
"tags": ["acestep", "music-generation", "multi-language", "19-languages"],
"requirements": {
"models": ["ace_step_v1_3.5b.safetensors"],
"vram_min": "8GB",
"vram_recommended": "16GB",
"custom_nodes": ["ComfyUI_ACE-Step (optional, for language switching)"]
},
"supported_languages": [
"English [en]", "Chinese [zh]", "Japanese [ja]", "Korean [ko]",
"French [fr]", "Spanish [es]", "German [de]", "Italian [it]",
"Portuguese [pt]", "Russian [ru]", "Arabic [ar]", "Hindi [hi]",
"Thai [th]", "Vietnamese [vi]", "Indonesian [id]", "Malay [ms]",
"Filipino [fil]", "Turkish [tr]", "Polish [pl]"
],
"usage": {
"language_tags": "Prefix lyrics with [en], [zh], [ja], [ko], etc.",
"mixed_language": "Use multiple language tags in same song",
"example_english": "[en]\\n[verse]\\nLet the music play\\n[chorus]\\nWe dance the night away",
"example_chinese": "[zh]\\n[verse]\\n让音乐响起\\n[chorus]\\n我们彻夜起舞",
"example_japanese": "[ja]\\n[verse]\\n音楽を鳴らそう\\n[chorus]\\n夜通し踊ろう",
"example_korean": "[ko]\\n[verse]\\n음악을 틀자\\n[chorus]\\n밤새 춤추자",
"example_mixed": "[en]\\n[verse]\\nLet the music play\\n[zh]\\n[chorus]\\n我们彻夜起舞\\n[ja]\\n[bridge]\\n夜通し踊ろう"
},
"notes": [
"10 languages have optimal support: en, zh, ja, ko, fr, es, de, it, pt, ru",
"Other 9 languages have good support but may have minor accent variations",
"Use ComfyUI_ACE-Step custom node for automatic language detection",
"Mixed-language songs work well - ACE Step handles transitions smoothly",
"Language-specific pronunciation is excellent (better than most TTS models)"
]
}
},
"version": 0.4
}