Files
runpod/comfyui/workflows/text-to-music/acestep-chinese-rap-v1.json
Sebastian Krüger 513062623c
All checks were successful
Build and Push RunPod Docker Image / build-and-push (push) Successful in 14s
feat: integrate ACE Step music generation with 19-language support
Added ACE Step v1 3.5B model for state-of-the-art music generation:
- 15x faster than LLM baselines with superior structural coherence
- Supports 19 languages (en, zh, ja, ko, fr, es, de, it, pt, ru + 9 more)
- Voice cloning, lyric alignment, and multi-genre capabilities

Changes:
- Added ACE Step models to models_huggingface.yaml (checkpoint + Chinese RAP LoRA)
- Added ComfyUI_ACE-Step custom node to arty.yml with installation script
- Created 4 comprehensive workflows in comfyui/workflows/text-to-music/:
  * acestep-simple-t2m-v1.json - Basic 60s text-to-music generation
  * acestep-multilang-t2m-v1.json - 19-language music generation
  * acestep-remix-m2m-v1.json - Music-to-music remixing with style transfer
  * acestep-chinese-rap-v1.json - Chinese hip-hop with specialized LoRA

🤖 Generated with Claude Code (https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-25 08:40:17 +01:00

66 lines
3.2 KiB
JSON

{
"last_node_id": 5,
"last_link_id": 4,
"nodes": [],
"links": [],
"groups": [],
"config": {},
"extra": {
"workflow_info": {
"name": "ACE Step Chinese RAP with LoRA v1",
"description": "Chinese hip-hop generation using ACE Step with specialized Chinese RAP LoRA",
"version": "1.0.0",
"author": "valknar@pivoine.art",
"category": "text-to-music",
"tags": ["acestep", "chinese-rap", "lora", "hip-hop", "zh-language"],
"requirements": {
"models": [
"ace_step_v1_3.5b.safetensors",
"ace-step-chinese-rap-lora.safetensors (ACE-Step/ACE-Step-v1-chinese-rap-LoRA)"
],
"vram_min": "8GB",
"vram_recommended": "16GB",
"custom_nodes": ["ComfyUI_ACE-Step (for LoRA loading)"]
},
"usage": {
"lora_strength": "0.8-1.0 (recommended for Chinese RAP)",
"language_tag": "[zh] prefix for Chinese lyrics",
"genre_tags": "chinese rap, hip-hop, trap, 90 BPM, 808 bass hi-hats snare, aggressive gritty, E minor",
"lyric_structure": "[verse], [chorus], [bridge] with Chinese characters",
"workflow": "LoadCheckpoint → LoadLoRA (strength 0.8-1.0) → TextEncode ([zh] + tags + lyrics) → KSampler → SaveAudio"
},
"benefits": {
"pronunciation": "Significantly improved Chinese pronunciation accuracy",
"genre_adherence": "Better hip-hop/electronic/trap style generation",
"cultural_authenticity": "More authentic Chinese rap flow and cadence",
"lyric_clarity": "Clearer articulation of Chinese characters"
},
"examples": {
"aggressive_rap": {
"tags": "chinese rap, 90 BPM, 808 bass hi-hats snare, aggressive gritty dark, E minor",
"lyrics": "[zh]\n[verse]\n在这个城市的夜晚 我独自前行\n没有人能够阻挡 我的决心\n[chorus]\n我要冲破这黑暗 找到光明\n不管有多少困难 我都不停"
},
"melodic_trap": {
"tags": "melodic trap, 85 BPM, 808 synth pad vocal chops, emotional melancholic, A minor",
"lyrics": "[zh]\n[verse]\n回忆像雨滴 落在我心里\n那些过去的日子 已经离去\n[chorus]\n但我还在这里 等待着奇迹\n希望有一天 能再次遇见你"
},
"boom_bap": {
"tags": "boom bap, 95 BPM, vinyl samples drums bass scratches, classic nostalgic, D minor",
"lyrics": "[zh]\n[verse]\n从街头到舞台 我一路走来\n用我的音乐 讲述我的未来\n[chorus]\n这就是我的故事 真实不虚伪\n中国说唱 永远不会退"
}
},
"notes": [
"LoRA specifically trained for Chinese hip-hop pronunciation and flow",
"Recommended strength: 0.8-1.0 (higher = stronger LoRA effect)",
"Works with aggressive rap, melodic trap, boom bap, and electronic styles",
"Improves pronunciation of complex Chinese characters and tones",
"Can combine with English sections using [en] language tags",
"BPM range: 80-120 for various hip-hop subgenres",
"Use aggressive/gritty tags for harder styles, emotional/melancholic for melodic",
"808 bass and hi-hats are characteristic of modern Chinese trap"
]
}
},
"version": 0.4
}