- Multi-modal AI infrastructure for RunPod RTX 4090 - Automatic model orchestration (text, image, music) - Text: vLLM + Qwen 2.5 7B Instruct - Image: Flux.1 Schnell via OpenEDAI - Music: MusicGen Medium via AudioCraft - Cost-optimized sequential loading on single GPU - Template preparation scripts for rapid deployment - Comprehensive documentation (README, DEPLOYMENT, TEMPLATE)
14 lines
288 B
JSON
14 lines
288 B
JSON
{
|
|
"model": "flux-schnell",
|
|
"offload": true,
|
|
"sequential_cpu_offload": false,
|
|
"vae_tiling": true,
|
|
"enable_model_cpu_offload": true,
|
|
"low_vram_mode": false,
|
|
"torch_compile": false,
|
|
"safety_checker": false,
|
|
"watermark": false,
|
|
"flux_device": "cuda",
|
|
"compile": false
|
|
}
|