Initial commit: RunPod multi-modal AI orchestration stack

- Multi-modal AI infrastructure for RunPod RTX 4090 - Automatic model orchestration (text, image, music) - Text: vLLM + Qwen 2.5 7B Instruct - Image: Flux.1 Schnell via OpenEDAI - Music: MusicGen Medium via AudioCraft - Cost-optimized sequential loading on single GPU - Template preparation scripts for rapid deployment - Comprehensive documentation (README, DEPLOYMENT, TEMPLATE)
2025-11-21 14:34:55 +01:00
commit 277f1c95bd
35 changed files with 7654 additions and 0 deletions
--- a/flux/config/config.json
+++ b/flux/config/config.json
@@ -0,0 +1,13 @@
+{
+  "model": "flux-schnell",
+  "offload": true,
+  "sequential_cpu_offload": false,
+  "vae_tiling": true,
+  "enable_model_cpu_offload": true,
+  "low_vram_mode": false,
+  "torch_compile": false,
+  "safety_checker": false,
+  "watermark": false,
+  "flux_device": "cuda",
+  "compile": false
+}