From 0fd2eacad13af728bb9e52532fb4ab9030c9fe54 Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Sebastian=20Kr=C3=BCger?= <valknar@pivoine.art>
Date: Sat, 22 Nov 2025 13:19:02 +0100
Subject: [PATCH] feat: add Supervisor proxy with Authelia SSO
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Add nginx reverse proxy service for Supervisor web UI at supervisor.ai.pivoine.art with Authelia authentication. Proxies to RunPod GPU instance via Tailscale (100.121.199.88:9001).

Changes:
- Create supervisor-nginx.conf for nginx proxy configuration
- Add supervisor service to docker-compose with Traefik labels
- Add supervisor.ai.pivoine.art to Authelia protected domains
- Remove deprecated Flux-related files

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
---
 ai/FLUX_SETUP.md               | 181 ---------------------------------
 ai/compose.yaml                |  33 ++++++
 ai/functions/flux_image_gen.py | 158 ----------------------------
 ai/supervisor-nginx.conf       |  38 +++++++
 net/authelia/configuration.yml |   1 +
 5 files changed, 72 insertions(+), 339 deletions(-)
 delete mode 100644 ai/FLUX_SETUP.md
 delete mode 100644 ai/functions/flux_image_gen.py
 create mode 100644 ai/supervisor-nginx.conf

diff --git a/ai/FLUX_SETUP.md b/ai/FLUX_SETUP.md
deleted file mode 100644
index 3eb7414..0000000
--- a/ai/FLUX_SETUP.md
+++ /dev/null
@@ -1,181 +0,0 @@
-# Flux Image Generation Setup for Open WebUI
-
-This guide explains how to add Flux.1 Schnell image generation to your Open WebUI installation.
-
-## Architecture
-
-```
-Open WebUI → flux_image_gen.py Function → LiteLLM (port 4000) → Orchestrator (RunPod port 9000) → Flux Model
-```
-
-## Installation
-
-### Automatic (via Docker Compose)
-
-The Flux function is **automatically loaded** via Docker volume mount. No manual upload needed!
-
-**How it works:**
-- Function file: `ai/functions/flux_image_gen.py`
-- Mounted to: `/app/backend/data/functions/` in the container (read-only)
-- Open WebUI automatically discovers and loads functions from this directory on startup
-
-**To deploy:**
-```bash
-cd ~/Projects/docker-compose
-pnpm arty up -d ai_webui  # Restart Open WebUI to load function
-```
-
-### Verify Installation
-
-After restarting Open WebUI, the function should automatically appear in:
-1. **Admin Settings → Functions**: Listed as "Flux Image Generator"
-2. **Model dropdown**: "Flux.1 Schnell (4-5s)" available for selection
-
-If you don't see it:
-```bash
-# Check if function is mounted correctly
-docker exec ai_webui ls -la /app/backend/data/functions/
-
-# Check logs for any loading errors
-docker logs ai_webui | grep -i flux
-```
-
-## Usage
-
-### Basic Image Generation
-
-1. **Select the Flux model:**
-   - In Open WebUI chat, select "Flux.1 Schnell (4-5s)" from the model dropdown
-
-2. **Send your prompt:**
-   ```
-   A serene mountain landscape at sunset with vibrant colors
-   ```
-
-3. **Wait for generation:**
-   - The function will call LiteLLM → Orchestrator → RunPod Flux
-   - Image appears in 4-5 seconds
-
-### Advanced Options
-
-The function supports custom sizes (configure in Valves):
-- `1024x1024` (default, square)
-- `1024x768` (landscape)
-- `768x1024` (portrait)
-
-## Configuration
-
-### Valves (Customization)
-
-To customize function behavior:
-
-1. **Access Open WebUI**:
-   - Go to https://ai.pivoine.art
-   - Profile → Settings → Admin Settings → Functions
-
-2. **Find Flux Image Generator**:
-   - Click on "Flux Image Generator" in the functions list
-   - Go to "Valves" tab
-
-3. **Available Settings:**
-   - `LITELLM_API_BASE`: LiteLLM endpoint (default: `http://litellm:4000/v1`)
-   - `LITELLM_API_KEY`: API key (default: `dummy` - not needed for internal use)
-   - `DEFAULT_MODEL`: Model name (default: `flux-schnell`)
-   - `DEFAULT_SIZE`: Image dimensions (default: `1024x1024`)
-   - `TIMEOUT`: Request timeout in seconds (default: `120`)
-
-## Troubleshooting
-
-### Function not appearing in model list
-
-**Check:**
-1. Function is enabled in Admin Settings → Functions
-2. Function has no syntax errors (check logs)
-3. Refresh browser cache (Ctrl+Shift+R)
-
-### Image generation fails
-
-**Check:**
-1. LiteLLM is running: `docker ps | grep litellm`
-2. LiteLLM can reach orchestrator: Check `docker logs ai_litellm`
-3. Orchestrator is running on RunPod
-4. Flux model is loaded: Check orchestrator logs
-
-**Test LiteLLM directly:**
-```bash
-curl -X POST http://localhost:4000/v1/images/generations \
-  -H 'Content-Type: application/json' \
-  -d '{
-    "model": "flux-schnell",
-    "prompt": "A test image",
-    "size": "1024x1024"
-  }'
-```
-
-### Timeout errors
-
-The default timeout is 120 seconds. If you're getting timeouts:
-
-1. **Increase timeout in Valves:**
-   - Set `TIMEOUT` to `180` or higher
-
-2. **Check Orchestrator status:**
-   - Flux model may still be loading (takes ~1 minute on first request)
-
-## Technical Details
-
-### How it Works
-
-1. **User sends prompt** in Open WebUI chat interface
-2. **Function extracts prompt** from messages array
-3. **Function calls LiteLLM** `/v1/images/generations` endpoint
-4. **LiteLLM routes to Orchestrator** via config (`http://100.121.199.88:9000/v1`)
-5. **Orchestrator loads Flux** on RunPod GPU (if not already running)
-6. **Flux generates image** in 4-5 seconds
-7. **Image returns as base64** through the chain
-8. **Function displays image** as markdown in chat
-
-### Request Flow
-
-```json
-// Function sends to LiteLLM:
-{
-  "model": "flux-schnell",
-  "prompt": "A serene mountain landscape",
-  "size": "1024x1024",
-  "n": 1,
-  "response_format": "b64_json"
-}
-
-// LiteLLM response:
-{
-  "data": [{
-    "b64_json": "iVBORw0KGgoAAAANSUhEUgAA..."
-  }]
-}
-
-// Function converts to markdown:
-![Generated Image](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAA...)
-```
-
-## Limitations
-
-- **Single model**: Currently only Flux.1 Schnell is available
-- **Sequential generation**: One image at a time (n=1)
-- **Fixed format**: PNG format only
-- **Orchestrator dependency**: Requires RunPod GPU server to be running
-
-## Future Enhancements
-
-Potential improvements:
-- Multiple size presets in model dropdown
-- Support for other Flux variants (Dev, Pro)
-- Batch generation (n > 1)
-- Image-to-image support
-- Custom aspect ratios
-
-## Support
-
-- **Documentation**: `/home/valknar/Projects/docker-compose/CLAUDE.md`
-- **RunPod README**: `/home/valknar/Projects/runpod/README.md`
-- **LiteLLM Config**: `/home/valknar/Projects/docker-compose/ai/litellm-config.yaml`
diff --git a/ai/compose.yaml b/ai/compose.yaml
index 84e96a2..94800ae 100644
--- a/ai/compose.yaml
+++ b/ai/compose.yaml
@@ -232,6 +232,39 @@ services:
       # Watchtower
       - 'com.centurylinklabs.watchtower.enable=${WATCHTOWER_LABEL_ENABLE}'
 
+  # Supervisor - Process manager web UI (proxies to RunPod GPU)
+  supervisor:
+    image: nginx:alpine
+    container_name: ${AI_COMPOSE_PROJECT_NAME}_supervisor
+    restart: unless-stopped
+    environment:
+      TZ: ${TIMEZONE:-Europe/Berlin}
+      SUPERVISOR_BACKEND_HOST: ${SUPERVISOR_BACKEND_HOST:-100.121.199.88}
+      SUPERVISOR_BACKEND_PORT: ${SUPERVISOR_BACKEND_PORT:-9001}
+    volumes:
+      - ./supervisor-nginx.conf:/etc/nginx/nginx.conf.template:ro
+    command: /bin/sh -c "envsubst '$${SUPERVISOR_BACKEND_HOST},$${SUPERVISOR_BACKEND_PORT}' < /etc/nginx/nginx.conf.template > /etc/nginx/nginx.conf && exec nginx -g 'daemon off;'"
+    networks:
+      - compose_network
+    labels:
+      - 'traefik.enable=${AI_SUPERVISOR_TRAEFIK_ENABLED:-true}'
+      # HTTP to HTTPS redirect
+      - 'traefik.http.middlewares.${AI_COMPOSE_PROJECT_NAME}-supervisor-redirect-web-secure.redirectscheme.scheme=https'
+      - 'traefik.http.routers.${AI_COMPOSE_PROJECT_NAME}-supervisor-web.middlewares=${AI_COMPOSE_PROJECT_NAME}-supervisor-redirect-web-secure'
+      - 'traefik.http.routers.${AI_COMPOSE_PROJECT_NAME}-supervisor-web.rule=Host(`${AI_SUPERVISOR_TRAEFIK_HOST:-supervisor.ai.pivoine.art}`)'
+      - 'traefik.http.routers.${AI_COMPOSE_PROJECT_NAME}-supervisor-web.entrypoints=web'
+      # HTTPS router with Authelia SSO
+      - 'traefik.http.routers.${AI_COMPOSE_PROJECT_NAME}-supervisor-web-secure.rule=Host(`${AI_SUPERVISOR_TRAEFIK_HOST:-supervisor.ai.pivoine.art}`)'
+      - 'traefik.http.routers.${AI_COMPOSE_PROJECT_NAME}-supervisor-web-secure.tls.certresolver=resolver'
+      - 'traefik.http.routers.${AI_COMPOSE_PROJECT_NAME}-supervisor-web-secure.entrypoints=web-secure'
+      - 'traefik.http.middlewares.${AI_COMPOSE_PROJECT_NAME}-supervisor-web-secure-compress.compress=true'
+      - 'traefik.http.routers.${AI_COMPOSE_PROJECT_NAME}-supervisor-web-secure.middlewares=${AI_COMPOSE_PROJECT_NAME}-supervisor-web-secure-compress,net-authelia,security-headers@file'
+      # Service
+      - 'traefik.http.services.${AI_COMPOSE_PROJECT_NAME}-supervisor-web-secure.loadbalancer.server.port=80'
+      - 'traefik.docker.network=${NETWORK_NAME}'
+      # Watchtower
+      - 'com.centurylinklabs.watchtower.enable=${WATCHTOWER_LABEL_ENABLE}'
+
 volumes:
   ai_postgres_data:
     name: ${AI_COMPOSE_PROJECT_NAME}_postgres_data
diff --git a/ai/functions/flux_image_gen.py b/ai/functions/flux_image_gen.py
deleted file mode 100644
index fe2b392..0000000
--- a/ai/functions/flux_image_gen.py
+++ /dev/null
@@ -1,158 +0,0 @@
-"""
-title: Flux Image Generator
-author: Valknar
-version: 1.0.0
-license: MIT
-description: Generate images using Flux.1 Schnell via LiteLLM
-requirements: requests, pydantic
-"""
-
-import os
-import base64
-import json
-import requests
-from typing import Generator
-from pydantic import BaseModel, Field
-
-
-class Pipe:
-    """
-    Flux Image Generation Function for Open WebUI
-    Routes image generation requests to LiteLLM → Orchestrator → RunPod Flux
-    """
-
-    class Valves(BaseModel):
-        """Configuration valves for the image generation function"""
-        LITELLM_API_BASE: str = Field(
-            default="http://litellm:4000/v1",
-            description="LiteLLM API base URL"
-        )
-        LITELLM_API_KEY: str = Field(
-            default="dummy",
-            description="LiteLLM API key (not required for internal use)"
-        )
-        DEFAULT_MODEL: str = Field(
-            default="flux-schnell",
-            description="Default model to use for image generation"
-        )
-        DEFAULT_SIZE: str = Field(
-            default="1024x1024",
-            description="Default image size"
-        )
-        TIMEOUT: int = Field(
-            default=120,
-            description="Request timeout in seconds"
-        )
-
-    def __init__(self):
-        self.type = "manifold"
-        self.id = "flux_image_gen"
-        self.name = "Flux"
-        self.valves = self.Valves()
-
-    def pipes(self):
-        """Return available models"""
-        return [
-            {
-                "id": "flux-schnell",
-                "name": "Flux.1 Schnell (4-5s)"
-            }
-        ]
-
-    def pipe(self, body: dict) -> Generator[str, None, None]:
-        """
-        Generate images via LiteLLM endpoint
-
-        Args:
-            body: Request body containing model, messages, etc.
-
-        Yields:
-            JSON chunks with generated image data
-        """
-        try:
-            # Extract the prompt from messages
-            messages = body.get("messages", [])
-            if not messages:
-                yield self._error_response("No messages provided")
-                return
-
-            # Get the last user message as prompt
-            prompt = messages[-1].get("content", "")
-            if not prompt:
-                yield self._error_response("No prompt provided")
-                return
-
-            # Prepare image generation request
-            image_request = {
-                "model": body.get("model", self.valves.DEFAULT_MODEL),
-                "prompt": prompt,
-                "size": body.get("size", self.valves.DEFAULT_SIZE),
-                "n": 1,
-                "response_format": "b64_json"
-            }
-
-            # Call LiteLLM images endpoint
-            response = requests.post(
-                f"{self.valves.LITELLM_API_BASE}/images/generations",
-                json=image_request,
-                headers={
-                    "Content-Type": "application/json",
-                    "Authorization": f"Bearer {self.valves.LITELLM_API_KEY}"
-                },
-                timeout=self.valves.TIMEOUT
-            )
-
-            if response.status_code != 200:
-                yield self._error_response(
-                    f"Image generation failed: {response.status_code} - {response.text}"
-                )
-                return
-
-            # Parse response
-            result = response.json()
-
-            # Check if we got image data
-            if "data" not in result or len(result["data"]) == 0:
-                yield self._error_response("No image data in response")
-                return
-
-            # Get base64 image data
-            image_data = result["data"][0].get("b64_json")
-            if not image_data:
-                yield self._error_response("No base64 image data in response")
-                return
-
-            # Return image as markdown
-            image_markdown = f"![Generated Image](data:image/png;base64,{image_data})\n\n**Prompt:** {prompt}"
-
-            # Yield final response
-            yield json.dumps({
-                "choices": [{
-                    "index": 0,
-                    "message": {
-                        "role": "assistant",
-                        "content": image_markdown
-                    },
-                    "finish_reason": "stop"
-                }]
-            })
-
-        except requests.Timeout:
-            yield self._error_response(f"Request timed out after {self.valves.TIMEOUT}s")
-        except requests.RequestException as e:
-            yield self._error_response(f"Request failed: {str(e)}")
-        except Exception as e:
-            yield self._error_response(f"Unexpected error: {str(e)}")
-
-    def _error_response(self, error_message: str) -> str:
-        """Generate error response in OpenAI format"""
-        return json.dumps({
-            "choices": [{
-                "index": 0,
-                "message": {
-                    "role": "assistant",
-                    "content": f"Error: {error_message}"
-                },
-                "finish_reason": "stop"
-            }]
-        })
diff --git a/ai/supervisor-nginx.conf b/ai/supervisor-nginx.conf
new file mode 100644
index 0000000..4e00b6e
--- /dev/null
+++ b/ai/supervisor-nginx.conf
@@ -0,0 +1,38 @@
+events {
+    worker_connections 1024;
+}
+
+http {
+    # Proxy settings
+    proxy_http_version 1.1;
+    proxy_buffering off;
+    proxy_set_header Host $host;
+    proxy_set_header X-Real-IP $remote_addr;
+    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
+    proxy_set_header X-Forwarded-Proto $scheme;
+
+    # Timeouts for Supervisor web UI (quick responses)
+    proxy_connect_timeout 60;
+    proxy_send_timeout 60;
+    proxy_read_timeout 60;
+    send_timeout 60;
+
+    server {
+        listen 80;
+        server_name _;
+
+        location / {
+            # Proxy to Supervisor on RunPod via Tailscale
+            proxy_pass http://${SUPERVISOR_BACKEND_HOST}:${SUPERVISOR_BACKEND_PORT};
+
+            # Proxy headers
+            proxy_set_header Host $host;
+            proxy_set_header X-Real-IP $remote_addr;
+            proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
+            proxy_set_header X-Forwarded-Proto $scheme;
+
+            # Disable buffering for real-time updates
+            proxy_buffering off;
+        }
+    }
+}
diff --git a/net/authelia/configuration.yml b/net/authelia/configuration.yml
index 42db7ab..7e3c031 100644
--- a/net/authelia/configuration.yml
+++ b/net/authelia/configuration.yml
@@ -75,6 +75,7 @@ access_control:
         - "facefusion.ai.pivoine.art"
         - "pinchflat.media.pivoine.art"
         - "comfy.ai.pivoine.art"
+        - "supervisor.ai.pivoine.art"
       policy: one_factor