feat: add Supervisor proxy with Authelia SSO

Add an nginx reverse-proxy service for the Supervisor web UI at supervisor.ai.pivoine.art, protected by Authelia authentication. Proxies to the RunPod GPU instance over Tailscale (100.121.199.88:9001).

Changes:
- Create supervisor-nginx.conf for nginx proxy configuration
- Add supervisor service to docker-compose with Traefik labels
- Add supervisor.ai.pivoine.art to Authelia protected domains
- Remove deprecated Flux-related files

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-22 13:19:02 +01:00
parent bf402adb25
commit 0fd2eacad1
5 changed files with 72 additions and 339 deletions


@@ -1,181 +0,0 @@
# Flux Image Generation Setup for Open WebUI
This guide explains how to add Flux.1 Schnell image generation to your Open WebUI installation.
## Architecture
```
Open WebUI → flux_image_gen.py Function → LiteLLM (port 4000) → Orchestrator (RunPod port 9000) → Flux Model
```
## Installation
### Automatic (via Docker Compose)
The Flux function is **automatically loaded** via Docker volume mount. No manual upload needed!
**How it works:**
- Function file: `ai/functions/flux_image_gen.py`
- Mounted to: `/app/backend/data/functions/` in the container (read-only)
- Open WebUI automatically discovers and loads functions from this directory on startup
**To deploy:**
```bash
cd ~/Projects/docker-compose
pnpm arty up -d ai_webui # Restart Open WebUI to load function
```
### Verify Installation
After restarting Open WebUI, the function should automatically appear in:
1. **Admin Settings → Functions**: Listed as "Flux Image Generator"
2. **Model dropdown**: "Flux.1 Schnell (4-5s)" available for selection
If you don't see it:
```bash
# Check if function is mounted correctly
docker exec ai_webui ls -la /app/backend/data/functions/
# Check logs for any loading errors
docker logs ai_webui | grep -i flux
```
## Usage
### Basic Image Generation
1. **Select the Flux model:**
- In Open WebUI chat, select "Flux.1 Schnell (4-5s)" from the model dropdown
2. **Send your prompt:**
```
A serene mountain landscape at sunset with vibrant colors
```
3. **Wait for generation:**
- The function will call LiteLLM → Orchestrator → RunPod Flux
- Image appears in 4-5 seconds
### Advanced Options
The function supports custom sizes (configure in Valves):
- `1024x1024` (default, square)
- `1024x768` (landscape)
- `768x1024` (portrait)
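When calling the LiteLLM endpoint directly, the size can also be passed per request instead of via the valve. A minimal sketch (assumes LiteLLM is exposed on `localhost:4000`, as in the troubleshooting example below):
```bash
# Landscape request (1024x768) through LiteLLM (sketch)
curl -X POST http://localhost:4000/v1/images/generations \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "flux-schnell",
    "prompt": "A serene mountain landscape at sunset",
    "size": "1024x768",
    "n": 1,
    "response_format": "b64_json"
  }'
```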
## Configuration
### Valves (Customization)
To customize function behavior:
1. **Access Open WebUI**:
- Go to https://ai.pivoine.art
- Profile → Settings → Admin Settings → Functions
2. **Find Flux Image Generator**:
- Click on "Flux Image Generator" in the functions list
- Go to "Valves" tab
3. **Available Settings:**
- `LITELLM_API_BASE`: LiteLLM endpoint (default: `http://litellm:4000/v1`)
- `LITELLM_API_KEY`: API key (default: `dummy` - not needed for internal use)
- `DEFAULT_MODEL`: Model name (default: `flux-schnell`)
- `DEFAULT_SIZE`: Image dimensions (default: `1024x1024`)
- `TIMEOUT`: Request timeout in seconds (default: `120`)
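These valves map directly onto the HTTP request the function sends. A sketch of the equivalent call built from the defaults (note that `litellm:4000` only resolves inside the compose network; from the host, use `localhost:4000`):
```bash
# Equivalent request using the default valve values (sketch)
curl --max-time 120 -X POST http://litellm:4000/v1/images/generations \
  -H 'Content-Type: application/json' \
  -H 'Authorization: Bearer dummy' \
  -d '{"model": "flux-schnell", "prompt": "A test image", "size": "1024x1024"}'
```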
## Troubleshooting
### Function not appearing in model list
**Check:**
1. Function is enabled in Admin Settings → Functions
2. Function has no syntax errors (check logs)
3. Refresh browser cache (Ctrl+Shift+R)
### Image generation fails
**Check:**
1. LiteLLM is running: `docker ps | grep litellm`
2. LiteLLM can reach orchestrator: Check `docker logs ai_litellm`
3. Orchestrator is running on RunPod
4. Flux model is loaded: Check orchestrator logs
**Test LiteLLM directly:**
```bash
curl -X POST http://localhost:4000/v1/images/generations \
-H 'Content-Type: application/json' \
-d '{
"model": "flux-schnell",
"prompt": "A test image",
"size": "1024x1024"
}'
```
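To save the returned image for inspection, you can pipe the same call through `jq` and `base64` (a sketch; assumes `jq` is installed on the host):
```bash
# Extract the base64 payload and decode it to a PNG (sketch)
curl -s -X POST http://localhost:4000/v1/images/generations \
  -H 'Content-Type: application/json' \
  -d '{"model": "flux-schnell", "prompt": "A test image", "size": "1024x1024"}' \
  | jq -r '.data[0].b64_json' | base64 -d > test.png
```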
### Timeout errors
The default timeout is 120 seconds. If you're getting timeouts:
1. **Increase timeout in Valves:**
- Set `TIMEOUT` to `180` or higher
2. **Check Orchestrator status:**
- Flux model may still be loading (takes ~1 minute on first request)
## Technical Details
### How it Works
1. **User sends prompt** in Open WebUI chat interface
2. **Function extracts prompt** from messages array
3. **Function calls LiteLLM** `/v1/images/generations` endpoint
4. **LiteLLM routes to Orchestrator** via config (`http://100.121.199.88:9000/v1`)
5. **Orchestrator loads Flux** on RunPod GPU (if not already running)
6. **Flux generates image** in 4-5 seconds
7. **Image returns as base64** through the chain
8. **Function displays image** as markdown in chat
### Request Flow
```json
// Function sends to LiteLLM:
{
"model": "flux-schnell",
"prompt": "A serene mountain landscape",
"size": "1024x1024",
"n": 1,
"response_format": "b64_json"
}
// LiteLLM response:
{
"data": [{
"b64_json": "iVBORw0KGgoAAAANSUhEUgAA..."
}]
}
// Function converts to markdown:
![Generated Image](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAA...)
```
## Limitations
- **Single model**: Currently only Flux.1 Schnell is available
- **Sequential generation**: One image at a time (n=1)
- **Fixed format**: PNG format only
- **Orchestrator dependency**: Requires RunPod GPU server to be running
## Future Enhancements
Potential improvements:
- Multiple size presets in model dropdown
- Support for other Flux variants (Dev, Pro)
- Batch generation (n > 1)
- Image-to-image support
- Custom aspect ratios
## Support
- **Documentation**: `/home/valknar/Projects/docker-compose/CLAUDE.md`
- **RunPod README**: `/home/valknar/Projects/runpod/README.md`
- **LiteLLM Config**: `/home/valknar/Projects/docker-compose/ai/litellm-config.yaml`


@@ -232,6 +232,39 @@ services:
# Watchtower
- 'com.centurylinklabs.watchtower.enable=${WATCHTOWER_LABEL_ENABLE}'
# Supervisor - Process manager web UI (proxies to RunPod GPU)
supervisor:
image: nginx:alpine
container_name: ${AI_COMPOSE_PROJECT_NAME}_supervisor
restart: unless-stopped
environment:
TZ: ${TIMEZONE:-Europe/Berlin}
SUPERVISOR_BACKEND_HOST: ${SUPERVISOR_BACKEND_HOST:-100.121.199.88}
SUPERVISOR_BACKEND_PORT: ${SUPERVISOR_BACKEND_PORT:-9001}
volumes:
- ./supervisor-nginx.conf:/etc/nginx/nginx.conf.template:ro
command: /bin/sh -c "envsubst '$${SUPERVISOR_BACKEND_HOST},$${SUPERVISOR_BACKEND_PORT}' < /etc/nginx/nginx.conf.template > /etc/nginx/nginx.conf && exec nginx -g 'daemon off;'"
networks:
- compose_network
labels:
- 'traefik.enable=${AI_SUPERVISOR_TRAEFIK_ENABLED:-true}'
# HTTP to HTTPS redirect
- 'traefik.http.middlewares.${AI_COMPOSE_PROJECT_NAME}-supervisor-redirect-web-secure.redirectscheme.scheme=https'
- 'traefik.http.routers.${AI_COMPOSE_PROJECT_NAME}-supervisor-web.middlewares=${AI_COMPOSE_PROJECT_NAME}-supervisor-redirect-web-secure'
- 'traefik.http.routers.${AI_COMPOSE_PROJECT_NAME}-supervisor-web.rule=Host(`${AI_SUPERVISOR_TRAEFIK_HOST:-supervisor.ai.pivoine.art}`)'
- 'traefik.http.routers.${AI_COMPOSE_PROJECT_NAME}-supervisor-web.entrypoints=web'
# HTTPS router with Authelia SSO
- 'traefik.http.routers.${AI_COMPOSE_PROJECT_NAME}-supervisor-web-secure.rule=Host(`${AI_SUPERVISOR_TRAEFIK_HOST:-supervisor.ai.pivoine.art}`)'
- 'traefik.http.routers.${AI_COMPOSE_PROJECT_NAME}-supervisor-web-secure.tls.certresolver=resolver'
- 'traefik.http.routers.${AI_COMPOSE_PROJECT_NAME}-supervisor-web-secure.entrypoints=web-secure'
- 'traefik.http.middlewares.${AI_COMPOSE_PROJECT_NAME}-supervisor-web-secure-compress.compress=true'
- 'traefik.http.routers.${AI_COMPOSE_PROJECT_NAME}-supervisor-web-secure.middlewares=${AI_COMPOSE_PROJECT_NAME}-supervisor-web-secure-compress,net-authelia,security-headers@file'
# Service
- 'traefik.http.services.${AI_COMPOSE_PROJECT_NAME}-supervisor-web-secure.loadbalancer.server.port=80'
- 'traefik.docker.network=${NETWORK_NAME}'
# Watchtower
- 'com.centurylinklabs.watchtower.enable=${WATCHTOWER_LABEL_ENABLE}'
volumes:
ai_postgres_data:
name: ${AI_COMPOSE_PROJECT_NAME}_postgres_data


@@ -1,158 +0,0 @@
"""
title: Flux Image Generator
author: Valknar
version: 1.0.0
license: MIT
description: Generate images using Flux.1 Schnell via LiteLLM
requirements: requests, pydantic
"""
import os
import base64
import json
import requests
from typing import Generator
from pydantic import BaseModel, Field
class Pipe:
"""
Flux Image Generation Function for Open WebUI
Routes image generation requests to LiteLLM → Orchestrator → RunPod Flux
"""
class Valves(BaseModel):
"""Configuration valves for the image generation function"""
LITELLM_API_BASE: str = Field(
default="http://litellm:4000/v1",
description="LiteLLM API base URL"
)
LITELLM_API_KEY: str = Field(
default="dummy",
description="LiteLLM API key (not required for internal use)"
)
DEFAULT_MODEL: str = Field(
default="flux-schnell",
description="Default model to use for image generation"
)
DEFAULT_SIZE: str = Field(
default="1024x1024",
description="Default image size"
)
TIMEOUT: int = Field(
default=120,
description="Request timeout in seconds"
)
def __init__(self):
self.type = "manifold"
self.id = "flux_image_gen"
self.name = "Flux"
self.valves = self.Valves()
def pipes(self):
"""Return available models"""
return [
{
"id": "flux-schnell",
"name": "Flux.1 Schnell (4-5s)"
}
]
def pipe(self, body: dict) -> Generator[str, None, None]:
"""
Generate images via LiteLLM endpoint
Args:
body: Request body containing model, messages, etc.
Yields:
JSON chunks with generated image data
"""
try:
# Extract the prompt from messages
messages = body.get("messages", [])
if not messages:
yield self._error_response("No messages provided")
return
# Get the last user message as prompt
prompt = messages[-1].get("content", "")
if not prompt:
yield self._error_response("No prompt provided")
return
# Prepare image generation request
image_request = {
"model": body.get("model", self.valves.DEFAULT_MODEL),
"prompt": prompt,
"size": body.get("size", self.valves.DEFAULT_SIZE),
"n": 1,
"response_format": "b64_json"
}
# Call LiteLLM images endpoint
response = requests.post(
f"{self.valves.LITELLM_API_BASE}/images/generations",
json=image_request,
headers={
"Content-Type": "application/json",
"Authorization": f"Bearer {self.valves.LITELLM_API_KEY}"
},
timeout=self.valves.TIMEOUT
)
if response.status_code != 200:
yield self._error_response(
f"Image generation failed: {response.status_code} - {response.text}"
)
return
# Parse response
result = response.json()
# Check if we got image data
if "data" not in result or len(result["data"]) == 0:
yield self._error_response("No image data in response")
return
# Get base64 image data
image_data = result["data"][0].get("b64_json")
if not image_data:
yield self._error_response("No base64 image data in response")
return
# Return image as markdown
image_markdown = f"![Generated Image](data:image/png;base64,{image_data})\n\n**Prompt:** {prompt}"
# Yield final response
yield json.dumps({
"choices": [{
"index": 0,
"message": {
"role": "assistant",
"content": image_markdown
},
"finish_reason": "stop"
}]
})
except requests.Timeout:
yield self._error_response(f"Request timed out after {self.valves.TIMEOUT}s")
except requests.RequestException as e:
yield self._error_response(f"Request failed: {str(e)}")
except Exception as e:
yield self._error_response(f"Unexpected error: {str(e)}")
def _error_response(self, error_message: str) -> str:
"""Generate error response in OpenAI format"""
return json.dumps({
"choices": [{
"index": 0,
"message": {
"role": "assistant",
"content": f"Error: {error_message}"
},
"finish_reason": "stop"
}]
})

ai/supervisor-nginx.conf Normal file

@@ -0,0 +1,38 @@
events {
worker_connections 1024;
}
http {
# Proxy settings
proxy_http_version 1.1;
proxy_buffering off;
proxy_set_header Host $host;
proxy_set_header X-Real-IP $remote_addr;
proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
proxy_set_header X-Forwarded-Proto $scheme;
# Timeouts for Supervisor web UI (quick responses)
proxy_connect_timeout 60;
proxy_send_timeout 60;
proxy_read_timeout 60;
send_timeout 60;
server {
listen 80;
server_name _;
location / {
# Proxy to Supervisor on RunPod via Tailscale
proxy_pass http://${SUPERVISOR_BACKEND_HOST}:${SUPERVISOR_BACKEND_PORT};
# Proxy headers
proxy_set_header Host $host;
proxy_set_header X-Real-IP $remote_addr;
proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
proxy_set_header X-Forwarded-Proto $scheme;
# Disable buffering for real-time updates
proxy_buffering off;
}
}
}


@@ -75,6 +75,7 @@ access_control:
- "facefusion.ai.pivoine.art" - "facefusion.ai.pivoine.art"
- "pinchflat.media.pivoine.art" - "pinchflat.media.pivoine.art"
- "comfy.ai.pivoine.art" - "comfy.ai.pivoine.art"
- "supervisor.ai.pivoine.art"
policy: one_factor policy: one_factor