- Replace GPU_TAILSCALE_IP interpolation with GPU_VLLM_API_URL
- LiteLLM requires a full URL in api_base when using the os.environ/ syntax
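A minimal sketch of the resulting model entry in the LiteLLM config (model name taken from the surrounding commits; the exact config.yaml layout is an assumption):

```yaml
model_list:
  - model_name: qwen-2.5-7b
    litellm_params:
      model: openai/qwen-2.5-7b              # served via vLLM's OpenAI-compatible API
      api_base: os.environ/GPU_VLLM_API_URL  # LiteLLM resolves os.environ/ refs; must expand to a full URL
```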
- Replace COMFYUI_BACKEND_HOST and SUPERVISOR_BACKEND_HOST with GPU_TAILSCALE_IP
- Update LiteLLM config to use os.environ/GPU_TAILSCALE_IP for vLLM models
- Add GPU_TAILSCALE_IP env var to LiteLLM service
- Configure qwen-2.5-7b and llama-3.1-8b to route through the orchestrator
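The compose side passes the variable through to the container, roughly as follows (service name assumed):

```yaml
services:
  litellm:
    environment:
      GPU_TAILSCALE_IP: ${GPU_TAILSCALE_IP}  # consumed via os.environ/ in the LiteLLM config
```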
- Remove hardcoded default values from compose.yaml
- Backend IPs now managed via environment variables only
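Concretely, this means dropping the interpolation fallbacks in compose.yaml:

```yaml
environment:
  # before: a baked-in fallback could silently mask missing configuration
  # GPU_TAILSCALE_IP: ${GPU_TAILSCALE_IP:-100.121.199.88}
  # after: the value must come from the environment (:? would fail fast instead)
  GPU_TAILSCALE_IP: ${GPU_TAILSCALE_IP}
```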
Add an nginx reverse proxy service for the Supervisor web UI at supervisor.ai.pivoine.art with Authelia authentication. Proxies to the RunPod GPU instance via Tailscale (100.121.199.88:9001).
Changes:
- Create supervisor-nginx.conf for nginx proxy configuration
- Add supervisor service to docker-compose with Traefik labels
- Add supervisor.ai.pivoine.art to Authelia protected domains
- Remove deprecated Flux-related files
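A sketch of the compose service (router and label names follow common Traefik conventions and are assumptions, apart from the net-authelia middleware named elsewhere in this stack):

```yaml
services:
  supervisor:
    image: nginx:alpine
    volumes:
      - ./supervisor-nginx.conf:/etc/nginx/conf.d/default.conf:ro
    labels:
      - traefik.enable=true
      - traefik.http.routers.supervisor.rule=Host(`supervisor.ai.pivoine.art`)
      - traefik.http.routers.supervisor.middlewares=net-authelia
```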
- Replace hardcoded IP in comfyui-nginx.conf with env vars
- Add COMFYUI_BACKEND_HOST and COMFYUI_BACKEND_PORT to compose.yaml
- Use envsubst to substitute variables at container startup
- Defaults: 100.121.199.88:8188 (current RunPod Tailscale IP)
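A sketch of the wiring, assuming the stock nginx image's built-in envsubst template support (the template mount path is the image's standard one; file names are assumptions):

```yaml
services:
  comfyui:
    image: nginx:alpine
    environment:
      COMFYUI_BACKEND_HOST: ${COMFYUI_BACKEND_HOST:-100.121.199.88}
      COMFYUI_BACKEND_PORT: ${COMFYUI_BACKEND_PORT:-8188}
    volumes:
      # the nginx image runs envsubst over /etc/nginx/templates/*.template at startup
      - ./comfyui-nginx.conf:/etc/nginx/templates/default.conf.template:ro
```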
- Add ComfyUI service to AI stack using nginx:alpine as reverse proxy
- Proxy to RunPod ComfyUI via Tailscale (100.121.199.88:8188)
- Configure Traefik routing for comfy.ai.pivoine.art
- Enable Authelia SSO middleware (net-authelia)
- Support WebSocket connections for real-time updates
- Set appropriate timeouts for image generation (300s)
- Set drop_params: false in litellm_settings
- Set modify_params: false in litellm_settings
- Set drop_params: false in default_litellm_params
- Commented out LITELLM_DROP_PARAMS env var
- Removed --drop_params command flag
These settings were stripping critical streaming parameters, causing
vLLM streaming responses to collapse into empty deltas.
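The relevant parts of the LiteLLM config after this change might look like this (key names taken from this commit):

```yaml
litellm_settings:
  drop_params: false    # pass streaming parameters through untouched
  modify_params: false  # don't rewrite requests before forwarding to vLLM

default_litellm_params:
  drop_params: false
```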
- Reverted direct orchestrator connection to WebUI
- Added stream: true parameter to qwen-2.5-7b model config
- Keep LiteLLM as the single proxy for all models
- Configure WebUI with both LiteLLM and direct orchestrator API base URLs
- This bypasses LiteLLM's streaming issues for the qwen-2.5-7b model
- WebUI will now show models from both endpoints
- Allows testing whether LiteLLM is the bottleneck for streaming
Related to streaming fix in RunPod models/vllm/server.py
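A sketch of the Open WebUI side, assuming its standard semicolon-separated OPENAI_API_BASE_URLS variable (hostnames and ports are assumptions):

```yaml
services:
  webui:
    environment:
      # first entry: the LiteLLM proxy; second: the vLLM orchestrator directly, bypassing LiteLLM
      OPENAI_API_BASE_URLS: "http://litellm:4000/v1;http://${GPU_TAILSCALE_IP}:8000/v1"
```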
Reduce database logging overhead and enable prompt caching:
- Disabled verbose logging (set_verbose: false)
- Disabled spend tracking logs to reduce DB writes
- Disabled tag tracking and daily spend logs
- Removed success/failure callbacks
- Enabled prompt caching for claude-sonnet-4.5
- Set log level to ERROR only
- Removed --detailed_debug flag from command
This should significantly improve response times by eliminating
unnecessary database writes for every request.
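A sketch of the corresponding config (set_verbose and disable_spend_logs are documented LiteLLM options; the tag-tracking and daily-spend keys named above are taken from this commit and omitted here):

```yaml
litellm_settings:
  set_verbose: false        # no verbose request/response dumps

general_settings:
  disable_spend_logs: true  # skip the per-request spend write to Postgres
```

The ERROR-only log level can be set via the LITELLM_LOG environment variable.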
Replace HTTP Basic Auth with Authelia ForwardAuth for consistent
authentication across the infrastructure:
- Asciinema Admin (admin.asciinema.dev.pivoine.art): Removed Basic Auth,
added Authelia protection
- FaceFusion (facefusion.ai.pivoine.art): Removed Basic Auth, added
Authelia protection
Updated Authelia access control to include both services with one_factor
policy.
All services now use Authelia for authentication, eliminating the need
to manage separate Basic Auth credentials.
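A sketch of the Authelia access-control entries (schema per Authelia's configuration format; surrounding rules omitted):

```yaml
access_control:
  rules:
    - domain: admin.asciinema.dev.pivoine.art
      policy: one_factor
    - domain: facefusion.ai.pivoine.art
      policy: one_factor
```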
- Add Mailpit service to NET stack with web UI at mailpit.pivoine.art
- Configure Mailpit to relay all emails through IONOS SMTP
- Migrate all 11+ services to use Mailpit instead of direct IONOS SMTP:
* SEXY: Directus API
* UTIL: Joplin, Mattermost, Vaultwarden, Tandoor, Linkwarden
* DEV: Gitea, n8n, Asciinema
* AI: Open WebUI
* NET: Netdata (via msmtp)
- Centralize SMTP credentials in mailpit-relay.yaml
- Simplify service configs (no auth/TLS for internal SMTP)
- Enable email monitoring via Mailpit web UI with Basic Auth
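A sketch of the Mailpit service, assuming Mailpit's documented relay options (image and variable names from upstream Mailpit; the relay file path is an assumption):

```yaml
services:
  mailpit:
    image: axllent/mailpit
    environment:
      MP_SMTP_RELAY_CONFIG: /etc/mailpit/mailpit-relay.yaml  # IONOS credentials live here
      MP_SMTP_RELAY_ALL: "true"  # relay every message, not just allow-listed recipients
    volumes:
      - ./mailpit-relay.yaml:/etc/mailpit/mailpit-relay.yaml:ro
```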
- Replace DISABLE_SWAGGER_UI with NO_DOCS and NO_REDOC
- Follows the official LiteLLM documentation for disabling API docs
- Disables both the Swagger UI and Redoc documentation interfaces
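I.e., in the LiteLLM service environment (values per LiteLLM's docs):

```yaml
environment:
  NO_DOCS: "True"   # disables the Swagger UI
  NO_REDOC: "True"  # disables the Redoc interface
```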
- Add --disable_swagger flag to LiteLLM command
- Improves security by hiding the API documentation interface
Watchtower was trying to pull updates from Docker Hub for facefusion-patched:3.5.0-cpu,
an image that exists only locally, spamming the logs with errors. Disabled Watchtower
monitoring for this container, since it's a custom-built image with NSFW filter patches.
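Watchtower honors a per-container opt-out label, so the fix is one line on the service:

```yaml
services:
  facefusion:
    labels:
      - com.centurylinklabs.watchtower.enable=false  # skip this locally built image
```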
Added the FaceFusion face-swapping service to the AI stack:
**Configuration:**
- URL: https://facefusion.ai.pivoine.art
- Image: facefusion/facefusion:3.5.0-cpu
- Port: 7865
- Container: ai_facefusion
- Volume: ai_facefusion_data
- HTTP Basic Auth protection
- CPU execution mode (GPU when available)
**Changes:**
- Added facefusion service to ai/compose.yaml
- Added AI_FACEFUSION_* env vars to arty.yml
- Created ai_facefusion_data volume
- Removed old standalone facefusion stack
- Removed ai/README-export.md and ai/webui-export.py
FaceFusion will run on CPU until a GPU server is available.
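A compose sketch matching the configuration above (the volume mount target inside the container is an assumption):

```yaml
services:
  facefusion:
    image: facefusion/facefusion:3.5.0-cpu
    container_name: ai_facefusion
    volumes:
      - ai_facefusion_data:/facefusion  # assumed data path

volumes:
  ai_facefusion_data:
```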
Created database initialization script following the core stack pattern.
The script automatically creates required databases on first initialization:
- openwebui: Open WebUI application database
- litellm: LiteLLM proxy database for API key management and tracking
Changes:
- Created ai/postgres/init/01-init-databases.sh
- Mounted init directory in ai_postgres service
- Added automatic privilege grants to AI_DB_USER
Note: the init script runs only on first initialization, when the data volume is empty.
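The mount uses the official postgres image's init hook, so no custom entrypoint is needed:

```yaml
services:
  ai_postgres:
    volumes:
      # scripts here run once, when the data directory is first created
      - ./postgres/init:/docker-entrypoint-initdb.d:ro
```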
LiteLLM now uses the ai_postgres database instance with a dedicated
'litellm' database for API key management, usage tracking, and rate limiting.
Changes:
- Set DATABASE_URL to postgresql://ai:password@ai_postgres:5432/litellm
- Added depends_on ai_postgres to ensure DB starts first
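In compose terms (the ${AI_DB_PASSWORD} variable is a hypothetical stand-in for the placeholder password above):

```yaml
services:
  litellm:
    environment:
      DATABASE_URL: postgresql://ai:${AI_DB_PASSWORD}@ai_postgres:5432/litellm
    depends_on:
      - ai_postgres  # ensure the database is started first
```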
Removed authentication middleware to simplify access. LiteLLM now relies
solely on Bearer token authentication via LITELLM_MASTER_KEY.
Re-enabled LITELLM_MASTER_KEY for proper API key authentication.
LiteLLM supports a master key without a database for simple auth scenarios.
- LiteLLM validates Bearer token against master key
- Open WebUI uses same key for internal communication
- External access requires both HTTP Basic Auth + API key
Removed LITELLM_MASTER_KEY as it requires a database for virtual key
management. Security is already provided by HTTP Basic Auth on the
public Traefik endpoint. Internal Open WebUI communication doesn't
need additional API key auth.
Security layers:
- Public access: HTTP Basic Auth via Traefik
- Internal LiteLLM: Network isolation (no auth needed)
- Added AI_LITELLM_API_KEY environment variable to .env
- Configured LiteLLM MASTER_KEY for authentication
- Updated Open WebUI to use secure API key from environment
- Generated secure 64-character hex key: sk-77b42236...
This replaces the insecure hardcoded sk-1234 key with proper
secret management via environment variables.
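A sketch of how the key flows through the stack (AI_LITELLM_API_KEY from this commit; OPENAI_API_KEY is Open WebUI's standard variable):

```yaml
services:
  litellm:
    environment:
      LITELLM_MASTER_KEY: ${AI_LITELLM_API_KEY}
  webui:
    environment:
      OPENAI_API_KEY: ${AI_LITELLM_API_KEY}  # Open WebUI authenticates to LiteLLM with the same key
```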
Healthcheck was failing because curl is not installed in the LiteLLM
container, causing Traefik to mark it as unhealthy and not route traffic.
Disabled the healthcheck, as Traefik doesn't require one for routing.
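One way to do this is compose's healthcheck disable switch, which also overrides a check inherited from the image:

```yaml
services:
  litellm:
    healthcheck:
      disable: true  # the image lacks curl, so a curl-based check can never pass
```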
LiteLLM was binding to localhost by default, making it unreachable
from the Traefik reverse proxy. Added the --host 0.0.0.0 parameter to allow
connections from the Docker network.
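E.g., in the service command (config path and port are assumptions):

```yaml
services:
  litellm:
    command: ["--config", "/app/config.yaml", "--host", "0.0.0.0", "--port", "4000"]
```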
Added Traefik configuration to make LiteLLM accessible at llm.ai.pivoine.art
for use with @openai/codex CLI tool.
Changes:
- Added AI_LITELLM_TRAEFIK_HOST to arty.yml (llm.ai.pivoine.art)
- Updated ai/compose.yaml litellm service with full Traefik labels:
  - HTTP to HTTPS redirect
  - SSL termination via Let's Encrypt
  - Compression and security headers
This allows external tools like Codex to use Claude models via
OpenAI-compatible API endpoint.
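A sketch of the added labels (router name, entrypoint, and cert resolver name are assumptions that depend on the Traefik setup):

```yaml
services:
  litellm:
    labels:
      - traefik.enable=true
      - traefik.http.routers.litellm.rule=Host(`${AI_LITELLM_TRAEFIK_HOST}`)
      - traefik.http.routers.litellm.entrypoints=websecure
      - traefik.http.routers.litellm.tls.certresolver=letsencrypt
      - traefik.http.services.litellm.loadbalancer.server.port=4000
```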
Removed custom Dockerfile and SFTP function integration in favor of
the simpler REST API approach (webui-export.py).
Changes:
- Restored webui service to use official Open WebUI image
- Removed custom Dockerfile.webui (paramiko build)
- Removed ai/functions/save_to_disk.py SFTP function
- Removed SSH key and functions volume mounts
The REST API export script (webui-export.py) is a simpler and more
flexible solution that doesn't require Docker modifications.
Added a custom Open WebUI function for SSH/SFTP file operations:
**New Function: save_to_disk.py**
- save_file(): Write generated code to local filesystem via SFTP
- read_file(): Read files from local disk
- list_files(): List directory contents
- Configurable via Valves (host, port, username, paths)
**Custom Dockerfile (Dockerfile.webui)**
- Based on ghcr.io/open-webui/open-webui:main
- Installs paramiko library for SSH/SFTP support
- Creates .ssh directory for key storage
**Configuration Updates**
- Mount SSH private key from host (/root/.ssh/id_rsa)
- Mount functions directory for custom tools
- Build custom image with SFTP capabilities
**Usage in Open WebUI**
Claude can now use these tools to:
- Generate code and save it directly to your local disk
- Read existing files for context
- List project directories
- Create new files in any project
Default base path: /home/valknar/Projects
Authentication: SSH key-based (passwordless)
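The mounts described under **Configuration Updates** would look roughly like this in compose (container-side paths are assumptions):

```yaml
services:
  webui:
    build:
      context: .
      dockerfile: Dockerfile.webui  # official image plus paramiko
    volumes:
      - /root/.ssh/id_rsa:/app/backend/.ssh/id_rsa:ro  # assumed key path in the container
      - ./functions:/app/backend/data/functions:ro     # assumed functions path
```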