Added the FaceFusion face-swapping service to the AI stack:
**Configuration:**
- URL: https://facefusion.ai.pivoine.art
- Image: facefusion/facefusion:3.5.0-cpu
- Port: 7865
- Container: ai_facefusion
- Volume: ai_facefusion_data
- HTTP Basic Auth protection
- CPU execution mode (GPU when available)
**Changes:**
- Added facefusion service to ai/compose.yaml
- Added AI_FACEFUSION_* env vars to arty.yml
- Created ai_facefusion_data volume
- Removed old standalone facefusion stack
- Removed ai/README-export.md and ai/webui-export.py
FaceFusion will run on CPU until a GPU server is available.
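A rough sketch of what the resulting ai/compose.yaml entry could look like, based on the values above (the volume mount path, middleware name, and AI_FACEFUSION_BASIC_AUTH variable are illustrative assumptions, not taken from the repo):

```yaml
services:
  facefusion:
    image: facefusion/facefusion:3.5.0-cpu
    container_name: ai_facefusion
    volumes:
      - ai_facefusion_data:/facefusion/.assets  # mount path is an assumption
    labels:
      - traefik.enable=true
      - traefik.http.routers.facefusion.rule=Host(`facefusion.ai.pivoine.art`)
      - traefik.http.services.facefusion.loadbalancer.server.port=7865
      - traefik.http.middlewares.facefusion-auth.basicauth.users=${AI_FACEFUSION_BASIC_AUTH}
      - traefik.http.routers.facefusion.middlewares=facefusion-auth

volumes:
  ai_facefusion_data:
```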
According to the LiteLLM docs, drop_params only drops recognized OpenAI
parameters. Since prompt_cache_key is not in that set, we need to use
additional_drop_params to drop it explicitly.
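A sketch of how that could look in litellm-config.yaml (the model name and ID are taken from a later entry in this log; placement follows LiteLLM's documented litellm_params):

```yaml
model_list:
  - model_name: claude-sonnet-4.5
    litellm_params:
      model: anthropic/claude-sonnet-4-5-20250929
      api_key: os.environ/ANTHROPIC_API_KEY
      additional_drop_params: ["prompt_cache_key"]
```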
Explicitly set drop_params and supports_prompt_caching=false for the
claude-sonnet-4.5 model to prevent the prompt_cache_key parameter from
being sent to the Anthropic API.
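Sketch of the corresponding model entry; that supports_prompt_caching lives under model_info is my assumption about LiteLLM's config layout:

```yaml
  - model_name: claude-sonnet-4.5
    litellm_params:
      model: anthropic/claude-sonnet-4-5-20250929
      drop_params: true
    model_info:
      supports_prompt_caching: false
```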
Add router_settings and default_litellm_params to ensure unsupported
parameters like prompt_cache_key are properly dropped when using Codex
with the LiteLLM proxy.
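One plausible shape for this (assuming router_settings maps to LiteLLM Router kwargs and default_litellm_params nests under it; verify against the LiteLLM docs):

```yaml
router_settings:
  default_litellm_params:
    drop_params: true
```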
Added the disable_responses_id_security setting to allow the Codex CLI to
access the /responses endpoint without 401 errors. This removes the
encryption requirement on response IDs while keeping API key
authentication in place.
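If the setting lives under general_settings (an assumption on my part; the flag name comes from this change, but check the LiteLLM docs for its exact location):

```yaml
general_settings:
  disable_responses_id_security: true
```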
Created database initialization script following the core stack pattern.
The script automatically creates required databases on first initialization:
- openwebui: Open WebUI application database
- litellm: LiteLLM proxy database for API key management and tracking
Changes:
- Created ai/postgres/init/01-init-databases.sh
- Mounted init directory in ai_postgres service
- Added automatic privilege grants to AI_DB_USER
Note: the init script only runs on the first database initialization, when the volume is empty.
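A sketch of what such a script typically looks like under the standard docker-entrypoint-initdb.d pattern (database names come from this change; the exact SQL, and the assumption that AI_DB_USER is wired in as POSTGRES_USER, are illustrative):

```bash
#!/bin/bash
set -e

# Create each required database and grant privileges to the AI user.
for db in openwebui litellm; do
  echo "Creating database: ${db}"
  psql -v ON_ERROR_STOP=1 --username "$POSTGRES_USER" --dbname postgres <<EOSQL
CREATE DATABASE ${db};
GRANT ALL PRIVILEGES ON DATABASE ${db} TO "$POSTGRES_USER";
EOSQL
done
```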
LiteLLM now uses the ai_postgres database instance with a dedicated
'litellm' database for API key management, usage tracking, and rate limiting.
Changes:
- Set DATABASE_URL to postgresql://ai:password@ai_postgres:5432/litellm
- Added depends_on ai_postgres so the database starts first
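The compose wiring might look like this (the ${AI_DB_*} variable names are illustrative; the change itself shows literal credentials):

```yaml
litellm:
  environment:
    DATABASE_URL: postgresql://${AI_DB_USER}:${AI_DB_PASSWORD}@ai_postgres:5432/litellm
  depends_on:
    - ai_postgres
```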
Removed authentication middleware to simplify access. LiteLLM now relies
solely on Bearer token authentication via LITELLM_MASTER_KEY.
Re-enabled LITELLM_MASTER_KEY for proper API key authentication.
LiteLLM supports a master key without a database for simple auth scenarios.
- LiteLLM validates the Bearer token against the master key
- Open WebUI uses the same key for internal communication
- External access requires both HTTP Basic Auth and an API key
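Roughly, the wiring could be as follows (AI_LITELLM_API_KEY appears elsewhere in this log; using Open WebUI's OPENAI_API_KEYS variable here is my assumption):

```yaml
litellm:
  environment:
    LITELLM_MASTER_KEY: ${AI_LITELLM_API_KEY}
webui:
  environment:
    OPENAI_API_KEYS: ${AI_LITELLM_API_KEY}  # same key for internal calls
```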
Removed LITELLM_MASTER_KEY as it requires a database for virtual key
management. Security is already provided by HTTP Basic Auth on the
public Traefik endpoint. Internal Open WebUI communication doesn't
need additional API key auth.
Security layers:
- Public access: HTTP Basic Auth via Traefik
- Internal LiteLLM: Network isolation (no auth needed)
- Added AI_LITELLM_API_KEY environment variable to .env
- Configured LITELLM_MASTER_KEY for authentication
- Updated Open WebUI to use secure API key from environment
- Generated secure 64-character hex key: sk-77b42236...
This replaces the insecure hardcoded sk-1234 key with proper
secret management via environment variables.
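For reference, a key of this shape can be generated like so (a sketch; that openssl was the actual tool used is an assumption):

```bash
# 64 hex characters, prefixed with "sk-"
echo "sk-$(openssl rand -hex 32)"
```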
The healthcheck was failing because curl is not installed in the LiteLLM
container, causing Traefik to mark the service as unhealthy and stop
routing traffic to it. Disabled the healthcheck, since Traefik doesn't
require one for routing.
LiteLLM was binding to localhost by default, making it unreachable
from the Traefik reverse proxy. Added the --host 0.0.0.0 flag to allow
connections from the Docker network.
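In compose terms, something like (the config path is an assumption consistent with the rest of this log):

```yaml
litellm:
  command: ["--config", "/app/config.yaml", "--host", "0.0.0.0", "--port", "4000"]
```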
Added Traefik configuration to make LiteLLM accessible at llm.ai.pivoine.art
for use with @openai/codex CLI tool.
Changes:
- Added AI_LITELLM_TRAEFIK_HOST to arty.yml (llm.ai.pivoine.art)
- Updated the litellm service in ai/compose.yaml with full Traefik labels (sketched below):
  - HTTP to HTTPS redirect
  - SSL termination via Let's Encrypt
  - Compression and security headers
This allows external tools like Codex to use Claude models via
OpenAI-compatible API endpoint.
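A hedged sketch of those labels (middleware and certresolver names are illustrative):

```yaml
litellm:
  labels:
    - traefik.enable=true
    - traefik.http.routers.litellm.rule=Host(`${AI_LITELLM_TRAEFIK_HOST}`)
    - traefik.http.routers.litellm.entrypoints=websecure
    - traefik.http.routers.litellm.tls.certresolver=letsencrypt
    - traefik.http.middlewares.litellm-compress.compress=true
    - traefik.http.routers.litellm.middlewares=litellm-compress
    - traefik.http.services.litellm.loadbalancer.server.port=4000
```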
Changed API key reference from ${ANTHROPIC_API_KEY} to
os.environ/ANTHROPIC_API_KEY to match LiteLLM's documented syntax.
The os.environ/ prefix tells LiteLLM to use os.getenv() to retrieve
the environment variable at runtime, which is the correct way to
reference environment variables in LiteLLM config files.
Reference: https://docs.litellm.ai/docs/proxy/deploy
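The change, in litellm-config.yaml terms:

```yaml
# before: shell-style interpolation, which LiteLLM does not expand
api_key: ${ANTHROPIC_API_KEY}
# after: LiteLLM's documented os.environ/ syntax
api_key: os.environ/ANTHROPIC_API_KEY
```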
Removed the custom Dockerfile and SFTP function integration in favor of
the simpler REST API approach (webui-export.py).
Changes:
- Restored webui service to use official Open WebUI image
- Removed custom Dockerfile.webui (paramiko build)
- Removed ai/functions/save_to_disk.py SFTP function
- Removed SSH key and functions volume mounts
The REST API export script (webui-export.py) is a simpler and more
flexible solution that doesn't require Docker modifications.
Added a Python script to extract and save code blocks from Open WebUI
chat conversations to local disk using the REST API.
Features:
- Export code blocks from specific chats or all chats
- Automatic language detection and proper file extensions
- Organizes files by chat title with metadata
- No Docker modifications needed
- Remote access support via SSH tunnel or public URL
Usage:
python3 ai/webui-export.py --all --output-dir ./exports
python3 ai/webui-export.py --chat-id <id> --output-dir ./code
This replaces the complex SFTP integration with a simple API-based
approach that's easier to maintain and use.
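The core of such a script is fenced-block extraction over a chat's messages; a minimal sketch (the /api/v1/chats/{id} endpoint path is an assumption — verify it against your Open WebUI version):

```python
import re
import requests

FENCE = chr(96) * 3  # the triple-backtick fence, built here to avoid nesting it
BLOCK_RE = re.compile(FENCE + r"(\w*)\n(.*?)" + FENCE, re.DOTALL)

# Map fence languages to file extensions (abbreviated).
EXTENSIONS = {"python": ".py", "bash": ".sh", "yaml": ".yaml", "": ".txt"}

def fetch_chat(base_url: str, token: str, chat_id: str) -> dict:
    # Hypothetical endpoint path; check your Open WebUI version's API.
    resp = requests.get(
        f"{base_url}/api/v1/chats/{chat_id}",
        headers={"Authorization": f"Bearer {token}"},
    )
    resp.raise_for_status()
    return resp.json()

def extract_code_blocks(markdown: str):
    # Yield (file extension, code) for every fenced block in a message.
    for lang, body in BLOCK_RE.findall(markdown):
        yield EXTENSIONS.get(lang.lower(), ".txt"), body
```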
Added a custom Open WebUI function for SSH/SFTP file operations:
**New Function: save_to_disk.py**
- save_file(): Write generated code to local filesystem via SFTP
- read_file(): Read files from local disk
- list_files(): List directory contents
- Configurable via Valves (host, port, username, paths)
**Custom Dockerfile (Dockerfile.webui)**
- Based on ghcr.io/open-webui/open-webui:main
- Installs paramiko library for SSH/SFTP support
- Creates .ssh directory for key storage
**Configuration Updates**
- Mount SSH private key from host (/root/.ssh/id_rsa)
- Mount functions directory for custom tools
- Build custom image with SFTP capabilities
**Usage in Open WebUI**
Claude can now use these tools to:
- Generate code and save it directly to your local disk
- Read existing files for context
- List project directories
- Create new files in any project
Default base path: /home/valknar/Projects
Authentication: SSH key-based (passwordless)
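An abbreviated sketch of the function's shape (Valve defaults and the base path come from this change; the host alias, key mount point, and method body are illustrative):

```python
import paramiko
from pydantic import BaseModel

class Tools:
    class Valves(BaseModel):
        host: str = "host.docker.internal"  # assumption: how the container reaches the host
        port: int = 22
        username: str = "valknar"
        key_path: str = "/app/.ssh/id_rsa"  # assumption: container-side mount point
        base_path: str = "/home/valknar/Projects"

    def __init__(self):
        self.valves = self.Valves()

    def save_file(self, relative_path: str, content: str) -> str:
        """Write generated code to the local filesystem via SFTP."""
        v = self.valves
        client = paramiko.SSHClient()
        client.set_missing_host_key_policy(paramiko.AutoAddPolicy())
        client.connect(v.host, port=v.port, username=v.username,
                       key_filename=v.key_path)
        try:
            with client.open_sftp().open(f"{v.base_path}/{relative_path}", "w") as f:
                f.write(content)
            return f"Saved {relative_path}"
        finally:
            client.close()
```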
Added LiteLLM as an OpenAI-compatible proxy for Anthropic's API to
enable Claude models in Open WebUI.
**New Service: litellm**
- Image: ghcr.io/berriai/litellm:main-latest
- Internal proxy on port 4000
- Converts Anthropic API to OpenAI-compatible format
- Health check with 30s intervals
- Not exposed via Traefik (internal only)
**LiteLLM Configuration (litellm-config.yaml)**
- Claude Sonnet 4 (claude-sonnet-4-20250514)
- Claude Sonnet 4.5 (claude-sonnet-4-5-20250929)
- Claude 3.5 Sonnet (claude-3-5-sonnet-20241022)
- Claude 3 Opus (claude-3-opus-20240229)
- Claude 3 Haiku (claude-3-haiku-20240307)
**Open WebUI Configuration Updates**
- Changed OPENAI_API_BASE_URLS to point to LiteLLM proxy
- URL: http://litellm:4000/v1
- Added litellm as dependency for webui service
- Dummy API key for proxy authentication
**Why LiteLLM?**
Anthropic's API uses a different endpoint structure and different
authentication headers than OpenAI's. LiteLLM acts as a translation layer,
allowing Open WebUI to use Claude models through its OpenAI-compatible
interface.
**Available Models in Open WebUI**
- claude-sonnet-4 (latest Claude Sonnet 4)
- claude-sonnet-4.5 (Claude Sonnet 4.5)
- claude-3-5-sonnet
- claude-3-opus
- claude-3-haiku
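A sketch of the resulting litellm-config.yaml (model IDs from this change; api_key is shown in the os.environ/ form that a later fix in this log settles on):

```yaml
model_list:
  - model_name: claude-sonnet-4
    litellm_params:
      model: anthropic/claude-sonnet-4-20250514
      api_key: os.environ/ANTHROPIC_API_KEY
  - model_name: claude-sonnet-4.5
    litellm_params:
      model: anthropic/claude-sonnet-4-5-20250929
      api_key: os.environ/ANTHROPIC_API_KEY
  - model_name: claude-3-5-sonnet
    litellm_params:
      model: anthropic/claude-3-5-sonnet-20241022
      api_key: os.environ/ANTHROPIC_API_KEY
  # claude-3-opus and claude-3-haiku follow the same pattern
```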
Changed the service name from 'postgres' to 'ai_postgres' to avoid a naming
conflict with the core PostgreSQL service in the Docker Compose include.
Created complete AI infrastructure stack at ai.pivoine.art:
**New Services:**
- **Open WebUI** (ai.pivoine.art)
  - ChatGPT-like interface for AI models
  - Multi-user chat with authentication
  - RAG (Retrieval-Augmented Generation) support
  - Document upload and processing
  - Claude API integration via Anthropic
- **PostgreSQL with pgvector** (dedicated AI database)
  - Vector similarity search for RAG
  - Separate from production databases
  - Stores embeddings and documents
- **Crawl4AI** (internal API service)
  - Web scraping optimized for LLMs
  - Converts websites to clean Markdown
  - Called by n8n workflows
  - No public exposure (internal only)
**Configuration:**
- Added 18 AI environment variables to arty.yml
- Configured email notifications via IONOS SMTP
- OpenAI API compatibility for Claude integration
- Traefik SSL termination and compression
**Backup:**
- Added 3 AI volumes to Restic backup
- Daily backup at 3 AM
- Retention: 7 daily, 4 weekly, 6 monthly, 2 yearly
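In restic terms, that retention policy corresponds to (a sketch; the actual invocation lives in the backup job):

```bash
restic forget --prune \
  --keep-daily 7 --keep-weekly 4 --keep-monthly 6 --keep-yearly 2
```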
**Integration:**
- Shares falcon_network with existing services
- Ready for n8n workflow automation
- Mattermost notifications support
- Watchtower auto-updates enabled
Ready for Phase 2: GPU server integration with Ollama, Whisper, and
Stable Diffusion once the IONOS A10 server is provisioned.
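Skeleton of the resulting ai/compose.yaml (only the Open WebUI image is confirmed elsewhere in this log; the ai_postgres and crawl4ai image tags are assumptions):

```yaml
services:
  webui:
    image: ghcr.io/open-webui/open-webui:main
    depends_on: [ai_postgres]
    networks: [falcon_network]
  ai_postgres:
    image: pgvector/pgvector:pg16     # assumption: any pgvector-enabled build works
    networks: [falcon_network]
  crawl4ai:
    image: unclecode/crawl4ai:latest  # assumption; internal only, no Traefik labels
    networks: [falcon_network]

networks:
  falcon_network:
    external: true
```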