feat(ai): add bge-large-en-v1.5 embedding model to litellm

- Add BGE embedding model config (port 8002) to litellm-config.yaml
- Add GPU_VLLM_EMBED_URL env var to compose and .env

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-11-25 06:40:36 +01:00
parent ef0309838c
commit d57a1241d2
2 changed files with 10 additions and 0 deletions
--- a/ai/compose.yaml
+++ b/ai/compose.yaml
@@ -104,6 +104,7 @@ services:
      GPU_TAILSCALE_IP: ${GPU_TAILSCALE_IP}
      GPU_VLLM_QWEN_URL: ${GPU_VLLM_QWEN_URL}
      GPU_VLLM_LLAMA_URL: ${GPU_VLLM_LLAMA_URL}
+      GPU_VLLM_EMBED_URL: ${GPU_VLLM_EMBED_URL}
      # LITELLM_DROP_PARAMS: 'true'  # DISABLED: Was breaking streaming
      NO_DOCS: 'true'
      NO_REDOC: 'true'