- Changed model from openai/qwen-2.5-7b to hosted_vllm/qwen-2.5-7b
- Implements proper vLLM integration per LiteLLM docs
- Fixes streaming response forwarding issue
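For reference, a minimal sketch of what the provider change looks like in a LiteLLM proxy `config.yaml` — the `api_base` URL and the alias `qwen-2.5-7b` are assumptions, not taken from this change:

```yaml
model_list:
  - model_name: qwen-2.5-7b          # alias clients request (assumed)
    litellm_params:
      # hosted_vllm/ routes the call through LiteLLM's vLLM
      # (OpenAI-compatible) provider instead of the openai/ provider
      model: hosted_vllm/qwen-2.5-7b
      api_base: http://localhost:8000/v1   # assumed vLLM server address
```

The `hosted_vllm/` prefix tells LiteLLM to treat the endpoint as a self-hosted vLLM server with an OpenAI-compatible API, which is what enables streamed chunks to be forwarded correctly.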