- Replace orchestrator routing with direct vLLM server connections - Qwen 2.5 7B on port 8000 (GPU_VLLM_QWEN_URL) - Llama 3.1 8B on port 8001 (GPU_VLLM_LLAMA_URL) - Simplify architecture by removing orchestrator proxy layer 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>