Qwen 3.5 Reverse Proxy for handling instant / thinking modes and their variants automatically
inference reverse-proxy instant thinking openai-api llm vllm genai vllm-serve qwen3-5 sampling-parameters
-
Updated
Apr 2, 2026 - Go