Qwen3.6 35B
2× RTX Pro 6000 Blackwell · vLLM
- throughput:
- 3500.0 t/s gen · 30000.0 t/s pp
Two benchmarks: Qwen3.6 27B BF16 and Qwen3.6 35B BF16. For 35B, best gen tps 3500 at 128 concurrency with MTP off, prompt tps 30000. Also tested 27B with MTP on/off.