Qwen3.6 27B
RTX 3090 · BeeLlama · 128,000 ctx
- quant: Q5_K_S (gguf)
- kv: Q8
Benchmark of KV cache quantization methods on Qwen3.6 27B at 64k and 128k context. The IQ4_XS quant was also tested. Article: https://anbeeld.com/articles/kv-cache-quantization-benchmarks-for-long-context