Gemma 4 5.1B E2B Instruct
AMD Threadripper 256GB · llama.cpp
- throughput:
- 7.5 t/s gen
- quant:
- Q4 (gguf)
text-generation
~5-10 tok/s on CPU. E2B is usable CPU-only. Source: gemma4-ai.com hardware guide
AMD · 256GB unified memory · 1 report
AMD Threadripper 256GB · llama.cpp
~5-10 tok/s on CPU. E2B is usable CPU-only. Source: gemma4-ai.com hardware guide