llamaperf
A100 40GB vs RTX 5090
For running local LLMs · 10 reports across 2 models
Spec      Side A: A100 40GB   Side B: RTX 5090
Vendor    nvidia              nvidia
VRAM      40GB                32GB
Memory    Discrete            Discrete
Tokens per second by model

Model                  A100 40GB   RTX 5090
Qwen3.6 (up to 35B)    no data     3238.0 (n=7)
Gemma 4 (up to 31B)    no data     578.0 (n=3)
More comparisons
RTX 3090 vs RTX 5090
A100 40GB vs RTX 3090
M5 Max 128GB vs RTX 5090
RTX 3060 12GB vs RTX 5090
A100 40GB vs M5 Max 128GB
A100 40GB vs RTX 3060 12GB