Leaderboard
Real benchmarks from real hardware. Compare local LLM performance across different setups. Run the command below to contribute yours.
$
npm install -g metrillm@latest$
metrillmRequires Node 20+ and Ollama or LM Studio running
Or run without installing: npx metrillm@latest
Trending
Last 7 days- 1 gemma3:1b 3
- 2 deepseek-r1:latest 2
- 3 qwen3:4b 2
- 4 gemma3:4b 2
- 5 gpt-oss:20b 2
- 1 Docker Container 24
- 2 NVIDIA NVIDIA Jetson Orin Nano Engineering Reference Developer Kit Super 17
- 3 NVIDIA NVIDIA_DGX_Spark 14
- 4 Mac mini 6
- 5 AZW GTR Pro 1
- 1 Intel Gen Intel® Core™ i9-11900H 24
- 2 Cortex-A78AE 17
- 3 Cortex-X925 14
- 4 Apple M4 Pro 6
- 5 AMD RYZEN AI MAX+ 395 1
Benchmarks
360
total runs
Models
241
unique models
Families
45
model families
Hardware
31
unique CPUs
Filters
360 results
| # | CPU | Model | Size | RAM | Runtime | tok/s | TTFT | HW Fit | Quality | Global | Flags | Verdict | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Apple M2 Max | qwen3-30b-a3b-thinking-2507-claude-4.5-sonnet-high-reasoning-distill-mlx | 30B/MXFP4 | 96 GB | LM-STUDIOMLX | 79.6 | 1.7 s | 95 | 98 | 97 | THINK | Excellent | ||
| 2 | AMD RYZEN AI MAX+ 395 | nemotron3:33b | 33.0B/Q4_K_M | 125 GB | OLLAMAGGUF | 62.7 | 1.0 s | 95 | 97 | 96 | ECOTHINK | Excellent | ||
| 3 | Apple M4 Pro | gpt-oss-20b | — | 64 GB | LM-STUDIOGGUF | 118.2 | 2.2 s | 94 | 95 | 95 | THINK | Excellent | ||
| 4 | Apple M2 Max | qwen3-30b-a3b-thinking-2507-claude-4.5-sonnet-high-reasoning-distill-mlx | 30B/MXFP4 | 96 GB | LM-STUDIOMLX | 80.4 | 1.8 s | 94 | 96 | 95 | THINK | Excellent | ||
| 5 | Intel Core™ Ultra 9 285K | gpt-oss:20b | 20.9B/MXFP4 | 47 GB | OLLAMAGGUF | 239.9 | 616 ms | 98 | 94 | 95 | THINK | Excellent | ||
| 6 | Apple M4 | openai/gpt-oss-20b | 20B/MXFP4 | 24 GB | LM-STUDIOMLX | 29.3 | 1.8 s | 99 | 93 | 95 | THINK | Excellent | ||
| 7 | Apple M4 Pro | gpt-oss:20b | 20.9B/MXFP4 | 48 GB | OLLAMAGGUF | 54.5 | 2.9 s | 92 | 95 | 94 | THINK | Excellent | ||
| 8 | Apple M4 | openai/gpt-oss-20b | 20B/MXFP4 | 32 GB | LM-STUDIOMLX | 40.7 | 1.3 s | 100 | 92 | 94 | THINK | Excellent | ||
| 9 | Apple M4 | qwen/qwen3-30b-a3b-2507 | 30B/4bit | 32 GB | LM-STUDIOMLX | 44.8 | 405 ms | 100 | 91 | 94 | Excellent | |||
| 10 | Cortex-X925 | gpt-oss:20b | 20.9B/MXFP4 | 122 GB | OLLAMAGGUF | 60.5 | 3.1 s | 87 | 95 | 93 | THINK | Excellent | ||
| 11 | Apple M4 Pro | nemotron-3-nano | — | 64 GB | LM-STUDIOGGUF | 93.3 | 257 ms | 100 | 90 | 93 | THINK | Excellent | ||
| 12 | AMD RYZEN AI MAX+ 395 | gpt-oss:20b | 20.9B/MXFP4 | 125 GB | OLLAMAGGUF | 47.1 | 1.8 s | 85 | 96 | 93 | ECOTHINK | Excellent | ||
| 13 | AMD Ryzen 5 5500 | qwen/qwen3-vl-30b | 30B/Q4_K_M | 31 GB | LM-STUDIOGGUF | 24.8 | 735 ms | 95 | 90 | 92 | Excellent | |||
| 14 | AMD Ryzen 5 5600X 6-Core Processor | qwen3.6-35b-a3b-claude-4.7-opus-reasoning-distilled-apex-mtp | 35B | 47 GB | LM-STUDIOGGUF | 42.6 | 3.1 s | 87 | 94 | 92 | ECOTHINK | Excellent | ||
| 15 | Cortex-X925 | gpt-oss:20b | 20.9B/MXFP4 | 122 GB | OLLAMAGGUF | 60.7 | 3.2 s | 86 | 95 | 92 | THINK | Excellent | ||
| 16 | Intel Core™ Ultra 9 285HX | qwen3-coder:30b-a3b-q4_K_M | 30.5B/Q4_K_M | 63 GB | OLLAMAGGUF | 175.3 | 224 ms | 100 | 87 | 91 | Excellent | |||
| 17 | Intel Core™ Ultra 9 285K | qwen3-coder:30b | 30.5B/Q4_K_M | 47 GB | OLLAMAGGUF | 207.9 | 513 ms | 96 | 87 | 90 | Excellent | |||
| 18 | Intel Core™ Ultra 9 285HX | qwen3-coder:30b-a3b-q8_0 | 30.5B/Q8_0 | 63 GB | OLLAMAGGUF | 43.1 | 957 ms | 89 | 90 | 90 | Excellent | |||
| 19 | AMD Ryzen 5 5500 | openai/gpt-oss-20b | 20B/MXFP4 | 31 GB | LM-STUDIOGGUF | 58.7 | 4.5 s | 88 | 91 | 90 | THINK | Excellent | ||
| 20 | Apple M4 | unsloth/gemma-4-26b-a4b-it | 26B/Q3_K_M | 24 GB | LM-STUDIOGGUF | 24.9 | 817 ms | 85 | 92 | 90 | Excellent |
...
Don't miss new benchmarks
Get notified when new models and hardware configurations are tested. No spam, unsubscribe anytime.