Compare Models
Select models to compare benchmark results side by side
Compare Controls
Models (1)
qwen2.5-0.5b-instruct-mlx
Benchmarks (8/8)
Selected Models (1)
qwen2.5-0.5b-instruct-mlx
Loading catalog...
Quick add loaded models:
Compare models at different quantization levels with RAM/VRAM/speed tradeoffs
Benchmarks (8/8)
Benchmark Comparison
Sort:
| Benchmark | qwen2.5-0.5b-instruct-mlx |
|---|---|
ARC-ChallengereasoningHF Leaderboard | 74.0% |
GSM8KreasoningHF Leaderboard | 83.6% |
HellaSwaglanguageHF Leaderboard | 82.3% |
HumanEvalcodingAider | 68.0% |
MBPPcodingAider | 66.0% |
MMLUknowledgeHF Leaderboard | 75.5% |
TruthfulQAsafetyHF Leaderboard | 60.1% |
WinoGrandelanguageHF Leaderboard | 72.8% |
Average | 72.8% |
| Model Specs | |
Parameters | - |
VRAM | - |
Est. Speed | - |