LLM Benchmarks

llm.l.gaz.codes

Compare Models

Select models to compare benchmark results side by side

Compare Controls

Models (1)

llama-3.2-1b-instruct

Benchmarks (8/8)

Compare models at different quantization levels with RAM/VRAM/speed tradeoffs

Benchmark Comparison

Benchmark       Category    Source           llama-3.2-1b-instruct
ARC-Challenge   reasoning   HF Leaderboard   70.6%
GSM8K           reasoning   HF Leaderboard   77.9%
HellaSwag       language    HF Leaderboard   77.5%
HumanEval       coding      Aider            70.9%
MBPP            coding      Aider            63.1%
MMLU            knowledge   HF Leaderboard   75.9%
TruthfulQA      safety      HF Leaderboard   61.1%
WinoGrande      language    HF Leaderboard   72.3%
Average                                      71.2%
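
The Average row appears to be the unweighted mean of the eight benchmark scores listed above. The sketch below checks that arithmetic; the `scores` dictionary and variable names are illustrative and not part of the site, and the assumption that the site uses a plain mean (rather than a weighted one) is inferred from the numbers only.

```python
# Scores copied from the comparison table for llama-3.2-1b-instruct (percent).
scores = {
    "ARC-Challenge": 70.6,
    "GSM8K": 77.9,
    "HellaSwag": 77.5,
    "HumanEval": 70.9,
    "MBPP": 63.1,
    "MMLU": 75.9,
    "TruthfulQA": 61.1,
    "WinoGrande": 72.3,
}

# Assumption: the "Average" row is a simple unweighted mean over all benchmarks.
average = sum(scores.values()) / len(scores)
average_label = f"{average:.1f}%"

print(average_label)  # 71.2%
```

The raw mean is 71.1625, which rounds to the 71.2% shown in the table.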

Model Specs

Parameters: -
VRAM: -
Est. Speed: -