LLM Benchmarks

llm.l.gaz.codes

Compare Models

Select models to compare benchmark results side by side

Compare models at different quantization levels with RAM/VRAM/speed tradeoffs

Benchmark Comparison

Benchmark       Category    Source           minimax-m2
ARC-Challenge   reasoning   HF Leaderboard   88.1%
GSM8K           reasoning   HF Leaderboard   93.4%
HellaSwag       language    HF Leaderboard   93.4%
HumanEval       coding      Aider            78.6%
MBPP            coding      Aider            78.9%
MMLU            knowledge   HF Leaderboard   92.0%
TruthfulQA      safety      HF Leaderboard   76.0%
WinoGrande      language    HF Leaderboard   90.6%
Average                                      86.4%
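
The Average row is consistent with an unweighted arithmetic mean of the eight scores listed for minimax-m2. The page does not say how the figure is computed, so the following is only a sketch under that assumption; the names `scores` and `mean_score` are illustrative and not part of the site.

```python
# Scores for minimax-m2, copied from the benchmark comparison (percent).
scores = {
    "ARC-Challenge": 88.1,
    "GSM8K": 93.4,
    "HellaSwag": 93.4,
    "HumanEval": 78.6,
    "MBPP": 78.9,
    "MMLU": 92.0,
    "TruthfulQA": 76.0,
    "WinoGrande": 90.6,
}


def mean_score(results: dict[str, float]) -> float:
    """Unweighted arithmetic mean of the benchmark scores (assumed method)."""
    return sum(results.values()) / len(results)


# Formats the mean to one decimal place, as in the Average row.
print(f"Average: {mean_score(scores):.1f}%")
```

Under this assumption each benchmark counts equally, so the two coding scores (HumanEval, MBPP) pull the average down as much as any other pair would, regardless of category.
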

Model Specs

Parameters: -
VRAM: -
Est. Speed: -