LLM Benchmarks

llm.l.gaz.codes

minimax-m2

Publisher: mlx-community
Architecture: minimax
Quantization: 3bit
Format: mlx
Max Context: 197k tokens
Type: llm
Machine: gaz-studio
Capabilities: tool_use

knowledge

Benchmark        Score
MMLU (local)     78.5%

reasoning

Benchmark                Score
GSM8K (local)            85.2%
ARC-Challenge (local)    75.4%

coding

Benchmark            Score
HumanEval (local)    72.0%

language

Benchmark            Score
HellaSwag (local)    82.3%

safety

Benchmark             Score
TruthfulQA (local)    65.8%