LLM Benchmarks

llm.l.gaz.codes

← Back to Models

minimax-m2

minimax-m2

Publisher

mlx-community

Architecture

minimax

Quantization

3bit

Format

mlx

Max Context

197k tokens

Type

llm

Machine

gaz-studio

Capabilities

tool_use

Benchmark Results

Compare with other models →

knowledge

Benchmark	Score
MMLUlocal	78.5%

reasoning

Benchmark	Score
GSM8Klocal	85.2%
ARC-Challengelocal	75.4%

coding

Benchmark	Score
HumanEvallocal	72.0%

language

Benchmark	Score
HellaSwaglocal	82.3%

safety

Benchmark	Score
TruthfulQAlocal	65.8%

LLM Benchmarks