LLM Benchmarks
llm.l.gaz.codes
Benchmark Results
Local benchmark results from lm-eval and assistant evaluation runs