AI BENCHY

Aibenchy

AI Benchmark Leaderboard

Benchmarks generated from Aibenchy test suites at 2026-02-16T00:55:25.158Z

Models Evaluated: 10

Total Runs: 40

Total Wrong: 21

Rank Model Name Company Avg Score Value Score Tests Correct

Quick Compare

Choose the first model, then click a second model to open a side-by-side page.

Compare Z.ai: GLM 5 against...
Compare StepFun: Step 3.5 Flash against...
Compare Z.ai: GLM 5 against...
Compare MiniMax: MiniMax M2.5 against...
Compare Z.ai: GLM 4.7 Flash against...
Compare Qwen: Qwen3 Coder Next against...
Compare Qwen: Qwen3 Coder Next against...
Compare Z.ai: GLM 4.7 Flash against...
Compare MiniMax: MiniMax M2.5 against...
Compare OpenAI: GPT-4o-mini against...