About
Changelog
2026-06-23:
Added GLM-5.2
2026-06-10:
Added Claude Fable 5
2026-06-01:
Added Qwen3.7-Plus
2026-05-30:
Added Step 3.7 Flash, Hy3 preview
2026-05-29:
Added Composer 2.5, Qwen3.7-Max, Claude Opus 4.8
2026-05-20:
Added Gemini 3.5 Flash
2026-05-16:
Added Grok 4.3
2026-05-14:
Initial frontend design and development completed.
2026-05-11:
Finished evaluating the test suite data, tested 18 LLMs in total. Started designing the frontend.
2026-05-06:
Built and refined the admin backend.
2026-05-05:
Prepared the dataset.