About
Changelog
2026-05-14:
Initial frontend design and development completed.
2026-05-11:
Finished evaluating the test suite data; 18 LLMs were tested in total. Started designing the frontend.
2026-05-06:
Built and refined the admin backend.
2026-05-05:
Prepared the dataset.