akazah - Overview
Navigation Menu
Pinned Loading
-
Forked from openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Python
-
Challenge: reproduce GPT-4o-like responses using GPT-5. (Inspired by #Keep4o)