openai/evals

openai/evals

Releases10
Frequency1 month 1 week
Last Release
Stars18.6K
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Subscribe above to receive notifications when new versions are released.
VersionDate
Stability
Stability is determined by the version string and my be inaccurate.
3.0.1 Stable
3.0.0 Stable
2.0.0 Stable
1.0.3 Stable
1.0.3.post1 Unknown
1.0.2 Stable
1.0.2.post1 Unknown
1.0.1 Stable
v0.1.1 Stable
0.1.1 Stable