openai/evals

Releases: 10

Frequency: 1 month 1 week

Last Release: about 2 years ago

Stars: 19K

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

10 releases

Version	Date	Stability
3.0.1	May 1, 2024, 00:50	Stable
3.0.0	Apr 17, 2024, 22:27	Stable
2.0.0	Jan 12, 2024, 22:40	Stable
1.0.3	Apr 17, 2023, 18:33	Stable
1.0.3.post1	Apr 17, 2023, 18:38	Unknown
1.0.2	Apr 13, 2023, 22:10	Stable
1.0.2.post1	Apr 13, 2023, 22:22	Unknown
1.0.1	Apr 13, 2023, 21:35	Stable
v0.1.1	Apr 11, 2023, 01:10	Stable
0.1.1	Apr 11, 2023, 01:10	Stable

Note: Stability is determined by the version string and may be inaccurate.
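
The table notes that stability is inferred from the version string alone. The tracker's actual rule is not documented here, so the following is only a minimal sketch of one heuristic that happens to reproduce the labels above: a plain dotted-numeric version (with an optional leading "v") counts as Stable, and anything else, such as a ".post1" suffix, is left as Unknown. The function name classify_stability and the regular expression are assumptions made for illustration.

```python
import re

# Assumed rule (not the tracker's documented logic): an optional "v" prefix
# followed by dot-separated integers, e.g. "3.0.1" or "v0.1.1".
_PLAIN_RELEASE = re.compile(r"^v?\d+(\.\d+)*$")


def classify_stability(version: str) -> str:
    """Return "Stable" for plain numeric versions, "Unknown" otherwise."""
    if _PLAIN_RELEASE.match(version.strip()):
        return "Stable"
    # Suffixes such as ".post1" are not recognized by this simple rule.
    return "Unknown"


if __name__ == "__main__":
    for v in ["3.0.1", "1.0.3.post1", "v0.1.1"]:
        print(v, classify_stability(v))
```

A rule this simple explains why the ".post1" releases show up as Unknown even though they are ordinary post-releases, which is presumably what "may be inaccurate" refers to.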