openai/evals

openai/evals

Releases10

Frequency1 month 1 week

Last Releaseabout 2 years ago

Stars19K

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Log in to subscribe

Collections containing this project

Showing collections based on your access.

This project is not in any collections you can view.