
openai/evals
Releases10
Frequency1 month 1 week
Last Release
Stars18.6K
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Collections containing this project
Showing collections based on your access.
This project is not in any collections you can view.