openai/evals

openai/evals

Releases10
Frequency1 month 1 week
Last Release
Stars18.6K
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Collections containing this project

Showing collections based on your access.

This project is not in any collections you can view.