EleutherAI/lm-evaluation-harness

Releases: 19
Frequency: 3 months 3 days
Last Release
Stars: 12.6K
A framework for few-shot evaluation of language models.
Version  Date  Stability
Stability is determined by the version string and may be inaccurate.
v0.5.0.dev1 Development
v0.4.12 Stable
v0.4.11 Stable
v0.4.10 Stable
v0.4.9.2 Stable
v0.4.9.1 Stable
v0.4.9 Stable
v0.4.8 Stable
v0.4.7 Stable
v0.4.6 Stable
v0.4.5 Stable
v0.4.4 Stable
v0.4.3 Stable
v0.4.2 Stable
v0.4.1 Stable
v0.4.0 Stable
v0.3.0 Stable
v0.2.0 Stable
v0.0.1 Stable
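
The page states that stability is inferred from the version string but does not give the rule. The following is a minimal sketch of one plausible heuristic, not the tracker's actual logic: it assumes a version counts as "Stable" when, after an optional leading "v", it consists only of dot-separated numbers, and as "Development" otherwise. The function name `classify_stability` is hypothetical.

```python
import re

# Assumption: a purely numeric dotted version (e.g. 0.4.9.2) is a stable release.
_NUMERIC_ONLY = re.compile(r"\d+(\.\d+)*")


def classify_stability(version: str) -> str:
    """Guess a stability label from a version string (hypothetical heuristic)."""
    core = version.strip()
    # Drop a single leading "v", as in "v0.4.12".
    if core.startswith("v"):
        core = core[1:]
    # Any extra marker such as "dev1" or "rc2" makes the full match fail.
    return "Stable" if _NUMERIC_ONLY.fullmatch(core) else "Development"


if __name__ == "__main__":
    for tag in ["v0.5.0.dev1", "v0.4.12", "v0.4.9.2", "v0.0.1"]:
        print(tag, classify_stability(tag))
```

Under this assumption the labels in the list above are reproduced, but a heuristic of this kind can mislabel releases whose tags do not follow the usual naming scheme, which is consistent with the page's warning that the labels may be inaccurate.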