huggingface/trl

huggingface/trl

Releases86
Frequency2 weeks 7 hours
Last Release
Stars18.5K
Train transformer language models with reinforcement learning.
Subscribe above to receive notifications when new versions are released.
VersionDate
Stability
Stability is determined by the version string and my be inaccurate.
v1.5.1 Stable
v1.5.0 Stable
v1.4.0 Stable
v1.3.0 Stable
v1.2.0 Stable
v1.1.0 Stable
v1.0.0 Stable
v1.0.0rc1 RC
v0.29.1 Stable
v0.29.0 Stable
v0.28.0 Stable
v0.27.2 Stable
v0.27.1 Stable
v0.27.0 Stable
v0.26.2 Stable
v0.26.1 Stable
v0.26.0 Stable
v0.25.1 Stable
v0.25.0 Stable
v0.24.0 Stable
v0.23.1 Stable
v0.23.0 Stable
v0.22.2 Stable
v0.22.1 Stable
v0.22.0 Stable
Previous1234Next