
huggingface/trl
Releases86
Frequency2 weeks 7 hours
Last Release
Stars18.5K
Train transformer language models with reinforcement learning.
Collections containing this project
Showing collections based on your access.
This project is not in any collections you can view.