Glossary · Term

TRL

← all terms

Definition

Hugging Face's library for reinforcement-learning post-training of language models.

Transformer Reinforcement Learning, an open-source library for SFT, RLHF, and DPO-style post-training of language models.

Mentioned in 1 episode

  1. 009
    How Two Silent Library Bugs Quietly Invalidated a Wave of Reasoning Papers