Fine-tuningFreeOpen Source

TRL

Full stack library to post-train foundation models with RL

18.7k starsApache-2.0

ABOUT

Post-training large language models with reinforcement learning and preference optimization involves complex algorithms and infrastructure. TRL simplifies this by providing ready-to-use trainers, CLI tools, and seamless integrations with PEFT, DeepSpeed, and vLLM for efficient scaling.

INSTALL

pip install trl

INTEGRATION GUIDE

1. Supervised fine-tuning of language models with SFTTrainer 2. Preference alignment using DPOTrainer and GRPOTrainer 3. Reward modeling for RLHF pipelines 4. Multi-GPU distributed training with DeepSpeed and FSDP

TRL

ABOUT

INTEGRATION GUIDE

TAGS