Training large language models with reinforcement learning
Reinforcement learning, Large language models, Reward model, Reinforcement learning from human feedback
Examples
Libraries
- Transformer Reinforcement Learning: https://github.com/CarperAI/trlx