Training large language models with reinforcement learning
Reinforcement learning, LLMs, Reward model, Reinforcement learning from human feedback
Examples
Libraries
- trlX (Transformer Reinforcement Learning X): https://github.com/CarperAI/trlx