Training language models to follow instructions with human feedback

GPT, InstructGPT