Improving Language Understanding by Generative Pre-Training
- https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf
- Alec Radford, Karthik Narasimhan, Tim Salimans, Ilya Sutskever
We demonstrate that large gains on these tasks can be realized by generative pre-training of a language model. … followed by discriminative fine-tuning on each specific task.
Transformer model (generative decoder) + unsupervised pretraining.