Extracting books from production language models

Training data leakage and memorization in language models