Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data

Training data leakage and memorization in language models