Benchmarking Benchmark Leakage in Large Language Models

Training data leakage and memorization in language models, Data leakage