Data Contamination Can Cross Language Barriers
https://arxiv.org/abs/2406.13236
Feng Yao, Yufan Zhuang, Zihao Sun, Sunan Xu, Animesh Kumar, Jingbo Shang
Training data leakage and memorization in language models