Kapoor & Narayanan, "Leakage and the Reproducibility Crisis in ML-based Science" (2022) https://arxiv.org/abs/2207.07048 Sayash Kapoor, Arvind Narayanan Data leakage See also Hofman2023preregistration