Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

https://arxiv.org/abs/2407.21787
Bradley Brown, Jordan Juravsky, Ryan Ehrlich, Ronald Clark, Quoc V. Le, Christopher Ré, Azalia Mirhoseini

sample more verify better.