Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
- https://arxiv.org/abs/2408.03314
- Charlie Snell, Jaehoon Lee, Kelvin Xu, Aviral Kumar
LLMs,
See also Zelikman2024Quiet STaR