Algorithmic progress in language models

Scaling law