Training Large Language Models to Reason in a Continuous Latent Space

Latent reasoning

Training large language models to reason in a continuous latent space – COCONUT Paper explained