Training Large Language Models to Reason in a Continuous Latent Space
- https://arxiv.org/abs/2412.06769
- Shibo Hao, Sainbayar Sukhbaatar, DiJia Su, Xian Li, Zhiting Hu, Jason Weston, Yuandong Tian
Training large language models to reason in a continuous latent space – COCONUT Paper explained