From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step

Chain-of-thought prompting

we investigate if models can be taught to internalize these CoT steps. … starting with a model trained for explicit CoT reasoning, we gradually remove the intermediate steps and finetune the model.
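The quoted procedure, removing intermediate steps gradually between rounds of finetuning, can be read as a curriculum over training sequences. The sketch below is only an illustration of that idea, not the authors' implementation: the function names `truncate_cot` and `curriculum`, the `steps_per_stage` parameter, and the choice to drop steps from the front of the chain are all assumptions, and the finetuning itself is left out.

```python
from typing import Iterator, List


def truncate_cot(question: List[str], cot_steps: List[str],
                 answer: List[str], n_removed: int) -> List[str]:
    """Build one training sequence with the first n_removed CoT steps dropped.

    Dropping from the front is an assumption made for this sketch.
    """
    return question + cot_steps[n_removed:] + answer


def curriculum(question: List[str], cot_steps: List[str],
               answer: List[str], steps_per_stage: int = 1) -> Iterator[List[str]]:
    """Yield one training sequence per stage, from full explicit CoT to none.

    In a real setup the model would be finetuned on each stage's data
    before moving on to the next stage (hypothetical; not shown here).
    """
    n_removed = 0
    while True:
        yield truncate_cot(question, cot_steps, answer,
                           min(n_removed, len(cot_steps)))
        if n_removed >= len(cot_steps):
            break  # final stage: question followed directly by the answer
        n_removed += steps_per_stage


if __name__ == "__main__":
    for stage, seq in enumerate(curriculum(["12*3="], ["6", "+", "30", "="], ["36"])):
        print(stage, " ".join(seq))
```

The first stage is ordinary explicit CoT training data and the last stage contains no intermediate steps at all, so a model that still answers correctly at that point has, in the note's terms, internalized the reasoning.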

Open question: Merrill2024expressive may have proven that this is not really possible. (HT Ben Waber)