Thinking Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language Models

LLMs, Bias

We evaluate a comprehensive, end-user-focused iterative debiasing framework that applies System 2 thinking processes … the more complex System 2-based Implicative Prompts significantly improve over other techniques, demonstrating lower mean bias in the outputs with competitive performance on the downstream tasks.