Alignment problem in AI

The alignment problem from a deep learning perspective

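Superalignment refers to the problem of aligning AI systems that are more capable than their supervisors. A common proxy setup is weak-to-strong supervision, in which a weaker model supervises a stronger one.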

Align to whom? Sorensen2024roadmap discusses pluralistic alignment.