Bridging the Alignment Gap

How societal frameworks can improve LLM alignment with human values

This research proposes using societal alignment frameworks to close the gap between human values and the largely technical methods currently used to align LLMs.

  • Identifies incomplete contracts (the impossibility of specifying desired behavior for every situation in advance) as a fundamental challenge in current alignment methods
  • Advocates for incorporating socially-informed standards rather than relying solely on technical solutions
  • Suggests cross-disciplinary approaches that draw from social sciences and humanities
  • Demonstrates how established societal frameworks can enhance both safety and utility of LLMs

From a security perspective, this approach targets the root causes of alignment failures that lead to harmful outputs, with the aim of making LLMs more robust and trustworthy in real-world deployment.

Societal Alignment Frameworks Can Improve LLM Alignment
