
Bridging the Alignment Gap
How societal frameworks can improve LLM alignment with human values
This research proposes using societal alignment frameworks to close the gap between human values and the technical methods currently used to align LLMs.
- Identifies incomplete contracts (objectives that cannot specify the desired behavior for every situation) as a fundamental challenge in current alignment methods
- Advocates for incorporating socially-informed standards rather than relying solely on technical solutions
- Suggests cross-disciplinary approaches that draw from social sciences and humanities
- Demonstrates how established societal frameworks can enhance both safety and utility of LLMs
From a security perspective, this approach targets the root causes of alignment failures that lead to harmful outputs, rather than only their symptoms, which could make LLMs more robust and trustworthy in real-world deployment.