Chain of Alignment: Integrating Public Will with Expert Intelligence for Language Model Alignment
Authors: Andrew Konya, Aviv Ovadya, Kevin Feng, Quan Ze Chen, Lisa Schirch, Colin Irwin
Published: 2024
Publication: ArXiv
Research paper: Chain of Alignment: Integrating Public Will with Expert Intelligence for Language Model Alignment
Institution: AI & Democracy Foundation,Remesh,University of Washington
Research Area: AI Alignment, Public Will, Expert Intelligence, Rule-based Reward
Discipline: Artificial Intelligence , Computational Social Science