Personalisation within bounds: A risk taxonomy and policy framework for the alignment of large language models with personalised feedback

Authors: HR Kirk, B Vidgen, P Röttger, SA Hale

Published: 2023

Publication: arXiv preprint arXiv:2303.05453, 2023

Institution: The Alan Turing Institute, University of Oxford, Imperial College London, King's College London, Google DeepMind

Research Area: Large Language Model Alignment, Safety, Personalization Risks

Discipline: Artificial Intelligence

Citations: 146

DOI: https://doi.org/10.48550/arXiv.2303.05453