Personalisation within bounds: A risk taxonomy and policy framework for the alignment of large language models with personalised feedback
Authors: HR Kirk, B Vidgen, P Röttger, SA Hale
Published: 2023
Publication: arXiv preprint arXiv:2303.05453, 2023
Institution: The Alan Turing Institute, University of Oxford, Imperial College London, King's College London, Google DeepMind
Research Area: Large Language Model Alignment, Safety, Personalization Risks
Discipline: Artificial Intelligence
Citations: 146
DOI: https://doi.org/10.48550/arXiv.2303.05453