Strong and weak alignment of large language models with human values
Authors: Mehdi Khamassi, Marceau Nahon and Raja Chatila
Published: 2024
Publication: arXiv
Research paper: Strong and weak alignment of large language models with human values
Institution: Sorbonne University
Research Area: AI Alignment, AI Ethics, Computational Cognition
Discipline: Artificial Intelligence, Ethics, Computational Cognition