Tk Gilbert: Researcher — Prolific Citations Library
Explore 1 peer-reviewed study by Tk Gilbert in Reinforcement Learning from Human Feedback (RLHF) and Alignment (2023). Discover research powered by Prolific's participant panel.
This page lists 1 peer-reviewed paper authored or co-authored by Tk Gilbert in the Prolific Citations Library, a curated collection of research powered by high-quality human data from Prolific.
Papers (1 of 1)
-
Authors: S Casper, X Davies, C Shi, TK Gilbert
Year: 2023
Published in: arXiv preprint arXiv ..., 2023 - arxiv.org
Institution: Columbia University, Cornell Tech, Apollo Research, ETH Zurich, UC Berkeley, University of Sussex, Independent
Research Area: Reinforcement Learning from Human Feedback (RLHF), Alignment, LLM Limitations
Discipline: Artificial Intelligence
DOI: https://doi.org/10.48550/arXiv.2307.15217
Citations: 848