Papers by Tk Gilbert

Explore 1 peer-reviewed study by Tk Gilbert in Reinforcement Learning from Human Feedback (RLHF) and Alignment (2023). Discover research powered by Prolific's participant panel.

This page lists 1 peer-reviewed paper authored or co-authored by Tk Gilbert in the Prolific Citations Library, a curated collection of research powered by high-quality human data from Prolific.

Papers (1 of 1)

Open problems and fundamental limitations of reinforcement learning from human feedback

Authors: S Casper, X Davies, C Shi, TK Gilbert

Year: 2023

Published in: arXiv preprint arXiv ..., 2023 - arxiv.org

Institution: Columbia University, Cornell Tech, Apollo Research, ETH Zurich, UC Berkeley, University of Sussex, Independent

Research Area: Reinforcement Learning from Human Feedback (RLHF), Alignment, LLM Limitations

Discipline: Artificial Intelligence

DOI: https://doi.org/10.48550/arXiv.2307.15217

Citations: 848