Large Language Models Pass the Turing Test
Authors: Cameron R. Jones, Benjamin K. Bergen
Published: 2025
Publication: arXiv
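GPT-4.5 passed the Turing Test: judges identified it as the human 73% of the time, more often than they selected the real human participants and more often than they selected any other model tested. The authors present this as the first conclusive evidence of an AI system meeting this standard.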
Methods: Randomised, controlled, pre-registered Turing Test in which human participants held 5-minute conversations with another human and an AI system at the same time, then judged which conversation partner was the human.
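Key Findings: Four AI systems (ELIZA, GPT-4o, LLaMa-3.1-405B, GPT-4.5) were evaluated on their ability to mimic human conversational behavior and be perceived as human. GPT-4.5 was judged to be the human 73% of the time, more often than the real human participants.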
Institution: University of California San Diego
Research Area: Artificial Intelligence, Computational Linguistics, Turing Test, AI Evaluation
Discipline: Artificial Intelligence