Jamie A Ward
@jamieward.bsky.social
Wearable computing & social neuroscience. Professor of Computing at Goldsmiths University of London.
Reposted by Jamie A Ward
And still a benchmark for chatbots: arxiv.org/abs/2310.20216
Does GPT-4 pass the Turing test?
We evaluated GPT-4 in a public online Turing test. The best-performing GPT-4 prompt passed in 49.7% of games, outperforming ELIZA (22%) and GPT-3.5 (20%), but falling short of the baseline set by huma...
arxiv.org
January 28, 2025 at 3:13 PM
And still a benchmark for chatbots: arxiv.org/abs/2310.20216