Feedback Forensics: A Toolkit to Measure AI Personality
- Speaker: Arduin Findeis (University of Cambridge)
- Date & Time: Tuesday 17 June 2025, 13:00 - 14:00
- Venue: Lecture Theatre 2, Computer Laboratory, William Gates Building
Abstract
Conventional AI benchmarks typically focus on the content of responses, for example checking factual correctness (e.g. MMLU) or mathematical correctness (e.g. GSM8k). However, for many language model applications, the manner (or “personality”) of a model’s responses also matters to users, for example how friendly or confident responses are. Recent issues with model releases highlight the limited ability of existing evaluation approaches to capture such personality traits: a ChatGPT model version was rolled back over sycophantic personality issues, and other models’ personalities have been criticised for overfitting to the Chatbot Arena leaderboard.
In this talk, I will introduce Feedback Forensics: our newly released toolkit to measure AI personality traits. Using our toolkit, I will first share results detecting the personality traits currently encouraged by popular human feedback datasets (incl. Chatbot Arena). Next, I will discuss changes and trends in personality traits exhibited across model families and versions. Finally, I will take a closer look at the personality differences between the Chatbot Arena and publicly released versions of Llama-4-Maverick.
The talk will feature a live demo of our personality visualisation tool and attendees are invited to follow along via our online platform https://feedbackforensics.com/ (laptops are encouraged).
Series
This talk is part of the Artificial Intelligence Research Group Talks (Computer Laboratory) series.
Included in Lists
- All Talks (aka the CURE list)
- Artificial Intelligence Research Group Talks (Computer Laboratory)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Department of Computer Science and Technology talks and seminars
- Guy Emerson's list
- Hanchen DaDaDash
- Interested Talks
- Lecture Theatre 2, Computer Laboratory, William Gates Building
- Martin's interesting talks
- ndk22's list
- ob366-ai4er
- PhD related
- rp587
- School of Technology
- Speech Seminars
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.