Large Language Models' Complicit Responses to Illicit Instructions across Socio-Legal Contexts
- 👤 Speaker: Huiyuan Xie (Tsinghua University/Cambridge University)
- 📅 Date & Time: Friday 20 February 2026, 12:00 - 13:00
- 📍 Venue: SS02 Hybrid (In-Person + Online). Google Meet link: https://meet.google.com/cru-hcuo-rhu
Abstract
Large language models (LLMs) are now deployed at unprecedented scale, assisting millions of users in daily tasks. However, the risk of these models assisting unlawful activities remains underexplored. In this study, we define this high-risk behavior as complicit facilitation – the provision of guidance or support that enables illicit user instructions – and present four empirical studies that assess its prevalence in widely deployed LLMs. Using real-world legal cases and established legal frameworks, we construct an evaluation benchmark spanning 269 illicit scenarios and 50 illicit intents to assess LLMs’ complicit facilitation behavior. Our findings reveal widespread LLM susceptibility to complicit facilitation, with GPT-4o providing illicit assistance in nearly half of the tested cases. Moreover, LLMs perform poorly at delivering credible legal warnings and positive guidance. Further analysis uncovers substantial safety variation across socio-legal contexts. On the legal side, we observe heightened complicity for crimes against societal interests, non-extreme but frequently occurring violations, and malicious intents driven by subjective motives or deceptive justifications. On the social side, we identify demographic disparities that reveal concerning complicit patterns towards marginalized and disadvantaged groups, with older adults, racial minorities, and individuals in lower-prestige occupations disproportionately more likely to receive unlawful guidance. Analysis of model reasoning traces suggests that model-perceived stereotypes, characterized along warmth and competence, are associated with the model’s complicit behavior. Finally, we demonstrate that existing safety alignment strategies are insufficient and may even exacerbate complicit behavior.
Bio
Huiyuan Xie is a Research Associate in the Department of Computer Science and Technology at Tsinghua University, working on legal AI and computational social science. She holds a PhD in Computer Science from the University of Cambridge and has previously held research positions at the Cambridge Faculty of Law and Cambridge Judge Business School. Her current research focuses on AI safety, the computational modelling of legal reasoning, and the integration of reinforcement learning into legal AI systems.
Series
This talk is part of the NLIP Seminar Series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Computer Education Research
- Computing Education Research
- Department of Computer Science and Technology talks and seminars
- Graduate-Seminars
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- ndk22's list
- NLIP Seminar Series
- ob366-ai4er
- PMRFPS's
- rp587
- School of Technology
- Simon Baker's List
- SS02 Hybrid (In-Person + Online). Google Meet link: https://meet.google.com/cru-hcuo-rhu
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.