Provably Safe Certification for Machine Learning Models under Adversarial Attacks
- π€ Speaker: Prof. Miguel Rodrigues, UCL π Website
- π Date & Time: Wednesday 22 November 2023, 14:00 - 15:00
- π Venue: MR5, CMS Pavilion A
Abstract
It is widely known that state-of-the-art machine learning models β including vision and language ones β can be seriously compromised by adversarial perturbations, so it is also increasingly relevant to develop capability to certify their performance in the presence of the most effective adversarial attacks.
This talk will introduce an approach inspired by distribution-free risk controlling procedures to certify the performance of machine learning models in the presence of adversarial attacks, with population level risk guarantees. In particular, given a specific attack, we will introduce the notion of a machine learning model (alpha, zeta)βsafety guarantee: this guarantee, which is supported by a testing procedure based on the availability of a calibration set, entails one will only declare that a machine learning model adversarial (population) risk is less than alpha (i.e. the model is safe) given that the model adversarial (population) risk is higher than alpha (i.e. the model is in fact unsafe), with probability less than zeta. We will also introduce Bayesian optimization oriented approaches to determine very efficiently whether or not a machine learning model is (alpha, zeta)-safe in the presence of an adversarial attack, along with their statistical guarantees.
This talk will also illustrate how to apply our framework to a range of machine learning models β including various sizes of vision Transformer (ViT) and ResNet models β impaired by a variety of adversarial attacks.
Series This talk is part of the Information Theory Seminar series.
Included in Lists
- All CMS events
- All Talks (aka the CURE list)
- bld31
- CMS Events
- DPMMS info aggregator
- DPMMS lists
- DPMMS Lists
- Hanchen DaDaDash
- Information Theory Seminar
- Interested Talks
- MR5, CMS Pavilion A
- School of Physical Sciences
- Statistical Laboratory info aggregator
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Prof. Miguel Rodrigues, UCL 
Wednesday 22 November 2023, 14:00-15:00