Tackling Label Corruptions: Univariate Polynomial Regression and Generalized Linear Models
- Speaker: Sushrut Karmalkar, Microsoft Research
- Date & Time: Tuesday 03 February 2026, 14:00 - 15:00
- Venue: FW 26, Computer Laboratory, William Gates Building
Abstract
Label corruptions pose a significant challenge in various machine learning tasks, affecting the accuracy and reliability of models. In this talk, we will address two distinct problems involving label corruptions, and present approaches to handle them effectively.
The first problem we consider is robust univariate polynomial regression. Here the goal is to recover a polynomial that is pointwise close to a target polynomial, given samples in which each label is clean (satisfies the model) with probability $\alpha$ and corrupted (completely arbitrary) with probability $1-\alpha$. We propose an approach that tolerates any constant corruption fraction less than 1/2, which is the information-theoretic limit for unique recovery in this problem.
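The sampling model above can be illustrated with a toy sketch. This is not the approach presented in the talk: it only simulates corrupted labels and compares two naive pointwise estimators, a local mean and a local median, to show why a majority of clean labels matters. The function names, the uniform distribution on $[-1, 1]$, the noiseless clean labels, and the particular corruption used are all assumptions made for illustration.

```python
import random
import statistics


def sample_corrupted(poly, n, alpha, rng):
    """Draw n pairs (x, y) with x uniform on [-1, 1] (an assumed distribution).

    With probability alpha the label is clean, y = poly(x); otherwise it is
    replaced by a stand-in for an arbitrary corruption.
    """
    samples = []
    for _ in range(n):
        x = rng.uniform(-1.0, 1.0)
        if rng.random() < alpha:
            y = poly(x)
        else:
            y = rng.uniform(50.0, 100.0)  # hypothetical corruption, far from poly(x)
        samples.append((x, y))
    return samples


def labels_near(samples, x0, width):
    """Labels of all samples whose x lies within `width` of x0."""
    return [y for x, y in samples if abs(x - x0) <= width]


def target(x):
    """Hypothetical target polynomial 1 + 2x - 3x^2."""
    return 1.0 + 2.0 * x - 3.0 * x * x


rng = random.Random(0)
data = sample_corrupted(target, n=20000, alpha=0.7, rng=rng)

x0 = 0.3
window = labels_near(data, x0, width=0.02)
est_median = statistics.median(window)  # stays close to target(x0)
est_mean = statistics.fmean(window)     # dragged far away by corrupted labels
true_value = target(x0)
```

With 30% of the labels corrupted, the local median lands on a clean label from the window, while the local mean is pulled toward the corrupted values. Once the corrupted fraction reaches 1/2, corrupted labels could themselves follow a different polynomial, which is the intuition behind the 1/2 limit for unique recovery.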
In the second problem, we examine the challenge of learning a linear function composed with a generalized linear model (GLM). We focus on the oblivious noise setting, where up to any constant fraction of the labels are corrupted via arbitrary independent and additive noise. We show that in this setting, it is always possible to recover a polynomial-sized list of candidates, one of which is arbitrarily close to the true answer. Furthermore, under mild distributional assumptions, we show this recovery is unique.
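One common way to write down an oblivious noise model for a GLM is sketched below. It is stated here as an assumption rather than as the talk's exact definition; the symbols $g$ (link function), $w$ (unknown weight vector), $\xi$ (noise), and the reuse of $\alpha$ for the clean fraction are illustrative choices.

$$
y = g(w^\top x) + \xi, \qquad \xi \text{ independent of } x, \qquad \Pr[\xi = 0] \ge \alpha
$$

Under this reading, the noise distribution is otherwise arbitrary but cannot depend on $x$, and the clean fraction $\alpha$ may be any positive constant.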
This talk is co-hosted by the Computer Laboratory AI Research Group.
Series
This talk is part of the Machine learning theory series.