Neural Networks for High-Dimensional Tabular Biomedical Datasets
- đ¤ Speaker: Andrei Margeloiu (University of Cambridge)
- đ Date & Time: Tuesday 21 February 2023, 13:00 - 14:00
- đ Venue: Lecture Theatre 2
Abstract
Modern machine learning algorithms frequently overfit on small-sample size and high-dimensional tabular datasets, which are common in medicine, bioinformatics and drug discovery. How can we reduce the overfitting on tabular datasets with D>>N?
This talk presents two neural methods for learning from small-sample size and high-dimensional tabular datasets. First, we present WPFS , a parameter-efficient neural architecture that performs global feature selection during training. Second, we present GCondNet, a general approach which combines Graph Neural Networks (GNNs) for incorporating the implicit relationships between samples when training standard neural networks. GCondNet exploits the high-dimensionality of the data by creating many small graphs to capture the structure between samples within a feature. We show that WPFS and GCondNet outperform both standard and more recent methods on real-world biomedical datasets.
Series This talk is part of the Artificial Intelligence Research Group Talks (Computer Laboratory) series.
Included in Lists
- All Talks (aka the CURE list)
- Artificial Intelligence Research Group Talks (Computer Laboratory)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Department of Computer Science and Technology talks and seminars
- Guy Emerson's list
- Hanchen DaDaDash
- Interested Talks
- Lecture Theatre 2
- Martin's interesting talks
- ndk22's list
- ob366-ai4er
- PhD related
- rp587
- School of Technology
- Speech Seminars
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Tuesday 21 February 2023, 13:00-14:00