Sketching methods
- 👤 Speaker: John Bradshaw
- 📅 Date & Time: Thursday 10 November 2016, 13:30 - 15:00
- 📍 Venue: Engineering Department, CBL Room 438
Abstract
Sketching methods (or sometimes called streaming algorithms) are useful when you want an approximate property of a dataset when the computation of its true value would take too long or use too much memory. In this talk we give a brief overview of some of the most popular sketching methods, including: the Flajolet-Martin algorithm, for estimating the cardinality of a dataset; the Bloom Filter for testing set membership and the Count-Min Sketch for estimating occurrences of each item. We will try to provide examples of when these methods are useful and where they are used in big data applications.
There is no need to read any material prior to the meeting.
Series This talk is part of the Machine Learning Reading Group @ CUED series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Cambridge University Engineering Department Talks
- Centre for Smart Infrastructure & Construction
- Chris Davis' list
- Computational Continuum Mechanics Group Seminars
- custom
- Engineering Department, CBL Room 438
- Featured lists
- Guy Emerson's list
- Hanchen DaDaDash
- Inference Group Journal Clubs
- Inference Group Summary
- Information Engineering Division seminar list
- Interested Talks
- Machine Learning Reading Group
- Machine Learning Reading Group @ CUED
- Machine Learning Summary
- ML
- ndk22's list
- ob366-ai4er
- Quantum Matter Journal Club
- Required lists for MLG
- rp587
- School of Technology
- Simon Baker's List
- TQS Journal Clubs
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

John Bradshaw
Thursday 10 November 2016, 13:30-15:00