Firmament: fast, centralized cluster scheduling at scale
- 👤 Speaker: Ionel Gog (Computer Lab) 🔗 Website
- 📅 Date & Time: Thursday 13 October 2016, 15:00 - 16:00
- 📍 Venue: FW26, Computer Laboratory, William Gates Building
Abstract
Scheduling tasks on “warehouse-scale” clusters is a challenging undertaking: thousands of tasks must cleverly and rapidly be placed in order to achieve high utilization and good application-level performance. Centralized data-center schedulers make high-quality placement decisions, but they come at the cost of high decision latency at scale, which degrades response time for interactive jobs. Distributed schedulers, by contrast, make low-latency decisions at scale, but are restricted to simple algorithms and can make poor decisions as a result.
In this talk, I present Firmament, a centralized scheduler that scales to tens of thousands of machines at sub-second latency, even though it performs a computationally expensive min cost flow optimization. To achieve this, Firmament automatically chooses between different min-cost flow algorithms, solves the optimization problem incrementally when possible, and applies problem-specific optimizations to common min-cost flow algorithms.
Our experiments with a Google workload trace from a 12,500-machine cluster show that Firmament places tasks in hundreds of milliseconds, improving decision latency by more than 10x over Quincy, a similar prior scheduler. Moreover, since Firmament can efficiently solve large optimization problems, it supports novel scheduler features that would otherwise be too expensive. Firmament outperforms state-of-the-art distributed schedulers in placement quality without an increase in scheduling decision latency for common industry workloads.
Series This talk is part of the Computer Laboratory Systems Research Group Seminar series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge talks
- Chris Davis' list
- CL's SRG seminar
- Computer Laboratory Systems Research Group Seminar
- Department of Computer Science and Technology talks and seminars
- FW26, Computer Laboratory, William Gates Building
- Interested Talks
- ndk22's list
- ob366-ai4er
- rp587
- School of Technology
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Ionel Gog (Computer Lab) 
Thursday 13 October 2016, 15:00-16:00