Predicting Faults in Heterogeneous, Federated Distributed Systems
- 👤 Speaker: Marco Canini (EPFL)
- 📅 Date & Time: Wednesday 29 September 2010, 14:30 - 15:30
- 📍 Venue: FW26, Computer Laboratory, William Gates Builiding
Abstract
It is notoriously difficult to make distributed systems reliable. This becomes even harder in the case of the widely-deployed systems that become heterogeneous and federated. The set of routers in charge of the inter-domain routing in the Internet is a prime example of such a system. The unanticipated interaction of nodes under seemingly valid configuration changes and local fault-handling can have a profound effect. For example, the Internet has suffered from multiple IP prefix hijackings, as well as performance and reliability problems due to emergent behavior resulting from a local session reset.
We argue that the key step in making these systems reliable is the need to automatically predict faults. In this talk, I will describe the design and implementation of DiCE, a system that uses temporal and spatial awareness to predict faults in heterogeneous, federated systems. Our live evaluation in the testbed shows that DiCE quickly and successfully predicts two important classes of faults, operator mistakes and programming errors, that have plagued BGP routing in the Internet.
Joint work with Vojin Jovanovic, Gautam Kumar, and Dejan Kostic
Marco’s home page: http://people.epfl.ch/marco.canini
Series This talk is part of the Computer Laboratory Systems Research Group Seminar series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge talks
- Chris Davis' list
- CL's SRG seminar
- Computer Laboratory Systems Research Group Seminar
- Department of Computer Science and Technology talks and seminars
- FW26, Computer Laboratory, William Gates Builiding
- Interested Talks
- ndk22's list
- ob366-ai4er
- rp587
- School of Technology
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Marco Canini (EPFL)
Wednesday 29 September 2010, 14:30-15:30