Facebook systems are complex and are stressing the Linux kernel in various, extreme, ways. As we move to kernels that are closer to mainline, we need tools to help us improve the kernel stability and performance. This presentation will describe how we use the various Linux tracing tools for debugging and continuous performance monitoring. We will also cover the road blocks that we faced and what improvements would be a benefit to us.
Tracing tool developers to learn how their tools are used in a large scale system and general tracing users to see some real life usages.
Yannick Brosseau is a Production Engineer on the Kernel team at Facebook. As such he works on improving the stability and performance of the kernels deployed on the Facebook infrastructure and develops testing, monitoring and deployment tools to help in this endeavor. Previously, he was a Research associate at École Polytechnique de Montréal where he worked on performance analysis tools for Linux. He worked on several part of the LTTng project. He was also an open source software consultant for several years and is a Fedora Packager.