Distributed Tracing at Uber and Facebook

Event: Systems @ Scale

Date: June 6, 2019

Speakers: Michael Bevilacqua-Linn & Yuri Shkuro

Video: https://atscaleconference.com/videos/systems-scale-2019-observability-infra-uber-and-facebook/

Uber and Facebook both operate large-scale distributed tracing systems, but with different focuses. Uber’s open-source Jaeger is used primarily to understand the complex behaviors of their vast microservices architecture, analyse failures during outages, and accelerate root cause analysis. Facebook has largely used their tracing system, Canopy, for performance and efficiency analysis. In this talk, Michael and I describe each approach and show how distributed tracing helps to scale both the infrastructure and the engineering organization.