Jash Mistry is a Senior Software Engineer at eBay. As a member of the Site Reliability Engineering team, he played a crucial role in the evolution of monitoring—expanding on absolute error counts and average latencies to develop a highly reliable SLO-driven observability platform. He has a Master's Degree in Computer Engineering from Georgia Institute of Technology. Movie theatres are his second home, but he does not mind seeing one from the couch as long as it's on Mubi or the Criterion Channel.

Presentations

22x

Beyond Sequential: A Recipe for Async Pipeline Observability and Alerting

While implementing observability for thousands of microservices, we faced challenges with asynchronous pipelines that weren't covered by traditional API-based SLO approaches. This talk presents our company's journey in implementing SLOs for async systems: identifying key customer experience KPIs, leveraging Prometheus, defining good/valid events, and establishing burn rate alert thresholds. We'll share real case studies demonstrating the impact, including how we detected and resolved pipeline latency issues that affected thousands of events, along with best practices and lessons learned

See Presentation