SRE Lessons from the Trenches

Published: April 15, 2020, 5 a.m.

Emil Stolarsky (@emilstolarsky) and Jaime Woo (@jaimewoo), co-founders of @IncidentLabsInc, talk about experiences running web applications at scale, evolving into SRE roles, communicating SRE concepts across teams, and tips for initial success.

SHOW: 446

SHOW SPONSOR LINKS:

CLOUD NEWS OF THE WEEK - http://bit.ly/cloudcast-cnotw

SHOW NOTES:

Topic 1 - Welcome to the show. Tell us a little bit about your backgrounds, and some of the experiences that led you to focus on SRE.

Topic 2 - SRE is still an evolving concept, and people are still learning about it. How do you frame a conversation with people about how SRE works? How much is technology-centric and how much is culture/process-centric?

Topic 3 - We’re all living in an unusual time, given the current COVID-19 pandemic. How do you see SRE changing as work environments change (e.g., WFH), or as volume or change rate is dramatically impacted?

Topic 4 - What have you found are successful communication and collaboration models for SREs with their associated teams (or other stakeholders)?

Topic 5 - How well do you find different groups understand the concepts around error budgets and SLOs?

Topic 6 - If people are just now getting started with SRE, what are some early tips (or tools) that you recommend for them to have initial success (or avoid failures)?

FEEDBACK?
