Episode 15: Nagios was the Original Call of Duty

Published: June 20, 2018, 6 a.m.

Let\u2019s chat about the Cloud and everything in between. The people in this world are pretty comfortable with not running physical servers on their own, but trusting someone else to run them. Yet, people suffer from the psychological barrier of thinking they need to build, design, and run their own monitoring system. Fortunately, more companies are turning to Datadog.\nToday, we\u2019re talking to Ilan Rabinovitch, Datadog\u2019s vice president of product and community. He spends his days diving into container monitoring metrics, collaborating with Datadog\u2019s open source community, and evangelizing observability best practices. Previously, Ilan led infrastructure and reliability engineering teams at various organizations, including Ooyala and Edmunds.com. He\u2019s active in the open source and DevOps communities, where he is a co-organizer of events, such as SCALE and Texas Linux Fest.\nSome of the highlights of the show include:\n\nDatadog is well-known, especially because it is a frequent sponsor\nMore organizations know their core competency is not monitoring or managing servers\nMonitoring/metrics is a big data problem; Datadog takes monitoring off your plate\nAlternate ways, other than using Nagios, to monitor instances and regenerate configurations\nDatadog is first to identify patterns when there is a widespread underlying infrastructure issue\nTrends of moving from on-premise to Cloud; serverless is on the horizon\nHow trends affect evolution of Datadog; adjusting tools to monitor customers\u2019 environments\nDatadog\u2019s scope is enormous; the company tries to present relevant information as the scale of what it\u2019s watching continues to grow\nDatadog\u2019s pricing is straightforward and simple to understand; how much Cloud providers charge to use Datadog is less clear\nSingle Pane of Glass: Too much data to gather in small areas (dashboards) \xa0\nWhy didn\u2019t monitoring catch this? Alerts need to be actionable and relevant\nHow to use Datadog\u2019s workflow for setting alerts and work metrics\nDatadog\u2019s first Dash user conference will be held in July in New York; addresses how to solve real business problems, how to scale/speed up your organization\n\nLinks:\n\nIlan Rabinovitch on Twitter\nDatadog\nDocker Adoption Survey Results \xa0\nRubric for Setting Alerts/Work Metrics\nDash Conference\nre:Invent\nNagios