
The latest news and information on monitoring for websites, applications, APIs, infrastructure, and other technologies.

Intro

I needed a tool to alert me when my cron jobs silently fail. There are already a number of services for this, but it seemed like a fun thing to build myself. So I present to you: healthchecks.io. I use it myself, and it has already proved useful a couple of times: for example, when a seemingly benign code change in one service caused my batch job to fail 12 hours later, in the middle of the night.
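To make the idea concrete, here is a minimal sketch of the "ping on success" pattern that this kind of monitoring relies on: the job reports in after each successful run, and the service alerts when a report is overdue. The ping URL below is a hypothetical placeholder, and the `run_and_ping` helper and its `opener` parameter are illustrative names, not part of any official client.

```python
import subprocess
import urllib.request

# Hypothetical placeholder: substitute the ping URL of your own check.
PING_URL = "https://hc-ping.com/your-check-uuid"


def run_and_ping(command, ping_url=PING_URL, opener=urllib.request.urlopen):
    """Run `command` and ping `ping_url` only if it exits with status 0.

    Returns the command's exit code. `opener` is injectable so the
    function can be exercised without network access.
    """
    result = subprocess.run(command)
    if result.returncode == 0:
        try:
            opener(ping_url, timeout=10)
        except OSError:
            # A failed ping must not turn a successful job into a failure;
            # the monitoring service will notice the missing ping anyway.
            pass
    return result.returncode
```

A crontab entry could then call a small script that wraps the real job with `run_and_ping(["/path/to/job"])`. If the job crashes, hangs, or never starts, no ping arrives, and the missing ping is what triggers the alert.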

Understand.io – Log Management with Powerful Searching

Understand.io is an online event and log management system that helps you extract value from the log data you already collect. It gives you complete application visibility in real time, helping you track down error messages, application requests, customer problems, and much more.

Create ServiceNow tickets from Datadog

ServiceNow is a popular IT service management platform for recording, tracking, and managing a company's enterprise-level IT processes in a single location. In addition to helping you manage your ServiceNow CMDB, Datadog integrates with ServiceNow IT Operations Management (ITOM) and IT Service Management (ITSM), enabling you to automatically create and manage ServiceNow incidents and events from the Datadog platform.