20 Operations Manager Tips in 20 Minutes

If you’ve ever worked with SCOM then you’ll know it needs a little love to get the most out of it. Together with Tao Yang MVP, I co-presented a session called “20 Operations Manager Tips in 20 Minutes” at Experts Live USA this year which brings together many of the SCOM tips and tricks we’ve accumulated over the years. That session wasn’t recorded, so we decided to publish the tips here for everyone to benefit from. It’s time to get your SCOM game on.

Taming A Game-Changer: Honeycomb and GraphQL at VendHQ

This guest post is from Evan Shaw, Lead Engineer at vendhq.com. GraphQL is a query language for APIs. It allows you to expose all your data through a single queryable graph. Compared to RESTful APIs, GraphQL brings greater flexibility in how your data is exposed, a more structured schema for type safety, and fewer round trips to your server for better latency. When we introduced GraphQL at Vend, the feedback from our frontend engineers was clear: “This is a game-changer.”

Three Insights from the World's Leading IT Organizations

IT is always changing, but never more so than now. With the increasing importance of the digital customer experience across all industries, technology is at a new inflection point in how it can fundamentally impact executive priorities. And while nobody can speak for the direction of IT as a whole, there are a few resources so comprehensive that they’re worth reviewing when they’re released.

Packet Errors, Packet Discards, and Packet Loss: What's the Difference?

It’s a question the Auvik support team receives often: “What’s the difference between packet errors, packet discards, and packet loss?” And if you’ve ever typed that sentence—or any other variation—into Google, you’ll know it’s a tricky answer to find. Until now. Before we break down the three packet terms, let’s first look at packets themselves.

Is just systems monitoring good enough?

We are often asked this question: we are monitoring our systems and keeping their uptime high, so isn’t that enough? Unfortunately, that tells only one side of the story. Yes, the systems are up, and their resource utilization may be well within limits. But that doesn’t tell us whether they are actually getting work done. In fact, low resource utilization can be very misleading, because utilization is also low when things are stuck waiting on a response from external services and nothing is being processed.

A Look at Healthchecks.io Hosting Setup, Summer 2019

For a monitoring service, uptime and reliability are of course critical features: customers are placing trust in the service to detect problems and deliver timely and accurate alerts. While I cannot guarantee that Healthchecks.io will absolutely never let you down, I can offer transparency on how it is currently being hosted and operated.