Blog

Bullet-proof MSP critical alerting

Dynamic Network Solutions, located in the greater Washington, D.C. area, prides itself on providing the highest-quality customer service. The company works closely with its customers to manage their needs and gives them significant training to help avoid security incidents and downtime. Dynamic Network Solutions maintains many touchpoints with its clients to deliver great service and keep networks and technologies always running.

Server monitoring best practices: 9 dos and don'ts

Have you ever been responsible for an application and been the last to know about an outage? I have, and it’s terrible. You check your phone over morning coffee and see a flood of missed calls and tons of emails. Customers are angry. Your boss is demanding to know what’s happening. Even the company’s executives are involved. How did this happen?

The State of Operations Health in the World of DevOps

At PagerDuty, we believe the best way to truly understand the health of your employees is to leverage the real-time human data that is already flowing through your systems. PagerDuty’s platform for action and real-time IT Operations orchestration consists of multiple facets and interlocking capabilities.

How To Share Your Brain: Collaborate With Boards

As we are fond of saying here at Honeycomb, context is king, and one of our favorite ways to share the context in our brains is with Boards. We recommend using Boards to share query structures you’ve developed for reuse, share visual graphs for ongoing review of systems, share your brain with your colleagues…and your future self.

Monitoring and securing Java apps at Quby

Moving to a Docker-based cloud for Java apps orchestrated by Mesos Marathon required a different approach to monitoring and security for Quby, the Amsterdam-based developer of smart home solutions and maker of smart thermostat and service platform ‘Toon.’ That’s when they found Sysdig. The Sysdig Cloud-Native Intelligence Platform helps Quby resolve issues faster, and reduces monitoring system administration effort by 400%.

Restricting CFEngine to one CPU core using Systemd

In some performance-critical situations, it makes sense to limit management software to a single CPU (core). We can do this using systemd and cgroups. CFEngine already provides systemd units on relevant platforms; we just need to tweak them. I’m using CFEngine Enterprise 3.12 on CentOS 7, but the steps should be very similar on other platforms/versions.
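As a rough illustration of what "tweaking the units" could look like, the sketch below generates systemd drop-in override files that pin a service to one core. It is not taken from the original post: it uses the `CPUAffinity=` setting from systemd.exec, which sets scheduler affinity rather than a cgroup limit, so the post's own steps may differ. The unit names, the `cpu-affinity.conf` file name, and the helper functions are all assumptions made for illustration.

```python
"""Sketch: generate systemd drop-ins that pin services to a single CPU core."""
from pathlib import Path

# Assumed CFEngine unit names (hypothetical for your install);
# confirm with: systemctl list-unit-files 'cf-*'
ASSUMED_UNITS = ["cf-execd.service", "cf-serverd.service", "cf-monitord.service"]


def render_dropin(cpu_index: int) -> str:
    """Return drop-in text that restricts a service to one CPU index."""
    if cpu_index < 0:
        raise ValueError("cpu_index must be non-negative")
    # CPUAffinity= is a [Service] setting documented in systemd.exec.
    return f"[Service]\nCPUAffinity={cpu_index}\n"


def write_dropins(units, cpu_index, root):
    """Write <root>/<unit>.d/cpu-affinity.conf for each unit; return the paths."""
    content = render_dropin(cpu_index)
    written = []
    for unit in units:
        dropin_dir = Path(root) / f"{unit}.d"
        dropin_dir.mkdir(parents=True, exist_ok=True)
        path = dropin_dir / "cpu-affinity.conf"  # hypothetical file name
        path.write_text(content)
        written.append(path)
    return written


if __name__ == "__main__":
    # Only print what would be written; nothing on the system is touched.
    for unit in ASSUMED_UNITS:
        print(f"# {unit}.d/cpu-affinity.conf")
        print(render_dropin(0))
```

On a real host the drop-in root would normally be `/etc/systemd/system`, and the change takes effect after `systemctl daemon-reload` and a restart of the affected units. If a cgroup-based limit is preferred, `CPUQuota=100%` from systemd.resource-control caps a unit at one core's worth of CPU time without pinning it to a specific core.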

The Serverless Revolution: Why and How The Movement Will Allow Teams to Deploy With More Velocity and Confidence

Serverless, or Function-as-a-Service (FaaS), design patterns have been picking up steam. With the recent release of Knative from Google Cloud, let’s take a closer look at the serverless movement.

Splunk vs SumoLogic vs ELK

From production monitoring to security concerns, it’s critical for businesses to analyze and review their log data. This is particularly true for large and enterprise companies, where the sheer amount of data makes log analysis the most efficient way to track key indicators. CTOs, in particular, are dealing with the challenges of this massive amount of data flowing through their organization, including how to harness it, gather insights from it, and secure it.