Latest News

Log analytics and dashboarding in Datadog

Sep 19, 2018 By Stephen Lechner In Datadog

Achieving optimal performance can be challenging when you depend on separate platforms to monitor service health and to manage your logs. When data about your systems is spread across multiple platforms, investigating issues—and ultimately resolving them—takes longer and requires expertise with more tools. It takes more effort to identify real customer impact, as well as to verify that your responses to an incident are having the desired effect.

Read Post

Datadog

Read more about Log analytics and dashboarding in Datadog

Managing Python Processes with PM2

Sep 19, 2018 By Alexandre Strzelewicz In PM2

PM2 is a production-grade process manager that makes management of background process easy. In the Python world we could compare PM2 to Supervisord, but PM2 has some nifty features you might like. With PM2, rolling restarts, monitoring, checking logs and even deploying application has never been that simple. We really value CLI UX, so PM2 is really simple to use and master.

Read Post

PM2

Read more about Managing Python Processes with PM2

Monitoring Social Signals to Reduce Alert Fatigue With SignalFx and PagerDuty

Sep 19, 2018 By Arijit Mukherji In PagerDuty

“I need to be notified if there’s a significant event ongoing with SignalFx.” This is what I tell my team. However, despite being the CTO of a monitoring company, creating the right set of alerts for me to stay informed of incidents in progress or potential issues was harder than it seemed at first glance. Why?

Read Post

PagerDuty

Read more about Monitoring Social Signals to Reduce Alert Fatigue With SignalFx and PagerDuty

Massachusetts Natural Gas Explosions - A Lesson in The Importance of Alert Automation

Sep 19, 2018 By Shawn Lazarus In OnPage

The pressure in the natural gas pipelines under three Massachusetts communities spiked to 12 times their normal level last week, just before the explosions and fires that destroyed dozens of homes and killed an 18-year-old man. Columbia Gas went under fire for their mismanagement of the incident. The NTSB says a Columbia Gas control room in Columbus, Ohio, registered pressures of 6 pounds per square inch last Thursday in pipelines that are intended to carry just 0.5 PSI.

Read Post

OnPage

Read more about Massachusetts Natural Gas Explosions - A Lesson in The Importance of Alert Automation

Saving lives by ensuring uptime of mission-critical IT at Gift of Hope

Sep 19, 2018 By Derdack In Derdack

Gift of Hope Organ & Tissue Donor Network is a non-profit organ procurement organization that coordinates organ and tissue donation and provides public education on donation in Illinois and northwest Indiana. As one of 58 OPOs that make up the nation’s donation system, Gift of Hope works with 180 hospitals and serves 12 million people in their donation service area.

Read Post

Derdack

Read more about Saving lives by ensuring uptime of mission-critical IT at Gift of Hope

Alert fatigue, part 2: alert reduction with Sensu filters & token substitution

Sep 19, 2018 By Ben Abrams In Sensu

In my previous post, I talked about the real costs of alert fatigue — the toll it can take on your engineers as well as your business — and some suggestions for rethinking alerting. In part 2 of this series, I’ll share some best practices for fine-tuning Sensu to help reduce alert fatigue.

Read Post

Sensu

Read more about Alert fatigue, part 2: alert reduction with Sensu filters & token substitution

Sentry + Microsoft Azure DevOps: Error-Tracking, Crash-Reporting, & More

Sep 18, 2018 By Erin Dame In Sentry

Sentry is updating our key integrations for Azure DevOps (formerly VSTS). With these tightly-woven integrations, developers (like you) can unlock enhanced release tracking, informative deploy emails, and assignee suggestions for new errors. Route alerts to the right person based on the Azure DevOps commit that caused the issue, cutting remediation time to five minutes.

Read Post

Sentry

Read more about Sentry + Microsoft Azure DevOps: Error-Tracking, Crash-Reporting, & More

ManageEngine Strengthens Endpoint Security with the Launch of Browser Security Plus at London User Conference

Sep 18, 2018 By ManageEngine In ManageEngine

LONDON - Sept. 18, 2018 - ManageEngine, the real-time IT management company, today announced its launch of Browser Security Plus, a browser management solution that helps organisations secure their corporate data in the cloud and protect their networks from web-based cyberattacks. Available immediately, Browser Security Plus provides organisations with a layer of management capabilities for browsers and their add-ons to maintain robust enterprise security.

Read Post

ManageEngine

Read more about ManageEngine Strengthens Endpoint Security with the Launch of Browser Security Plus at London User Conference

Connect Insights to Real-Time Action With PagerDuty Visibility

Sep 18, 2018 By Jeremy Bourque In PagerDuty

Have you ever gotten that dreaded text from your boss: “The site is down”? Maybe you were meeting with a customer. Or having dinner with your family. Maybe you were presenting at a conference. Doesn’t matter. Whatever else you were doing, now you’re doing emergency incident communication too. You check in with your team leads and confirm there is a problem. You let your boss know the response is under way.

Read Post

PagerDuty

Read more about Connect Insights to Real-Time Action With PagerDuty Visibility

The Honeycomb Beeline for Go v2 is...Go!

Sep 18, 2018 By Ben Hartshorne In Honeycomb

We’ve seen folks do amazing things using our Honeycomb Beelines–getting their apps instrumented in next-to-no time, expanding their observability, growing their understanding of what is happening in their code in production. Now, prepare to do more with improved tracing support!

Read Post

Honeycomb

Read more about The Honeycomb Beeline for Go v2 is...Go!

Operations | Monitoring | ITSM | DevOps | Cloud

Log analytics and dashboarding in Datadog

Managing Python Processes with PM2

Monitoring Social Signals to Reduce Alert Fatigue With SignalFx and PagerDuty

Massachusetts Natural Gas Explosions - A Lesson in The Importance of Alert Automation

Saving lives by ensuring uptime of mission-critical IT at Gift of Hope

Alert fatigue, part 2: alert reduction with Sensu filters & token substitution

Sentry + Microsoft Azure DevOps: Error-Tracking, Crash-Reporting, & More

ManageEngine Strengthens Endpoint Security with the Launch of Browser Security Plus at London User Conference

Connect Insights to Real-Time Action With PagerDuty Visibility

The Honeycomb Beeline for Go v2 is...Go!

Monthly Archive

Follow Us