
The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Are your SLOs realistic? How to analyze your risks like an SRE

Setting up Service Level Objectives (SLOs) is one of the foundational tasks of Site Reliability Engineering (SRE) practices, giving the SRE team a target against which to evaluate whether a service is running reliably enough. The complement of your SLO is your error budget: how much unreliability you are willing to tolerate. For example, a 99.9% availability SLO leaves an error budget of 0.1%.
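To make the SLO and error budget relationship concrete, here is a minimal sketch of the arithmetic. The function name, the 30-day window, and the use of minutes as the unit are assumptions chosen for illustration; they do not come from the article.

```python
def error_budget_minutes(slo_percent: float, window_days: int = 30) -> float:
    """Return the minutes of unreliability an SLO tolerates over a window.

    Hypothetical helper: the 30-day default window is an assumption.
    """
    if not 0 < slo_percent <= 100:
        raise ValueError("slo_percent must be in the range (0, 100]")
    total_minutes = window_days * 24 * 60
    # The error budget is the complement of the SLO: 100% minus the target.
    return total_minutes * (100 - slo_percent) / 100


if __name__ == "__main__":
    for slo in (99.0, 99.9, 99.99):
        print(f"SLO {slo}% -> {error_budget_minutes(slo):.1f} min per 30 days")
```

Under these assumptions, a 99.9% SLO over 30 days allows roughly 43 minutes of downtime, which gives a quick sanity check on whether a target is realistic for a given service.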


How to implement a Blameless Postmortem (part two)

This is Part 2 of a two-part series on Blameless Postmortems. The previous article went into why blameless postmortems are so effective; this second part goes into detail on how to build your own postmortem process and kick it into overdrive. Read Part 1 here. So you've read our first installment and recognized the value of the blameless postmortem for efficiency, culture, and output. Now you're ready to get off the blame train and kickstart a blameless postmortem process of your own. Where to begin?

May 2022 Update - Templates, scheduler enhancements, landline numbers, and more

Our May update brings Signl templates for manual alerting, improvements to duty scheduling, and various enhancements in the web portal. Another new feature is the ability to notify users by calling landline numbers. All details can be found in this blog article.

SIEM: Introduction to SIEM and 4 Top SIEM Tools

Security Information and Event Management (SIEM) technology has become a fundamental part of identifying and guarding against cyber attacks. It is one of the essential technologies powering the modern security operations center (SOC). SIEM is an umbrella term that includes multiple technologies, including log management, security log aggregation, event management, event correlation, behavioral analytics, and security automation.

Derdack SIGNL4 Joins Microsoft Intelligent Security Association (MISA)

Today, Derdack SIGNL4 (www.signl4.com), provider of critical alerting and anywhere incident response for SecOps teams, announced it has joined the Microsoft Intelligent Security Association (MISA), an ecosystem of independent software vendors and managed security service providers that have integrated their solutions to better defend against a world of increasing threats.

Interlink Software: Enterprise AIOps Platform Mobile App

To protect the availability of the services your customers rely on, AIOps adoption is imperative for large enterprises. Interlink Software’s AIOps platform applies machine learning to automate ITOps: reducing alert noise and performing event correlation, anomaly detection, and root cause determination. As the world emerges from the Covid-19 pandemic, organizations are increasingly embracing the flexibility of home and hybrid working.

AlertOps And BMC Partner To Reduce Incident Resolution Times

Chicago, IL – April 27, 2022 – AlertOps, a major incident response orchestration platform, today announced a technology integration partnership with BMC Helix, a service management platform. This new relationship gives Helix users intelligent alerting, advanced escalation policies, schedule management, and workflow automations that help complex enterprise teams rapidly remediate major incidents.

Monthly Moo | April 2022

We are well into 2022 and are busy bringing exciting new features to market. Our customers continue to provide input into our product roadmap, and many new features are based on this collaborative effort. A big thank you to our valued customers. Throughout the year we will continue to drive innovation and allow customers of all sizes to implement the most advanced AIOps solution in the shortest time possible.

Logbook: Team Discussion and Full Incident History

We've launched a feature that will help you fix errors and performance issues as a team! 🎉 With Logbook you get the full incident history. Read and leave team comments, see which notifications were sent at what time, and see team activity for changes in incident states. It's now easier than ever to see what the current state of an incident is.