The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

The differences between reactive vs proactive incident response

Jul 10, 2023 By xMatters In xMatters

Most commonly, businesses take a reactive approach to incident management. After all, the concept of incident response seems inherently reactive. However, it is possible—and often necessary—to take more proactive measures. This entails identifying potential problems and taking steps to remediate them before they become incidents.

Effective incident escalations

Jul 10, 2023 By Chris Evans In Incident.io

In the ever-evolving digital landscape, every organization must confront its fair share of incidents. Regardless of the sector or size, one common thread weaves through them all: the need for effective incident management. A crucial part of this management is incident escalation, a topic on which we've had many discussions with various companies.

Better security for your app's secrets

Jul 10, 2023 By Lawrence Jones In Incident.io

The incident-io/core application uses a mixture of environment variables, config files and secrets stored in Google Secret Manager to configure the app. This is a reference guide to all the parts that make up this flow.

5 Takeaways from Gartner's Latest AIOps Analysis

Jul 6, 2023 By Moogsoft Team In Moogsoft

If you’re still unpacking the latest terminology from Gartner’s 2023 AIOps market update, you aren’t alone. Subject matter experts from Moogsoft recently joined thought leaders from TIAA and Windward Consulting for a debrief on the panel interview Accelerating Your AIOps Journey Webinar. Almost half of technology leaders looking to improve productivity and fuel greater collaboration are struggling to explain AIOps use cases, benefits, and value to other business leaders.

Incident severity: why you need it and how to ensure it's set

Jul 5, 2023 By Mike Lacsamana In FireHydrant

Defined severity levels quickly get responders and stakeholders on the same page on the impact of the incident, and they set expectations for the level of response effort — both of which help you fix the problem faster. But sometimes, for whatever reason, a severity level just doesn’t get set. Maybe there’s confusion around what severity level to use. Or maybe you have a low barrier to declaration and your responders just need a little nudge.

Improve MTBF and MTTR for your Application Platforms by using MESH Observability

Jul 3, 2023 By Navdeep Sidhu In meshIQ

When businesses assess the performance of their platforms, two of the most useful incident management metrics are Mean Time Between Failures (MTBF) and Mean Time To Resolution (MTTR). Together, these measurements indicate the health and speed of the system, as well as the platform's ability to handle detected anomalies or flag them for others to resolve.
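
As a rough illustration of how these two metrics can be computed, the sketch below uses a hypothetical incident log of (failure start, service restored) pairs. It assumes the common definitions of MTTR as average downtime per incident and MTBF as total uptime in an observation window divided by the number of failures; the data, the window, and the function names are made up for the example and do not come from the post.

```python
from datetime import datetime, timedelta

# Hypothetical incident log: (failure start, service restored) pairs.
INCIDENTS = [
    (datetime(2023, 7, 1, 9, 0), datetime(2023, 7, 1, 9, 30)),
    (datetime(2023, 7, 3, 14, 0), datetime(2023, 7, 3, 15, 0)),
    (datetime(2023, 7, 6, 2, 0), datetime(2023, 7, 6, 2, 30)),
]


def total_downtime(incidents):
    """Sum the duration of every incident."""
    return sum((end - start for start, end in incidents), timedelta())


def mttr(incidents):
    """Mean Time To Resolution: average downtime per incident."""
    return total_downtime(incidents) / len(incidents)


def mtbf(incidents, window_start, window_end):
    """Mean Time Between Failures: uptime in the window per failure."""
    uptime = (window_end - window_start) - total_downtime(incidents)
    return uptime / len(incidents)


if __name__ == "__main__":
    # Assumed observation window of one week.
    window = (datetime(2023, 7, 1), datetime(2023, 7, 8))
    print("MTTR:", mttr(INCIDENTS))
    print("MTBF:", mtbf(INCIDENTS, *window))
```

A falling MTTR alongside a rising MTBF over successive windows would suggest that incidents are becoming both shorter and less frequent.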

Tips on making on-call manageable

Jul 3, 2023 By Ritika Bramhe In OnPage

On-call responsibilities are a crucial part of many industries, ensuring that businesses can provide round-the-clock support to their customers. However, the demanding nature of on-call duty can lead to burnout and reduced productivity if not managed effectively. In this article, we will explore various strategies and tips to make on-call more manageable, enabling professionals to maintain a healthy work-life balance and deliver exceptional service.

Carrier reduced MTTR and gained visibility across multiple IT environments

Jul 3, 2023 By LogicMonitor In LogicMonitor

Hear Rich Johnston, Director of Hosting Platforms, describe Carrier’s observability goals to create a unified view of their IT environment for predictive monitoring. Rich describes Carrier’s desire to see issues before customer complaints, and how LogicMonitor implemented extensive visibility on a single platform, including multiple cloud platforms, networking, compute, storage, and more. LogicMonitor helped Carrier quickly and easily deploy dashboards to see how their technology performed, while reducing root cause analysis and shortening resolution time.

Docker Compose Logs: Guide & Best Practices

Jul 2, 2023 By Squadcast Community In Squadcast

Docker Compose is a tool for defining and running multi-container Docker applications. It allows developers to streamline the process of configuring, building, and running multiple containers as a single unit with a docker-compose.yml. This configuration file specifies the services, networks, and volumes required for an application, and their relationships and dependencies. The docker-compose logs command displays the logs of all services defined in the docker-compose.yml file.
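
To make the logs command concrete, here is a minimal sketch of a helper that assembles a docker-compose logs invocation. The helper name and its parameters are hypothetical; the flags it emits (--follow, --timestamps, --tail) are standard options of the logs subcommand, and the service names are placeholders. It only builds the argument list and leaves running it to the caller.

```python
import shlex


def compose_logs_command(services=(), follow=False, tail=None, timestamps=False):
    """Build an argument list for `docker-compose logs` (hypothetical helper)."""
    cmd = ["docker-compose", "logs"]
    if follow:
        cmd.append("--follow")        # keep streaming new log lines
    if timestamps:
        cmd.append("--timestamps")    # prefix each line with a timestamp
    if tail is not None:
        cmd.append(f"--tail={tail}")  # only the last N lines per container
    cmd.extend(services)              # no services means all services
    return cmd


if __name__ == "__main__":
    # "web" and "db" are placeholder service names.
    cmd = compose_logs_command(["web", "db"], follow=True, tail=100)
    print(shlex.join(cmd))
    # To actually run it, pass the list to subprocess.run(cmd, check=True).
```

Passing no service names shows logs for every service in the file, which matches the default behavior described above.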

How Schneider Electric reduced MTTI and alert noise by consolidating monitoring tools

Jul 1, 2023 By LogicMonitor In LogicMonitor

Hear Observability and Monitoring Strategist, Arun Mandayam, describe challenges that Schneider Electric faced around data interpretation and difficulties when using multiple monitoring tools. Arun describes how LogicMonitor helped consolidate monitoring tools, enabled them to onboard new cloud accounts, network devices, and on-prem systems on a unified platform, and helped significantly reduce MTTI and alert noise.
