Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

How we achieved pixel-perfect polish during our Status Pages launch

Jul 13, 2023 By Dimitra Zuccarelli In Incident.io

A few months ago, we released Status Pages. This project was quite different from anything we’ve approached before, and our goals were a departure from ones we had set in the past. With this in mind, we worked closely with our designer throughout the process of building Status Pages. Here is how we approached it and a few lessons we learned along the way!

Catalog vs. Thanos: Who came out on top?

Jul 13, 2023 By incident.io In Incident.io

Catalog is really, really powerful. To prove it, our latest product went up against the almighty Thanos and won decisively. Don’t believe us? Just look at how unscathed Catalog was once the dust settled. All jokes aside, we spent months building out what we think is one of the most capable products on the market today. Designed to be a map of everything that exists in your organization, Catalog can meaningfully help you level up your incident response.

Powering ConnectWise PSA With a New Alerting Workflow

Jul 13, 2023 By Ritika Bramhe In OnPage

In our previous blog from the ConnectWise series titled “OnPage-ConnectWise Incident Alert Management Workflows,” we discussed how customers are optimizing their investments in ConnectWise PSA. Now, we’re excited to present a new and powerful workflow specifically designed for after-hours that addresses the evolving needs of IT and Managed IT clients.

Understanding Chaos Engineering and its Benefits

Jul 12, 2023 By Anjali Udasi In Zenduty

In today's fast-paced technological landscape, ensuring the resilience and dependability of systems is crucial. This is where Chaos Engineering comes in, transforming how organizations approach system testing and fortification. Chaos Engineering helps find vulnerabilities that could go undetected under normal circumstances by purposefully introducing controlled interruptions and failures.

MTTR vs. MTBF vs. MTTF: Understanding Failure Metrics

Jul 12, 2023 By Pavithra Parthiban In Atatus

In the dynamic landscape of software and web applications, failures can have severe consequences, impacting user experience, business continuity, and overall performance. To proactively address these challenges, organizations rely on robust monitoring practices supported by failure metrics. Failure metrics, specifically tailored to software and web application monitoring, provide crucial insights into system health, reliability, and optimization opportunities.

Correlation & Collaboration Product Enhancements

Jul 12, 2023 By Moogsoft Team In Moogsoft

Moogsoft continues to prioritize Correlation and Collaboration – Check out these product enhancements!

The Importance of Log Monitoring for Incident Response

Jul 12, 2023 By Ritika Bramhe In OnPage

In the face of growing security threats and incidents, businesses must prioritize their ability to detect, investigate, and respond effectively. Timely incident response is crucial for maintaining the security and integrity of systems and data. Among the essential tools in the incident response arsenal, log monitoring stands out as a critical component. By closely analyzing logs, organizations gain valuable insights into system events, user activities, and network traffic.

26 DevOps Automation Tools that SaaS Loves in 2023

Jul 12, 2023 By Emily Arnott In Blameless

DevOps is a term combining “development” and “operations”. It involves the use of tools and processes to minimize the time and effort spent on software creation and maintenance. Many DevOps technologies use automation to reduce manual tasks. Some of these DevOps automation tools use AI-based technology to take over manual operations, while others rely on simpler scripting and processing. This speeds up feedback and improves performance between development and operations departments.

SIGNL4 Onboarding: Alert Notifications & Handling

Jul 12, 2023 By SIGNL4 In SIGNL4

The SIGNL4 Onboarding series walks users through the processes of SIGNL4, from Signup to Alerts to Settings. Today's video focuses on receiving alerts and all of the options available inside your SIGNL4 alerts. This video is packed with helpful tips to help you get the most out of your account.

The Unplanned Show, Episode 4: Sriram Subramanian on Responsible Generative AI

Jul 12, 2023 By PagerDuty In PagerDuty

Generative AI is a rapidly evolving ecosystem that is attracting a lot of attention. In this episode, Dormain Drewitz asks Sriram Subramanian about the main challenges of implementing generative AI responsibly, including content that’s harmful, inaccurate, or in violation of privacy or security standards. Sriram discusses Microsoft’s 6 tenets of responsible generative AI, as well as the notion of shared responsibility between platform providers and foundational LLMs, and the developers and data engineers building on top of them. Sriram also answers questions about where to get started safely with generative AI and shares his framework for identifying opportunities to add value.
