Latest Posts

Why SREs Need to Embrace Chaos Engineering

Reliability and chaos might seem like opposite ideas. But, as Netflix learned in 2010, introducing a bit of chaos—and carefully measuring the results of that chaos—can be a great recipe for reliability. Although most software is created in a tightly controlled environment and carefully tested before release, the production environment is harsher and much less controlled.

The Improved xMatters Group Experience: Product Feature Updates

We’re constantly looking for new ways to help DevOps, SREs, and operations teams automate operations workflows, secure infrastructure and applications, and rapidly deliver their products at scale. This commitment to our customers — and yours! — led us to redesign the way you experience groups in xMatters.

What It Means to Be an Incident Commander

Leadership is essential in an organization. Establishing a leadership hierarchy helps teams avoid getting confused about who to turn to with questions and concerns, allowing them to focus their efforts where needed. High-quality leadership is vital to success but becomes even more important when the pressure to resolve an issue with minimal downtime is turned up.

Sponsored Post

Best Practices for Communicating with Customers During an Outage

Incidents are unavoidable when running a business. When an incident does inevitably occur, communication is critical while your teams are working to minimize the impact and expedite a solution. For technical resolvers, the first steps during an incident are to look for any leads that point to the source of the issue. Customer service and communications teams, however, must prioritize establishing effective communication with impacted users. Both teams have the right frame of mind, but they need to be aligned. This becomes more complicated when the incident is an outage.

Introducing xMatters New Integration with Everbridge Signal

When Russia invaded Ukraine on February 24, 2022, it sent ripples through many markets. Ukrainian car factories which supplied Europe were interrupted, oil and gas supply from Russia was throttled, and the supplies of steel, sunflowers, corn, and wheat were affected. Prices of sugar and petroleum surged, a threat of long-lasting high inflation emerged, and social unrest began to foment, with cyber-attacks coming both out of and going into Russia.

How To Build an Escalation Policy for Effective Incident Management

Regardless of your organization’s size, industry, or security measures, you will inevitably face IT incidents. But what do you do if an incident affects a critical system and your on-call responders can’t resolve it? Does your team have a set of clearly outlined next steps they should take to handle the issue? Answering these questions can be complicated, even more so for large organizations that rely on cloud-based services to fuel their IT environment.

Sponsored Post

Major Incident Process Is at the Heart of Effectiveness

Read the new white paper on major incident management. Businesses need to be prepared for minor and major incidents to happen to their technologies, be it an integration disconnecting or an entire system being taken offline. Preparation ensures not only that losses can be minimized, but also that businesses can protect themselves and potentially their clients from risky impacts.

When Does a Problem Become an Incident?

Incident management is a practice that seeks to resolve business-impacting events in the most efficient manner possible. But not every problem that arises requires an incident response, and it’s crucial that teams know the difference between a problem and an incident. Responding to problems may be part of daily routines, or small ad hoc projects that don’t require more than one resource or a significant time commitment.

Sponsored Post

Your Goals Could Be Holding Your DevOps Teams Back

In the era of Agile, organizations are increasingly moving their IT service management teams toward a DevOps world. There are significant challenges to transforming ITSM to DevOps, but one of the most significant is goal setting. In today's fast-paced business environment, it's important to establish the parameters for measuring success and determine which objectives teams need to meet to accomplish business goals.