
The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Kubernetes Health Check Using Probes

Kubernetes is an open source container orchestration platform that significantly simplifies an application's creation and management. Distributed systems like Kubernetes can be hard to manage, as they involve many moving parts and all of them must work for the system to function. Even if a small part breaks, it needs to be detected, routed and fixed. These actions also need to be automated. Kubernetes allows us to do that with the help of readiness and liveness probes.
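As a rough illustration of what the two probes check, the sketch below exposes a liveness endpoint and a readiness endpoint from a small Python HTTP server. The paths `/healthz` and `/ready`, the port 8080, and the `AppState` class are assumptions made for this example, not anything prescribed by the article. Kubernetes HTTP probes treat any status code from 200 to 399 as success; a failing liveness probe leads to a container restart, while a failing readiness probe takes the pod out of Service traffic.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer


class AppState:
    """Hypothetical application state consulted by the probe endpoints."""

    def __init__(self):
        self.alive = True    # False would mean the process is wedged
        self.ready = False   # True once startup work has finished


def probe_status(path, state):
    """Return the HTTP status code for a probe request path."""
    if path == "/healthz":   # assumed liveness path
        return 200 if state.alive else 500
    if path == "/ready":     # assumed readiness path
        return 200 if state.ready else 503
    return 404


def make_handler(state):
    """Build a request handler class bound to the given state."""

    class ProbeHandler(BaseHTTPRequestHandler):
        def do_GET(self):
            code = probe_status(self.path, state)
            self.send_response(code)
            self.end_headers()
            self.wfile.write(b"ok" if code == 200 else b"unavailable")

        def log_message(self, *args):
            # Silence per-request logging; probes fire frequently.
            pass

    return ProbeHandler


if __name__ == "__main__":
    state = AppState()
    state.ready = True  # pretend initialization has completed
    HTTPServer(("0.0.0.0", 8080), make_handler(state)).serve_forever()
```

In a pod spec, the container's `livenessProbe` and `readinessProbe` would each use an `httpGet` check pointed at the corresponding path and port.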

The BigPanda ScaleUp Journey: Human/AI Collaboration, Predictive Accuracy, and Scale Power in AIOps

At the beginning of the COVID-19 pandemic, we anticipated a slow-down in IT-related spending. In reality, the opposite occurred. Companies massively expanded their digital offerings using the same IT staff they’d had pre-pandemic, even as the teams lost access to many of their existing tools while working from home. This acceleration put immense pressure on IT teams everywhere, resulting in messy incident management, outages, and a huge shortage of talent.

xMatters Out Run Release Recap: Service-centric Automations, Callable Flows, and More!

What’s one of the fundamental principles of DevOps? Automation. There are many ways to leverage automation to facilitate DevOps practices for enabling consistency, reliability, and efficiency within the organization. That’s why we’re taking serious strides to ensure that xMatters can allow full automation and coordination of the many tools we use to make incident management easier and more efficient for front-line responders.

Creating Subscription Forms - xMatters Support

In xMatters, you can use subscriptions to ensure that you are always informed about certain events. These subscriptions will send you notifications whenever an event occurs that matches your pre-determined criteria, even if you are not directly targeted to receive a notification for that event.

Traditional vs Modern Incident Response

An incident is an event (network outage, system failure, data breach, etc.) that can lead to loss of, or disruption to, an organization's operations, services or functions. Incident response is an organization’s effort to detect, analyze and correct the hazards caused by an incident. In most cases, when incident response is mentioned, it relates to security incidents. Sometimes the terms incident response and incident management are used more or less interchangeably.

Finding a pricing model that's just right

Getting your pricing right is critical to the success of any SaaS company, but finding a model that works can be tough. Price too high, you won’t close enough deals - your business will fail. Price too low, your business model will be unsustainable - your business will fail. To add to the complication, when you’re a new startup your goals are evolving.

Putting the "Action" in Actionable Intelligence

AIOps combines machine learning and people to deliver technical outcomes in IT operations. The promise of this capability continues to drive new contenders to the market. AIOps has become a core messaging component for all the major event management players. Many have simply rebranded their products to highlight AIOps features. Emerging event management players have arrived and also tried to claim the AIOps space.

Can Endpoint Protection Keep up With Modern Threats?

Endpoint protection is a security approach that focuses on monitoring and securing endpoints, such as desktops, mobile devices, laptops, and tablets. It involves deploying security solutions on endpoints to monitor and protect these devices against cyber threats. The goal is to establish protection regardless of the endpoint’s location, inside or outside the network.