Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

Outages Happen. Now What?

Network outages happen more often than you think. We may not experience them directly or even know they're occurring at all. When outages affect household names like Facebook, Amazon, Microsoft, and others, however, we're sure to find out after the fact that there was an issue. Depending on the user's activities and the duration of the issue, stress and frustration levels can vary. When a marketer can’t get that ground-breaking advertisement up on Facebook, they can get antsy.

Webinar Recap: How Observability Impacts SRE, Development, and Security Teams

In today’s fast paced and constantly evolving digital landscape, observability has become a critical component of effective software development. Companies are relying more on and using machine and telemetry data to fix customer problems, refine software and applications, and enhance security. However, while more data has empowered teams with more insights, the value derived from that data isn’t keeping pace with this growth. So how can these teams derive more value from telemetry data?

GitHub Tried to Change the Checksum for Release Archives. You Should Start Hosting Your Own.

Yesterday, GitHub changed how the archives they provided are made. The result of this change surprised developers, triggering pipeline failures all over the world in most ecosystems. According to this GitHub post, this is a consequence of recent changes to Git itself, released almost six months ago and just deployed within GitHub now with unforeseen impact. This change has thankfully been retracted.

Datadog's commitment to OpenTelemetry and the open source community

The OpenTelemetry (OTel) project is an open source initiative with the goal of providing vendor-neutral standards and tools that enable users to collect telemetry from any source in their environment and send it to any backend. A core tenet of Datadog is to provide a single, unified platform for customers to easily collect and monitor all of their observability data, regardless of where it comes from.

Test Observability with Sumo Logic

The software industry has seen many evolutions. There is a new disruption in the market every five years or so. Software testing cannot remain isolated from all the latest trends and technologies. Testing strategies need to keep up with agile development, faster deployments and increasing customer demand for reliability and user-friendly interfacing. They need to be able to grow just as quickly and just as reliably as the business logic.

3 Key Questions to Ask Before Getting Started with Kubernetes

If you need to deploy a lot of microservices at once and manage them at scale, Kubernetes is hard to beat. But Kubernetes also brings additional complexity that you just might not need. You would be smart to ask yourself these three questions before getting started with Kubernetes.

What is ITSM? A Guide to IT Service Management

ITSM, or IT Service Management, is a holistic approach to managing and delivering technology-based services to an organization. It involves various aspects such as planning, designing, building, deploying, maintaining, and improving IT services to meet the needs of both internal and external stakeholders. The goal of ITSM is to consistently improve the quality and speed of service delivery and, ultimately, enhance customer satisfaction.

A Complete Guide to PagerDuty Alternatives

Exploring Options for Incident Management: A Comparison of PagerDuty and Other Tools Effective incident response is crucial for managing operational issues and resolving them in a complex technology environment. With the increasing complexity of systems built from numerous services, it is important for companies to have a way to keep these systems running smoothly.

What's New: January 2023

We’re excited to announce a new set of updates and enhancements to the PagerDuty Operations Cloud. Recent development and app updates from the product team include Incident Response, PagerDuty® Process Automation, the PagerDuty Mobile App, Integrations, as well as Community & Advocacy Events updates. We continue to help customers further automate to optimize cloud operations and reduce the amount of issues escalated to other teams. Get started now and learn about.