Operations | Monitoring | ITSM | DevOps | Cloud

Operations Management

The latest News and Information on IT Operations Management and related technologies.

PD Summit21: The Netflix Reliability Story: A Brief History of How We Evolved Resilience to Failure

In Netflix engineering, we’re driven by ensuring Netflix is there when you need it to be. We strive to provide a service that people love and can enjoy anytime, anywhere. An important foundation for bringing our customers joy is a strong focus on reliability that ensures Netflix will be available when they need it. In this talk, I’ll tell the story of how we've grown our reliability practices over time to meet the changing demands of microservices and distributed computing.

PD Summit21: Adopting and Maturing to Service Ownership with PagerDuty and Rundeck

Among the common goals of today's engineering and operations teams is to adopt a culture of service ownership: ""You build it, you own it."" As with many ancillary objectives to driving DevOps across an organization, this is easier said than done. Sometimes this is in small part due to the technology stack/architecture of a given company. But more often than not, this is because teams lack the human-to-technology mechanisms that allow for a culture of service ownership.

PD Summit21: Migrating to L1 Support to PagerDuty

Learn how Maersk transitioned from operating with an L1 support team to using PagerDuty to drive an efficient operational support model. In this talk you will learn how implementing PagerDuty within the platform SRE team was part of a major re-org with the goal of driving a new operations model for a highly available (99.999%) platform that lead to outstanding results. At Maersk, we saw increased efficiencies and reduced TTR along with other significant advantages of using PagerDuty from both on-call and management perspectives.

PD Summit21: AWS and PagerDuty: Better Together -- A Digital Transformation Journey

PagerDuty’s platform for real-time operations helps teams manage a complex transition from siloed and centralized approaches to multiple, distributed teams supporting a hybrid cloud infrastructure. To make this journey successful, one thing is clear: your people, technology, and operational processes need to be aligned in real time. That’s why we’re continuing to invest in our partnership with AWS. The integrations we’re bringing to market have always been centered on unlocking AWS’s unprecedented scale and agility for our joint customers.

PD Summit21: Sentry: Alert with Precision and Context Using Sentry + PagerDuty

Phillip Jones (Product Manager, Sentry) and Michael Aravopoulos (Solutions Consultant, PagerDuty) discover and triage their way through production errors using the new PagerDuty + Sentry integration. In this session, we will implement the PagerDuty integration and investigate low & high urgency error alerts.

PD Summit21: Sumo Logic: Streamline Incident Management to Drive Application Modernization

As application modernization drives an increase in complexity, managing the signals they generate becomes increasingly important in order to manage alert fatigue, mantain reliability, and accelerate innovation. Sumo Logic provides a unique, two-way integration with PagerDuty that collects incident messages from PagerDuty and populates pre-configured dashboards to provide a complete view of their alerts by displaying top incidents, escalations, teams and urgency, as well as providing the capability for users to send notifications to PagerDuty when critical conditions in their applications or infrastructure are detected in Sumo Logic.

PD Summit21: MUX: Video Observability: Operational Alerting for Responding to Issues In Real-time

Streaming video accounts for the majority of internet traffic and your applications and infrastructure almost certainly include video. Mux Data allows you to easily monitor the real-time quality of experience delivered to your video viewers and integrating with PagerDuty you can automate a response and reduce the time to resolution when something goes wrong. We will cover the basics of video monitoring and how integrating with PagerDuty can ensure a great experience for viewers.

What's New: Updates to Event Intelligence, Integrations, and More!

If you thought that the product announcements from PagerDuty’s largest event of the year, PagerDuty Summit 2021, was all we had in store for you, think again! We’re excited to announce that the July Release comes with a new set of updates and enhancements to the PagerDuty platform! You can learn about our latest capabilities via the Q1 PagerDuty Pulse or read below for the highlights.

4 benefits of combining ITSM and ITOM

IT management can be costly and time-consuming without streamlined processes and systems to support your business goals. With the quickened pace of business requiring faster scale, leaders and decision-makers must find ways to adapt and optimize their processes. Combining IT Service Management (ITSM) and IT Operations Management (ITOM) can help you prioritize operations efficiency while delivering the best service to your employees.