Operations | Monitoring | ITSM | DevOps | Cloud

Operations Management

The latest News and Information on IT Operations Management and related technologies.

APAC Retrospective: Learnings from a Year of Tech Outages: Reactive to Proactive

As we reach the end of our blog series on the occurrences in 2023 from the fourth installment of our blog series, Restore: Repair vs. Root Cause, the unavoidable truth is that incidents are a universal challenge for organisations, regardless of their scale or field. In the APAC region, there’s a noticeable increase in regulatory bodies imposing strict penalties on major companies for service failures.

APAC Retrospective: Learnings from a Year of Tech Outages, Restore: Repair vs Root Cause

As our exploration of 2023 continues from the third-part of our blog series, Dismantling Knowledge Silos, one undeniable fact persists: Incidents are an unavoidable reality for organisations, irrespective of their industry or size. Recent APAC trends show that regulatory bodies are cracking down harder on large corporations for poor service delivery, imposing harsh penalties as a result of the negative consequences.

APAC Retrospective: Learnings from a Year of Tech Outages - Dismantling Knowledge Silos

As our exploration through 2023 continues from the second blog segment, “Mobilise: From Signal to Action”, one undeniable fact persists: Incidents are an unavoidable reality for organisations, irrespective of their industry or size. In the APAC region, a surge in regulatory enforcement has been observed against large corporations failing to meet service standards, resulting in severe penalties.

Modernize your ITSM with the New PagerDuty Application for ServiceNow

We live in an always-on world, where things move fast and break often. Building stronger resilience is critical for operational efficiency and delivering great customer experiences. CIOs have heavily invested in ITSM solutions, but a centralized, queued approach is no longer meeting the needs of modern organizations when it comes to critical, customer-impacting issues.

Unlocking the Value of your Runbook Automation Value Metrics with Snowflake, Jupyter Notebooks, and Python

This blog was co-authored by Justyn Roberts, Senior Solutions Consultant, PagerDuty Automation has become an integral piece in business practices of the modern organization. Oftentimes when folks hear “automation,” they think of it as a means to remove the manual aspect of the work and speed up the process; however, what lacks the spotlight is the value and return automation can offer to an organization, a team, or even just one specific process.

APAC Retrospective, Part 2: Mobilise: From Signal to Action

Continuing our series on 2023 learnings from APAC, it’s increasingly evident that incidents in organisations are not a matter of ‘if’ but ‘when,’ regardless of their size or industry. Recently, the APAC region has been witnessing regulatory bodies taking stricter actions against major companies for subpar services, leading to substantial penalties.

Practitioners Share How They Remove the Fear of On-Call

Being on-call isn’t likely to be the most enjoyable aspect of a job. In fact, there might be a certain level of stress and fear around engineering teams about going on call: maybe the page will be missed, or maybe a page will come in at 2am and require troubleshooting a production issue for hours.

APAC Retrospective: Learnings from a Year of Tech Turbulence

Throughout 2023, one thing has become abundantly clear: regardless of an organization’s size or industry, incidents are inevitable. Recently across the APAC region, we’ve seen numerous regulatory bodies clamp down on large companies who are failing to provide acceptable service, with some handing out quite severe penalties. For many, the cost of an incident is no longer just lost revenue and customer trust, but financial penalties and business restrictions.

Build Operational Resilience with Generative AI and Automation

For modern enterprises aiming to innovate faster, gain efficiency, and mitigate the risk of failure, operational resilience has become a key competitive differentiator. But growing complexity, noisy systems, and siloed infrastructure have created fragility in today’s IT operations, making the task of building resilient operations increasingly challenging.