The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

See Your PagerDuty Account Clearly in 2020

What better way to start off the new year than reflecting on the past 12 months and conducting a retrospective of your organization’s systems, processes, and culture? For instance, what did your overall incident response look like in 2019? Was it a smooth and streamlined process, or did chaos reign during incident conference calls? When burning sage and holding magic crystals don’t refresh your office vibes or your incident response process, PagerDuty University has you covered.

Making Observability Actionable at Scale - Sisir Koppaka | DBS DevConnect 2019

Many organisations already possess a vast amount of existing data about production systems. As customer expectations evolve, organisations are often challenged to find more proactive ways of dealing with traditionally reactive incident response activity. In this talk, we discuss approaches to unlock value from this data by making it truly actionable.

Docker Commands Cheat Sheet

In this article I will highlight the six key Docker commands I use on a daily basis while using Docker in the real world. This is by no means an extensive list of commands; I kept it short on purpose so you can use it as a quick reference guide. I’ve also omitted the topic of building images and the commands associated with it.

OnPage Celebrates Successful Year

OnPage welcomes the new year with open arms. Though the team is excited for the new decade, we’d like to look back at our organizational growth and success in 2019. The previous year brought several Gartner mentions and the release of innovative new OnPage capabilities. This post details these notable accomplishments.

The Role of Live Event Notifications in Your Incident Response Plan

According to a study from the University of Maryland, a hacking attack occurs every 39 seconds. During a quick coffee break, your systems could be attacked up to a dozen times. Depending on how your alerts are set up, you might miss a dozen or more notifications. Missed or delayed alerts, and the resulting slow responses, provide attackers with more time. Every minute provides attackers another opportunity to damage your systems or steal your data.

Five IT Trends to Look Forward to in 2020

New Year’s Eve marks the transition into a new decade, beginning with personal resolutions and expectations for 2020. The same is true in the IT industry, where support teams expect to adopt trending technologies to reduce their mean time to repair (MTTR) and improve incident resolution. This post provides an in-depth look at five trends and discusses how growing technologies will streamline IT workflows in the new year.

Squadcast's Year in Review, 2019

We’re heading into 2020 with a platform full of features and a heart full of happiness! It’s the end of a decade, and this year has been nothing short of great for us! 2019 brought accelerated product growth, and our team doubled in size. We kick-started the year with a complete UI refresh and a whole bunch of new features. We also sponsored some of the major tech events and conducted our first-ever community-driven meetup!

Gartner Publishes New Report: Six Smart Steps to ITSM Tools

Information technology service management (ITSM) tools streamline and regulate how IT services are delivered. ITSM tools include help-desk software (e.g., ConnectWise Manage and ServiceNow) and monitoring software, which provide smart ticketing capabilities and live system statuses, respectively. Unfortunately, Gartner Research reports that organizations tend to buy ITSM tools beyond their needs. For instance, organizations purchase unnecessary capabilities and features when adopting new ITSM technology.

SREcon19 AsiaPacific - "Transparency in Incident Response" Lightning Talk

Squadcast is an incident management tool that’s purpose-built for SRE. Create a blameless culture by reducing the need for physical war rooms, centralize SLO dashboards, unify internal and external SLIs, automate incident resolution with Squadcast Actions, and create a knowledge base to handle incidents effectively.