Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Automating Work in Real Time Through the PagerDuty Operations Cloud

Earlier this fall, we announced a significant evolution in the IT process automation portfolio at PagerDuty—the general availability of PagerDuty Rundeck Actions and early access for Rundeck Cloud. These new offerings reflect our vision to enable companies to take real-time actions by democratizing access to automation. In other words, to quickly and safely delegate automated IT processes to the IT users (and APIs) that need them to get work done.

Sponsored Post

The Principles of DevSecOps

As a Solution Architect here at xMatters, an Everbridge Company, and through my 30-year career in the IT industry, I've seen many frameworks offering bold new ideas. CMMI, ITIL, Prince 2, Agile, Scrum, and most recently, DevOps. These frameworks come and go, offering huge improvements in the way we deliver and manage our IT capabilities, but never lasting long enough to act on those promises. That's not to say they haven't made a marked difference in the IT space, or that they haven't been hugely impactful for organizations around the globe. They become launching off points for a new framework, and now there's a new term that's appeared, DevSecOps.

What Does ROI Really Mean?

ROI might be one of the most popular business acronyms in recent memory, and business to business, the definition remains the same: return on investment. No matter the industry, leaders are concerned with ROI and ensuring that every dollar spent is used in the best interest of the organization. But in practice, what does ROI really mean? Let’s discuss!

Shhh... we have Private Incidents

We’re excited to announce that private incidents are now available on FireHydrant. For the first time, incidents can have visibility limited to only permissioned users are able to see. This is a great solution for security and compliance teams who need to collaborate with their engineering counterparts to resolve incidents. The nature of these incidents that these teams work on dramatically differs from operational incidents.

Uncovering the Importance of Mean Time Between Failures

In the IT world, application service providers (ASPs) build customer trust by ensuring the continuous, uninterrupted availability of their services and software. Service availability allows customers to operate normally and generate revenue without being directly impacted by their providers’ system failures. Though providers work to ensure system uptime, they are often challenged by unexpected technical issues that impact customer-facing systems.