Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

The Coming Decentralization of Cloud

This quote resonates deeply when considering the pendulum swings in technology. We’ve seen boom-and-bust cycles with various trends, from blockchain to AI. Some trends have more staying power than others, but the pendulum swings one way, only to swing back—sometimes with a vengeance, correcting the overreach of the previous swing. One of the most significant pendulum swings of the last few decades was the shift to cloud computing.

Stop drowning in alerts: 12 DevOps alert management strategies that actually work

System outages cost businesses an average of $5,600 per minute, according to Gartner. That's over $300,000 per hour of downtime. But beyond the financial impact, downtime destroys customer trust, damages your reputation, and creates a backlog of urgent work for your already busy technical teams. The key to minimizing downtime? A robust DevOps alert management system that notifies you of issues before they become full-blown disasters.

The Critical Role of Observability in Healthcare IT

Healthcare organizations are increasingly leading the charge in technology adoption, rapidly deploying advanced applications and digital tools to improve patient outcomes and operational efficiency. However, this acceleration is placing unprecedented pressure on existing IT infrastructure. Teams are being asked to support next-generation workloads, such as AI-powered diagnostics and real-time data platforms, on legacy systems, often without the benefit of increased budget or headcount.

Opsgenie Is Sunsetting: What to Look for in an Alternative

Atlassian is retiring Opsgenie, and if you're one of the teams relying on it to manage on-call and incidents, you're facing a tough question: Do you make the forced migration to Jira Service Management or Compass, scramble for a lookalike tool — or use this moment to upgrade your entire approach to incident response? If you’re facing that decision, we get it. Changing tools midstream isn’t ideal (to say the least). But it’s also a rare opportunity to take a meaningful step forward.

Leveraging an IDP for Navigating Staff Changes: Onboarding and Layoffs

Change is constant in engineering organizations. Whether you’re growing quickly and onboarding dozens of engineers—or navigating the difficult process of layoffs—your systems, services, and institutional knowledge don’t pause. That’s where an Internal Developer Portal (IDP) becomes indispensable.

Comparing ELK, Grafana, and Prometheus for Observability

Monitoring and observability are cornerstones of modern infrastructure management. Three popular solutions that often come up in this space are the ELK Stack, Grafana, and Prometheus. This comparison breaks down the key differences, use cases, and integration capabilities to help you determine which tool or combination better suits your operational needs.

Metrics That Matter: Measuring Developer Productivity in the AI Era

In this episode, Ryan McDonald is joined by Mark Quigley, Head of Platform Engineering at Ninety.io, for a conversation that cuts through the noise around developer productivity metrics and AI. Mark dives deep into how teams can measure what matters—without falling into the trap of turning every measure into a target. He shares how tools like Developer NPS, DORA metrics, and balanced scorecards can help teams optimize for both output and well-being—but only when framed with the right intent.