
The strategic art of build vs. buy in software delivery ft. Tara Hernandez of MongoDB

Rob Zuber sits down with Tara Hernandez, VP of Developer Productivity at MongoDB and former Netscape engineer who helped create early continuous integration systems, to explore strategic frameworks for build vs. buy decisions in modern software delivery.

Librato on Heroku is Going Away and Hosted Graphite Is the Better Next Step

Librato (a SolarWinds product) is being sunset in summer 2025, and that directly affects Heroku teams who’ve relied on the Librato add-on for “good enough” visibility into dynos, routers, and Postgres. If you’re in that group, you’ll need a replacement monitoring add-on that keeps you covered on Heroku and lets you grow beyond it without re-architecting how you ship metrics.

What Is a Rapid Response Team (RRT) in Hospitals? Why Do They Matter?

Imagine you’re working on a hospital floor when suddenly a patient’s condition starts to deteriorate. What happens next can mean the difference between life and death. That’s where a Rapid Response Team (RRT) steps in: a specially trained group of healthcare professionals who respond quickly to patients showing early signs of crisis to prevent emergencies like cardiac arrest or respiratory failure. But how common are these teams? What do they really do day-to-day?

Abusing the Target Revision Field for Argo CD Promotions

In our big guide on how to use ApplicationSets for Argo CD applications, we explained the best practice of having a 3-level structure for all manifests with a clear distinction between Argo CD Application files and Kubernetes resource files. In that article, we also outlined several anti-patterns that we have seen in the wild, meaning questionable practices that might seem OK at first glance but are problematic in the long run, both for developers and for Argo CD operators.

Securing the Invisible: Why Ambient AI Needs Next-Gen Security

If, like me, you’re continuously striving to keep pace with the ever-evolving world of artificial intelligence, you’re probably hearing a lot about how Ambient AI is poised to dominate discussions and developments throughout the second half of 2025. Ambient AI refers to artificial intelligence systems that operate unobtrusively in the background of our daily environments, constantly sensing, analyzing, and responding to various inputs without explicit human interaction.

Bridging the Gap: 3 Practical Strategies to Align Security and Operations in DevOps

The gap between security operations and IT operations poses significant risk. It’s increasingly clear that DevOps leaders, IT managers, and enterprise teams face an uphill battle to manage growing threat complexity, endless patches, and compliance requirements while operating in silos. Bridging this gap is essential to effectively manage risks and enhance operational efficiency.

PagerDuty Named a Leader and Outperformer in the 2025 GigaOm Radar for AIOps

There’s no shortage of hype around AI in operations, but recognition from a trusted source like GigaOm cuts through the noise. We are excited to share that PagerDuty earned a top spot as a Leader and Outperformer in the 2025 report. It’s recognition that reflects the progress we’ve made in delivering an AI-powered platform that actually helps teams move faster, reduce costs, and operate with confidence in complex environments.

Applying AI/ML in Observability - Tech Talk #7

Ready to master anomaly detection? Join us for Part 2 of our "Applying AI/ML in Observability" series, where we do a deep dive into vmanomaly! In this live stream, Mathis and Marc will be joined by a very special guest: Fred Navruzov, the lead developer and mastermind behind VictoriaMetrics' vmanomaly. If you want to move beyond the basics and unlock the full potential of AI-driven observability, this is a session you can't afford to miss.

Automated Seer in Under 2 Minutes

What if you had 5 errors, and instead of coming back to 5 issues in your feed, you got 5 pull requests fixing them? Seer is Sentry's new AI debugging agent. It stitches together all the context from your logs, stack traces, distributed tracing, codebase, and issues to figure out what broke, where, and how to fix it. Seer automation lets you automate that flow and end up with a nice PR waiting for you to merge if it looks good. Check it out!