Operations | Monitoring | ITSM | DevOps | Cloud

AWS And Azure Outages Will Recur - Here's How You Ensure Resilience

The cloud has long promised limitless scalability and near-perfect uptime. But if you tried to access your Microsoft 365 dashboard or recline your smart bed last week, and got nothing but a spinning icon, you weren’t alone. In the span of 10 days, both Amazon Web Services (AWS) and Microsoft’s Azure Cloud suffered widespread outages that rippled across industries.

Uptrends x OpenTelemetry: Stream browser-level synthetic data into your observability stack

Dashboards and alerts can tell you something’s wrong, but they don’t immediately tell you why. A red indicator or synthetic test failure prompts detective work. You flip between dashboards, timestamps, and logs, trying to line up what the check saw with what the system did. Now imagine your monitoring could explain itself by sending traces directly into your OpenTelemetry (OTel) backend.

When Bots Grow Brains: RPA and Agentic AI For the Win

For a long time, robotic process automation (RPA) was the fastest way to scale repetitive digital work. Bots copied, clicked, and executed rule-based tasks faster than any human. They reduced error rates and delivered early wins for efficiency. Sounds just fine, right? Prepare for a Matrix moment, because the truth is that IT teams built RPA only for predictability. It could follow instructions, but it couldn’t adapt when something unexpected happened.

Maintaining Software Excellence in the Age of AI Coding Assistance

In this preview of his AWS re:Invent session, Cortex CTO & Co-Founder Ganesh Datta breaks down how AI coding assistants are transforming software development, and what high-performing teams are doing to keep speed and reliability in balance. You’ll learn: If you care about AI, engineering velocity, or building sustainable systems, this is a must-watch. Full Session: December 3 at 2:30 PM Learn more about Cortex: go.cortex.io/reinvent.

Strengthening Open Source Facter: Ensuring Compatibility and Essential Maintenance

Over the course of 2025, the Puppet Core team has been committed to developing secure, hardened Puppet code that our customers can rely on. As part of that shift, many Puppet platform components, including Facter, were brought under the Puppet Core model and were moved into private repositories.

Distributed Tracing for Microservices: 10 Essential Best Practices for 2026

Distributed tracing tracks how a single request moves across multiple microservices, helping teams see the entire execution path end to end. In modern architectures where dozens of services interact, it becomes difficult to understand where latency starts, why bottlenecks appear, and which component breaks under load. Traditional monitoring only shows isolated metrics. Distributed tracing connects those dots.