Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Kubernetes monitoring 101: Best practices to kickstart your journey

Use this guide to help you build a solid observability foundation without getting overwhelmed and get started with the best practices for a practical Kubernetes management. Starting your Kubernetes journey can feel like diving into the deep end; with hundreds of metrics, endless logs, and a growing list of tools, it's easy to lose focus. But here's the good news: you don't need to monitor everything from day one. Instead, start small.

From Logs to Insights: Accelerate Customer-Impact Analysis with Datadog Sheets

Datadog Sheets helps you move from log exploration to actionable insights quickly and with no code required. In this demo, see how to enrich logs with Salesforce data, build pivot tables, uncover customer impact trends, and build shareable reporting, all within Datadog.

Datadog Feature Flags, track Claude costs, migrate historical logs, and more | This Month in Datadog

See how you can reduce risk during feature rollouts in September’s This Month in Datadog. This episode, we spotlight Datadog Feature Flags, which combines advanced targeting with built-in observability, and guardrails to make rollouts safer and more controlled. Plus, we cover: This Month in Datadog brings you the latest updates on our newest product features, announcements, resources, and events.

Your infrastructure Is more distributed than you think.

An eCommerce platform, a banking app, even a simple user portal depends on a web of APIs, cloud tools, hosting services, and edge networks. Each one introduces another potential point of failure. And when those dependencies break? User experience suffers. Brand trust takes a hit. Millions in revenue are at risk. That’s why leading digital businesses, especially in eCommerce and banking, are expanding visibility beyond the application stack.

Resolve website transaction bottlenecks faster with Step Summary and Step Performance Reports

Ever wondered why some steps on your website feel slower than others? In this video, we’ll show you how to spot slow logins, delayed checkouts, and page load issues, and how to pinpoint their causes so you can fix them fast using the Step Summary and Step Performance reports. You’ll learn how to access these reports, what insights they provide, and how they help you quickly pinpoint performance bottlenecks to ensure a seamless user experience.

What Is RabbitMQ And How Do You Manage It With Kubernetes?

The world of Kubernetes and RabbitMQ evolves rapidly. Our popular 2022 post laid the groundwork for HA deployments; now, join us for the crucial 2025 update to ensure your architecture remains cutting-edge. As organizations continue their powerful shift from monolithic architecture (where all the code building the application exists as a single, monolithic entity) to microservices architecture.

How to Boost Revenue and Cut Network Spending with Kentik Traffic Costs

Network operators across the digital ecosystem are under pressure to cut costs while protecting revenue. This post explores three practical use cases where Kentik Traffic Costs helps turn traffic insight into commercial intelligence that helps teams negotiate smarter, protect margins, and boost profitability.

The Compliance Shortcut: Automation as the New Operating System for Resilience

For years, compliance has been synonymous with checklists, manual reporting, and time-consuming audits. That definition no longer holds. In our September 2025 webinar, Patrick Hubbard, Technical Marketing Director, led a conversation with JB Baker, Vice President of Product Engineering, and Marc Jensen, Channel Sales Engineer. Together, they showed how automation is transforming compliance into something far more strategic: the foundation of modern resilience.