Operations | Monitoring | ITSM | DevOps | Cloud

Top 5 outages detected by StatusGator in June 2025

June 2025 saw several high-impact outages across popular cloud services — from infrastructure giants like Google Cloud to developer platforms like Supabase and Heroku. For IT teams, MSPs, and developers, even short service disruptions can have ripple effects across workflows and customer experience. At StatusGator, we continuously monitor thousands of services to detect issues in real time — often before they’re publicly acknowledged.

Pepperdata Helps Karpenter Work Better

Running Kubernetes on AWS? You're probably using Karpenter, the open-source autoscaler that dynamically provisions new instances as your EKS workloads grow. Karpenter launches rightsized instances in real time in response to pending pods, based on available instance types and the resources applications need. It also terminates underutilized nodes to reduce costs.

Faster incident response through distributed tracing: Inside Glovo's use of Traces Drilldown

It’s almost 1 p.m. on a Monday afternoon and you’re hungry. You pull up your meal delivery app and select your favorite restaurant and dish. Then you go to check out and nothing happens. Your frustration mounts as you get hungrier by the minute. But there’s frustration on the other side of that transaction as well—engineers are scrambling to figure out what’s wrong as orders drop and revenue losses rise.

Why GovRAMP-authorized observability matters for state, local, and education IT teams

Building on our FedRAMP Moderate authorization and our “In Process” status for FedRAMP High, Datadog for Government is now "In Process" for GovRAMP High Authorization, giving agencies a unified observability platform that meets the toughest public-sector security bars.

What Impacts GKE Pricing? A Guide To Kubernetes Spending

Google Cloud released Google Kubernetes Engine (GKE) as a commercial version of native Kubernetes (K8s). GKE promises a user-friendly, reliable, and cost-effective service. Yet calculating GKE costs can be daunting, including understanding what you’re paying for and maximizing your return on investment. In this GKE pricing guide, we’ll discuss how GKE pricing works, what it costs, and more.

PHP Monitoring Best Practices for Developers, DevOps, and SREs

In 2025, PHP still powers over 75% of the web from ecommerce platforms like Magento to CMSs like WordPress and Laravel-powered web apps. As user expectations rise and digital experiences become mission-critical, real-time PHP monitoring has moved from a luxury to a necessity. According to Statista, PHP continues to rank in the top 10 most-used programming languages globally. Despite the popularity of modern stacks, legacy and modern PHP coexist in thousands of production environments.

Perform Distributed Tracing for your MCP system with OpenTelemetry

2025 has truly been the year of Agentic AI, with MCP (Model Context Protocol) emerging as one of its flashy and most talked-about innovations. While many products have seamlessly integrated MCP servers into their systems, these servers are increasingly being labelled as black boxes, opaque components that handle critical tasks but offer little visibility into what’s happening under the hood. We prompt an agent, a tool gets invoked, and a response is generated. But what really happens in between? And when something breaks, how do we trace the failure and debug it effectively?