Operations | Monitoring | ITSM | DevOps | Cloud

Why GovRAMP-authorized observability matters for state, local, and education IT teams

Building on our FedRAMP Moderate authorization and our “In Process” status for FedRAMP High, Datadog for Government is now "In Process" for GovRAMP High Authorization, giving agencies a unified observability platform that meets the toughest public-sector security bars.

PHP Monitoring Best Practices for Developers, DevOps, and SREs

In 2025, PHP still powers over 75% of the web from ecommerce platforms like Magento to CMSs like WordPress and Laravel-powered web apps. As user expectations rise and digital experiences become mission-critical, real-time PHP monitoring has moved from a luxury to a necessity. According to Statista, PHP continues to rank in the top 10 most-used programming languages globally. Despite the popularity of modern stacks, legacy and modern PHP coexist in thousands of production environments.

Enhanced monitoring of Amazon EKS with Elastic add-on capabilities

Easily enable Elastic add-on within the Amazon EKS Console for streamlined monitoring and quick data onboarding. Amazon Elastic Kubernetes Service (EKS) makes running Kubernetes on AWS simple and scalable. But as your workloads grow, so does the need for robust monitoring and observability. Enter Elastic Agent, a powerful, unified way to collect logs, metrics, and security data from your EKS clusters, all managed through Elastic Fleet.

Automate server restarts in SCOM with the Opslogix Autonomous Maintenance Mode Management Pack

Automate server restarts in SCOM with the Opslogix Autonomous Maintenance Mode Management Pack Server restarts are routine, but in SCOM they often result in unwanted alerts if not handled properly. The Opslogix Autonomous Maintenance Mode Management Pack addresses this by automatically managing maintenance mode during restarts, minimizing false alerts and improving operational efficiency.

Status Page Aggregator: Best Practices and Use Cases

A status page aggregator is a powerful tool that brings together the status updates of multiple cloud services, SaaS providers, and third-party services into a single, unified view. Whether you’re tracking the health of critical dependencies like AWS, Cloudflare, or niche SaaS tools your teams rely on, a status page aggregator simplifies monitoring and helps you stay ahead of outages.

When Will We See the First $1 Billion Company Run by a Single Individual?

It’s only a matter of time. OpenAI CEO Sam Altman said in 2024 that he thought this could be achieved by the end of 2026. Personally, I feel this is a little optimistic; however, based on the evidence I’ve seen, it won’t be long after that. Consider Telegram: a global messaging giant with just 30 employees, already achieving a remarkable $1 billion in revenue. Or Midjourney, revolutionizing creative industries with only 40 employees and generating an impressive $500 million.

Can Claude Code Observe Its Own Code?

One of the great things about OpenTelemetry is that it’s a standard, and standards tend to proliferate. I was excited to see Claude Code add OpenTelemetry metric and log support in a recent release. What was really interesting—beyond the ability to capture usage data from Claude Code—is that you can also get pretty detailed logs about what you’re doing with Claude Code.

The Dos and Don'ts of Successful Software Rollouts

Launching new enterprise software is one of the most strategic—but risk-laden—internal initiatives any organization can undertake. Done right, it accelerates transformation, streamlines operations, and boosts employee productivity. Done wrong, it can paralyze teams, spike IT tickets, and erode employee trust in the tools they’re given and the teams that support them.

Elephant Flows: The Hidden Heavyweights of AI Data Center Networks

Elephant flows are no longer rare. They’re foundational to AI workloads. In today’s GPU-heavy data centers, long-lived, high-volume flows can distort ECMP, overflow buffers, and rack up unexpected cloud bills. Kentik helps you see and tame these elephants with real-time flow analytics, automated alerting, and predictive capacity planning.

The Hidden Cost of Downtime: Why IT Leaders Are Prioritizing Resilient Operations

No business sets out to tolerate downtime. And yet, across industries, unexpected service disruptions continue to drain revenue, erode customer trust, and expose operational fragility. For CIOs and IT leaders, the real concern isn’t if systems will break, it’s whether your team can outpace the fallout. Because in a crisis, speed isn’t just an advantage it’s survival.