Operations | Monitoring | ITSM | DevOps | Cloud

Checkly and the Agentic Software Layer

November 24th, the Opus 4.5 release turned around the entire tech industry. This was the moment when agents became capable. Capable enough to write solid staff-level code. Capable enough to reason about alerts, investigate root causes much faster than most engineers, and set up the reliability layer faster. For me, this feels like an iPhone moment on steroids; the adoption of AI is accelerating much faster than any adoption curve I’ve seen over the past few decades.

The Future of Kafka and Steaming

Join Jeff Mery and Josep Prat as they discuss the future of Kafka and Streaming. In this deep dive, we break down the architectural shifts and hidden "taxes" currently hitting the data streaming ecosystem—and how to engineer your way out of them. In this video, you’ll see: The "Streaming Tax" Breakdown: A transparent look at how 3x replication, inter-AZ egress, and eCKU markups are inflating your TCO by up to 500%.

In a world built by code, design lives between the lines

Design is the art of solving problems; open source makes that visible. In this video, Open Source Designer Eriol Fox dives into the pragmatic world of design and usability within the FOSS ecosystem. We discuss how product designers and user researchers are driving long-term software sustainability through accessibility and smarter design.

How to Reduce MTTR with AI

The quick download: AI reduces MTTR by helping teams detect issues sooner, pinpoint root causes faster, and resolve incidents with less manual effort. IT downtime costs organizations an average of $9,000 per minute. AI-powered observability can cut incident resolution time by up to 70%. Here’s what it takes to get there. Every minute an incident goes unresolved, the meter is running.

Automate Your Monitoring and Incident Handling: How Agents Dominate the Checkly CLI

50% of Checkly's CLI users are already coding agents. We predict that agents will become dominant by the end of 2026. This video demonstrates an agentic workflow where an alert reports a broken Shopify store login flow, and Claude Code, using the installed Checkly Skill and the Checkly CLI, pulls monitoring results, identifies a Playwright test failure, investigates the codebase, finds and fixes a bug, and then updates a Checkly status page by creating an incident.

Introducing Bits AI Dev Agent for Code Security

As organizations adopt AI-assisted development and increase their release velocity, they are not only generating more code but also finding more vulnerabilities from static analysis. The traditional remediation workflow of manually triaging issues, creating tickets, and opening individual pull requests (PRs) cannot keep pace. Fixing tens of thousands of vulnerabilities one by one is not a viable remediation strategy.

Datadog achieves ISO 42001 certification for responsible AI

As AI-powered products and services become central to how organizations operate, the need for responsible AI governance has never been greater. Customers, partners, and regulators are seeking assurance that AI systems are built, managed, and monitored responsibly and effectively. Datadog is committed to the responsible use of AI, both in how we build our products and in how we help customers observe their AI workloads.

Monitor Nutanix clusters, hosts, and VMs with Datadog

Nutanix is a hyperconverged infrastructure (HCI) platform that combines compute, storage, and virtualization into a single software-defined stack. By collapsing traditional infrastructure tiers into one platform, Nutanix simplifies provisioning and operations for virtualized workloads. Clusters are managed through Prism Central, which provides visibility into health, performance, capacity, and operational activity across hosts and VMs.

Announcing the Next Chapter for Bitbucket Pipelines Runners

In December 2025, we announced our intention to introduce pricing for self-hosted runners so we could provide stronger support and keep investing in new features and ongoing improvements. You’ve told us that having a free option is important. As a result, we’re introducing a new operating model that lets you continue using self‑hosted runners for free with the option to upgrade to a paid premium runners tier as your needs grow.

Automating Employee Offboarding: Simplicity Just One Click Away

Employee offboarding looks simple from the outside. Someone gives notice, HR processes the paperwork, and IT handles the rest. In practice, "the rest" is where things get complicated. A single departure can touch dozens of systems, and the handoffs between them (between HR and IT, between tools, between teams) are exactly where access stays on longer than it should, steps get missed, and audit findings show up months later.