Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Make Your Engineering Processes Resilient. Not Your Opinions About AI

Why strong reviews, accountability, and monitoring matter more in an AI-assisted world Artificial intelligence has become the latest fault line in software development. For some teams, it’s an obvious productivity multiplier. For others, it’s viewed with suspicion. A source of low-quality code, unreviewable pull requests, and latent production risk. One concern we hear frequently goes something like this: It’s an understandable fear; and also the wrong conclusion.

How to prevent outdated server inventory risks with efficient server monitoring

At any point in time, your IT teams are constantly working on performance monitoring, security patching, scaling, and related activities. Most teams overlook one critical pillar: a reliable and up-to-date server inventory. Why did we emphasize the phrase "reliable and up-to-date"? Because there are still teams using a spreadsheet that was last updated years ago when a server inventory report is requested. What follows when you do not maintain an updated server inventory repository is.

Building Operational Resilience for the Year Ahead with Teneo's Digital Employee Experience (DEX)

As we step into a new year, one truth stands firm in financial services: resilience isn’t optional – it’s expected. Markets fluctuate, regulations evolve, and technology accelerates. Amid this complexity, IT leaders carry the responsibility of ensuring that operations don’t just survive disruption, they thrive through it.

Implement dbt data quality checks with dbt-expectations

dbt is one of the most popular solutions for data transformations and modeling. Many commercial data pipelines rely on dozens, or even hundreds, of individual dbt jobs. Data engineers, data platform engineers, and analytics engineers who own these pipelines need to maintain a testing framework to prevent mistakes in data processing that can compromise analysis.

Grafana Tempo: vParquet5 is coming soon (January 2026 Community Call)

vParquet5 is coming soon, learn about all the improvements and how to use them Have questions? Please bring them! Can't comment in the chat? You may need to create a channel -- you can do this by clicking your photo in the top right corner. Grafana Cloud is the easiest way to get started with Grafana dashboards, metrics, logs, traces, and profiles. Our forever-free tier includes access to 10k metrics, 50GB logs, 50GB traces and more.

Easiest Way to Connect InfluxDB to a Grafana Data Source

InfluxDB is a widely used time-series database designed for storing and querying metrics, events, and telemetry data. It’s commonly used for infrastructure monitoring, application instrumentation, and IoT-style workloads where time-based data is central. In many environments, InfluxDB already exists as part of the monitoring or data collection pipeline, and the primary need is simply to visualize that data effectively.

Build custom apps in seconds with conversational AI in App Builder

Using a drag-and-drop interface, engineering teams can create apps that support troubleshooting, improve day-to-day operations, and offer self-service access without leaving Datadog. With the new conversational AI feature, teams can turn an idea into a working app in seconds. Watch the video to see how it works..

Fleet Management: Manage your telemetry collectors at scale

In this video, we introduce Fleet Management and how it helps teams control their telemetry estate as it scales. See how you can centrally manage collectors and agents, standardize configurations across environments, and roll out updates confidently, reducing operational effort and risk.

Trace-connected structured logging with LogTape and Sentry

As our applications grow from simple side projects into complex distributed systems with many users, the “old way” of console.log debugging isn’t going to hold up. To build truly observable systems, we have to transition from simple text logs to structured, queryable, trace-connected events.