Bridging partners in pursuit of agentic AI - Part 1: Why partnerships matter for enterprise intelligence

The pace of change in AI development has been dizzying. In just a few years, we’ve moved from experimenting with AI, machine learning (ML), retrieval augmented generation (RAG), and agents to asking how these innovations can solve real business problems. Enterprises are no longer impressed by the novelty and possibilities; instead, they expect outcomes.

Navigating the Database Ecosystem in 2025

In 2025, the database ecosystem is more diverse and interconnected than ever before. From AI-assisted natural language queries that analyze your data to open table formats that make it easy to bridge systems, data infrastructure is moving towards openness, intelligence, and composability. Modern databases are no longer isolated systems; they are part of a broader ecosystem where interoperability is as important as performance.

RED Metrics & Monitoring: Using Rate, Errors, and Duration

The RED method is a streamlined approach for monitoring microservices and other request-driven applications, focusing on three critical metrics: Rate, Errors, and Duration. Derived from Google's "Four Golden Signals," the RED monitoring framework offers a pragmatic and user-centric perspective on service assurance and service performance.
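As a rough illustration of the three metrics, here is a minimal Python sketch that derives Rate, Errors, and Duration from a batch of request records. The `Request` record, the `red_metrics` helper, the choice of HTTP 5xx as the error condition, and the nearest-rank percentile are all assumptions made for this example; they are not taken from the article or from any particular monitoring tool.

```python
import math
from dataclasses import dataclass


@dataclass
class Request:
    """Hypothetical record of one handled request."""
    duration_ms: float  # time taken to serve the request
    status: int         # HTTP status code returned


def percentile(sorted_values, p):
    """Nearest-rank percentile of an already sorted, non-empty list."""
    rank = math.ceil(p / 100 * len(sorted_values))
    return sorted_values[max(rank, 1) - 1]


def red_metrics(requests, window_seconds):
    """Summarize a window of requests as Rate, Errors, and Duration."""
    total = len(requests)
    if total == 0:
        return {"rate": 0.0, "error_ratio": 0.0, "p50_ms": None, "p95_ms": None}

    # Assumption: a server-side failure (5xx) counts as an error.
    errors = sum(1 for r in requests if r.status >= 500)
    durations = sorted(r.duration_ms for r in requests)

    return {
        "rate": total / window_seconds,      # Rate: requests per second
        "error_ratio": errors / total,       # Errors: share of failed requests
        "p50_ms": percentile(durations, 50), # Duration: median latency
        "p95_ms": percentile(durations, 95), # Duration: tail latency
    }


sample = [
    Request(duration_ms=10, status=200),
    Request(duration_ms=20, status=200),
    Request(duration_ms=30, status=500),
    Request(duration_ms=40, status=200),
]
summary = red_metrics(sample, window_seconds=2)
```

In practice these values would usually come from counters and histograms exported by the service rather than from raw request lists, but the arithmetic is the same.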

Application Performance Monitoring (APM) Guide: Monitor and Optimize Application Performance

Every millisecond your application takes to respond can decide whether a user stays or leaves. But here’s the catch: you can’t improve what you can’t see. Behind every slow page load, failed API call, or random spike in latency lies a story your application is trying to tell. Application Performance Monitoring (APM) is how you listen to that story.

10 Best Log Monitoring Tools

Log monitoring stands as the backbone of resilient, secure, and high-performing digital operations. Every digital service, application, cloud platform, and network device leaves behind a trail of log files, containing raw, unstructured data that chronicles system events, user actions, errors, security activities, and business transactions. For organizations striving to achieve operational excellence, these logs are more than archives; they're the heartbeat of every mission-critical system.

Microsoft Teams Troubleshooting for Teams Performance and Connection Issues

How many times has this happened? You're on a Microsoft Teams call, and your call disconnects, lags, or freezes, so you go to Google to look up how to solve the problem. Well, look no further! If you're using Microsoft Teams, there are proven ways to troubleshoot those pesky performance and connection issues that are putting a damper on your team's collaboration.

Show me the (meeting) money: How to monitor the real-time costs of a meeting in Grafana

This meeting could’ve been an email. It’s a phrase most of us have said (or at least thought) at some point in our careers. For me, that realization hit years ago while working for a government organization. I’d frequently sit through long, agendaless meetings that seemingly went nowhere. I wasn’t sure why I was there. And because I’m an engineer at heart, I started to wonder: what were these meetings actually costing the organization?

The Hidden Barrier to Network Automation Isn't Your AI - It's Your Data

For years, the promise of AI-driven network automation has loomed large. Vendors and analysts alike have painted a future where autonomous operations handle outages before they happen, root causes are explained instantly, and teams finally escape the endless cycle of alerts, tickets, and manual troubleshooting. But in practice, most automation initiatives stall long before they reach that vision.

Ingest OTLP metrics directly into Datadog with the new OTLP Metrics API

Many organizations rely on OpenTelemetry (OTel) to standardize observability across distributed systems. These organizations are at varying stages of adoption and are implementing OTel in complex environments with diverse configurations. To support this range of use cases, Datadog offers many ways to use OpenTelemetry with Datadog.