Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Elastic and Microsoft partnership achievements in 2025

Highlights of another successful year of customer-centric collaboration. Once again, our partnership delivered an impressive year of innovation with Microsoft Azure, Azure AI Foundry, and Azure OpenAI. This blog highlights our continued collaboration with Microsoft to better serve customers throughout 2025 and our key moments at Microsoft Ignite.

How Aerospace Companies Use InfluxDB

Over the past two decades, we’ve witnessed the instrumentation of virtually everything in the aerospace industry, from manufacturing floors to satellites orbiting Earth. And it’s no longer just NASA and other government organizations leading the charge. The commercial space industry has grown exponentially, with private companies developing everything from GPS satellites to electric VTOL aircraft.

Grafana Tempo: Upcoming 2.10/3.0 Releases (Community Call December 2025)

Upcoming 2.10/3.0 Releases. New maintainer, Oleg. Have questions? Please bring them! Can't comment in the chat? You may need to create a channel; you can do this by clicking your photo in the top right corner. Grafana Cloud is the easiest way to get started with Grafana dashboards, metrics, logs, traces, and profiles. Our forever-free tier includes access to 10k metrics, 50GB logs, 50GB traces and more.

Why AI agents need a common data model #ai #telemetry

Clint Sharp explains why a common model like OCSF is critical for the future of AI. Agents need standardized data to analyze information effectively on your behalf. He contrasts the traditional manual workflow of checking Slack, tickets, and wikis while asking colleagues with a future where AI fuses this human context with machine data. Instead of just search results, AI agents will hand you examined hypotheses so you know exactly where to take your investigation.

How AI Agents automate incident response #ai #cybersecurity #telemetry

Clint Sharp demonstrates how Cribl Search leverages AI to streamline incident investigation. Starting from a Slack channel, the AI builds an interactive notebook, analyzes order processing logs, and identifies suspicious traffic spikes. It connects high CPU usage to a recent Jenkins deployment, hypothesizing a supply chain attack, and ultimately recommends a rollback. This isn't a far-off concept. It is the future of operations arriving right now.

Let's Encrypt 45-Day Certificate Expiration: Monitoring & More

The move by Let’s Encrypt from 90-day certificates to 45-day certificates is more than a policy shift. It changes how teams must manage renewals, detect failures, and validate that certificates are deployed consistently across distributed systems. A shorter lifecycle compresses the margin of error. Automation that previously limped along unnoticed now breaks on a far tighter schedule. And every misconfiguration hits users faster.
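As a rough illustration of how a tighter lifecycle narrows the margin for error, the sketch below classifies a certificate by how many days remain before it expires. The function name `renewal_status` and the thresholds (`renew_window_days=15`, `critical_days=7`) are assumptions chosen for this example, not values taken from Let's Encrypt or from the article.

```python
from datetime import datetime, timedelta, timezone


def renewal_status(not_after, now, renew_window_days=15, critical_days=7):
    """Classify a certificate by the time left before it expires.

    not_after and now are timezone-aware datetimes. Both thresholds are
    illustrative assumptions and should be tuned per deployment.
    """
    remaining_days = (not_after - now).total_seconds() / 86400
    if remaining_days <= 0:
        return "expired"
    if remaining_days <= critical_days:
        return "critical"
    if remaining_days <= renew_window_days:
        # Automation should already have renewed by this point.
        return "renewal_due"
    return "ok"


# Hypothetical example: a certificate with 30 days left on a fixed clock.
now = datetime(2026, 1, 1, tzinfo=timezone.utc)
print(renewal_status(now + timedelta(days=30), now))
```

In practice the `not_after` value would come from the certificate actually served by each endpoint, so the same check also reveals hosts that are still serving an old certificate after a renewal.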

How to Handle Cloud Monitoring Overload?

Reduce alert noise by 70% through intelligent aggregation, clear ownership boundaries, and filtering metrics that don't map to user-facing issues. Monitoring starts with a straightforward goal: understand your system's health and identify issues before users notice them. You set up metrics, create dashboards, and configure some alerts. At first, it works well. Over time, your stack gets bigger and more complicated. New services get added.
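One possible reading of aggregation with ownership boundaries and filtering is sketched below: alerts whose metric is not on a user-facing allowlist are dropped, and the rest are grouped per owner and service. The alert fields (`owner`, `service`, `metric`) and the function name `aggregate_alerts` are hypothetical, and the sketch makes no claim about the noise reduction figure quoted above.

```python
from collections import defaultdict


def aggregate_alerts(alerts, user_facing_metrics):
    """Group alerts by (owner, service), keeping only user-facing metrics.

    alerts is a list of dicts with hypothetical keys
    'owner', 'service', and 'metric'.
    """
    grouped = defaultdict(list)
    for alert in alerts:
        if alert["metric"] not in user_facing_metrics:
            continue  # filtered: does not map to a user-facing issue
        grouped[(alert["owner"], alert["service"])].append(alert)

    # One summary per owner/service pair instead of one page per alert.
    return {
        key: {
            "count": len(items),
            "metrics": sorted({a["metric"] for a in items}),
        }
        for key, items in grouped.items()
    }


# Hypothetical sample data for illustration only.
sample_alerts = [
    {"owner": "payments", "service": "checkout", "metric": "error_rate"},
    {"owner": "payments", "service": "checkout", "metric": "latency_p99"},
    {"owner": "payments", "service": "checkout", "metric": "error_rate"},
    {"owner": "platform", "service": "cache", "metric": "gc_pause"},
]
summary = aggregate_alerts(sample_alerts, {"error_rate", "latency_p99"})
```

Here four raw alerts collapse into a single summary owned by one team, and the `gc_pause` alert is dropped because it is not on the allowlist.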

Obkio 2025 Year in Review

2025 was big! This year, we stopped talking about what Obkio could be and started showing what it is: a full network observability platform built for the networks you actually run. We released features that solve real problems. We showed up where network pros gather. And we proved that a Canadian-built tool can compete with anyone. Here's what happened.

Microsoft Teams outage - December 10th, 2025

On the morning of December 10, 2025, Microsoft Teams experienced a service disruption affecting users across Australia. Although Microsoft 365 users reported issues across several apps, the hardest-hit service was Microsoft Teams, which became completely unusable for many organizations. While Microsoft did not acknowledge the incident until 03:46 UTC, StatusGator identified the issue at 02:52 UTC through incoming outage reports and delivered an Early Warning Signal at 03:01 UTC.