Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Top tips for smoother IT incident management

Top tips is a weekly column where we highlight what’s trending in the tech world and share ways to stay ahead. This week, we’re talking about something every IT team knows too well—incidents. Whether it’s a sudden server crash, a network outage, or a system slowdown right before an important client call, incidents always seem to strike at the worst possible time. No matter how strong your IT setup is, issues are bound to happen.

[Workshop] Fixing Your Frontend: Performance Monitoring Best Practices

​The holiday season is here. Is your frontend ready for the traffic spike, or are you preparing for a debugging nightmare? ​In this live, hands-on workshop, we'll dive into the best practices for modern error and performance monitoring in Sentry. In this live hands on session, we’ll cover: ​Instrumenting Sentry and alert rules to surface and fix critical errors fast ​Optimizing site performance using Web Vitals like TTFB and LCP.

Auvik Named a Leader Across G2's Fall 2025 Reports for Network Management

In G2’s Fall 2025 reports, Auvik earned top recognition as a leader in network management tools across small-business, mid-market, and enterprise categories. IT professionals rated Auvik highly for implementation, usability, results, relationship, and overall Grid® performance, reflecting one thing above all: real-world trust from the IT professionals who use Auvik every day.

DNS Outages Expose Hidden Risks. Edwin AI Finds Them Faster.

The recent AWS outage exposed how fragile the internet remains. Amazon traced the hours-long disruption to a DNS error—a small failure with massive reach. For most organizations, DNS operates quietly in the background. When it fails, every digital service connected to it stops. One of LogicMonitor’s valued customers, IG Group, faced a similar event less than ten hours after enabling Edwin AI.

How to Use the Power BI Desktop InfluxDB 3 ODBC Connector

The challenge of storing, processing, and alerting on your time series data is only part of the battle when it comes to deriving value from time-stamped data. While InfluxDB 3 addresses those hurdles with the database and Python processing engine, data analytics teams still need to be able to visualize their data and build dashboards to complete the time series story.

OpenTelemetry Spans Explained: Deconstructing Distributed Tracing

In a microservices architecture, a single user request can pass through multiple services before completing. When performance drops or an error occurs, tracing that journey is the only way to locate the source. Distributed tracing provides that visibility. At its core are OpenTelemetry Spans — units of work that capture what each service does during a request.

Why Your APM Needs Observability - Metrics, Logs, and Traces Explained

Modern software applications are increasingly complex. Microservices, cloud infrastructure, and distributed architectures make it challenging for developers, DevOps engineers, and SREs to maintain high performance and a seamless user experience. Traditional Application Performance Monitoring (APM) provides critical insights into how applications perform, but alone, it often leaves blind spots when it comes to diagnosing issues or understanding the full system behavior.

Meet Olly - The Coralogix AI Observability Agent (Demo)

Olly is Coralogix’s AI-native observability agent that makes observability data fast, accessible, and actionable—for everyone. Traditionally, teams have spent valuable time piecing together dashboards and writing queries to troubleshoot issues. Olly changes that by letting you ask real questions in natural language and delivering instant, intelligent answers from across your logs, metrics, and traces.