Latest News

Claude Code + OpenTelemetry: Per-Session Cost and Token Tracking

Feb 25, 2026 By Adnan Rahic In ObservIQ

I was looking at our Claude Code spend in the Anthropic console the other day. Aggregate cost, aggregate tokens — no breakdown by developer, no breakdown by session. I knew my Hackathon team had been using it heavily on building out new features for the OpenTelemetry Distro Builder. But heavily how? I had no idea. Turns out Claude Code has been emitting OpenTelemetry signals the whole time. Per-session cost, token counts, every tool call it makes on your codebase.

Read Post

ObservIQ

Read more about Claude Code + OpenTelemetry: Per-Session Cost and Token Tracking

Digital Employee Experience Is Now Core to IT - Recognized by Analysts, Reinforced by Customers

Feb 25, 2026 By Paul Gentile In Nexthink

Over the past few years, Digital Employee Experience (DEX) has moved from emerging concept to essential capability for modern IT organizations. The conversation has changed. IT is no longer measured only by system uptime or ticket resolution. Today, success is defined by how technology actually performs for employees — and how consistently organizations can deliver productive, friction-free digital work.

Read Post

Nexthink

Read more about Digital Employee Experience Is Now Core to IT - Recognized by Analysts, Reinforced by Customers

Incident Report: Exercises, Cleanups, and Evacuations

Feb 25, 2026 By Fred Hebert In Honeycomb

Every year, Honeycomb runs disaster recovery scenarios in multiple environments, including in production. Although each of our instances runs in a single region, on at least three Availability Zones (AZs), we have multiple plans for partial regional failures, and particularly, zonal failures. One of these tests was run on December 5th, and after its successful completion came its cleanup steps.

Read Post

Honeycomb

Read more about Incident Report: Exercises, Cleanups, and Evacuations

Alerting Is a Socio-Technical System

Feb 25, 2026 By James Barnes In StatusCake

In the previous posts, we’ve looked at how alert noise emerges from design decisions, why notification lists fail to create accountability, and why alerts only work when they’re designed around a clear outcome. Taken together, these ideas point to a broader conclusion. That alerting is not just a technical system, it’s a socio-technical one. Alerting systems encode assumptions about how people behave, how responsibility is distributed, and how decisions are made under pressure.

Read Post

StatusCake

Read more about Alerting Is a Socio-Technical System

Case Study - Troubleshooting Storage Failures in a VMware ESXi Infrastructure

Feb 25, 2026 By Karthik G In eG Innovations

IT problems happen even in the best architected infrastructure due to configuration changes, failures, upgrades and such. How quickly and effectively you can detect and resolve such problems dictates how efficient your IT operation is. Today, I’ll cover how eG Enterprise helped us troubleshoot a hardware failure (a storage battery failure) that that caused a cascade of failures in a VMware ESXi infrastructure.

Read Post

eG Innovations

Read more about Case Study - Troubleshooting Storage Failures in a VMware ESXi Infrastructure

Notes from the Field: XenServer falling back to file-based licensing when using LAS

Feb 24, 2026 By GripMatix In GripMatix

Citrix has been transitioning products toward License Access Service (LAS) as the modern licensing method. Unlike traditional file-based licensing, LAS introduces service-based communication between products and the Citrix License Server. As of 15 April 2026, LAS becomes the mandatory licensing method for supported products. Environments still relying on file-based licensing will need to transition before that date.

Read Post

GripMatix

Read more about Notes from the Field: XenServer falling back to file-based licensing when using LAS

Microsoft SCOM Tips & Tricks

Feb 24, 2026 By NiCE IT Mgmt In NiCE IT Mgmt

This one is for all the Microsoft SCOM geeks out there — 99 practical tips & tricks to make managing SCOM way easier. The tips compiled here draw from community experts, SCOM-focused blogs, Microsoft’s official documentation, and the hands-on experience at NiCE. You may already know some of them, but having them all organized in one place makes it easy to reference and put them into practice.

Read Post

NiCE IT Mgmt

Read more about Microsoft SCOM Tips & Tricks

API update: Service status page ratings now available

Feb 24, 2026 By Valeria Kurolapova In StatusGator

We’ve added service status page ratings to API v3. You can now access the same letter grades, descriptions, and average acknowledgment delay metrics that appear on StatusGator service pages – directly from the API.

Read Post

StatusGator

Read more about API update: Service status page ratings now available

Reinventing the Incident Responder's Day: Empowering Tier 2 SOC Analysts with Splunk's Agentic SOC Platform

Feb 24, 2026 By Milena Chen In Splunk

The Tier 2 SOC Analyst or the Incident Responder (often hailed as the "Sherlock Holmes of the network") faces an increasingly complex and relentless digital landscape. In a world where analysts are being overwhelmed by alerts, held back by fragmented, manual tooling and inefficient workflows, incident responders are charged with the critical task of identifying, analyzing, and mitigating security threats.

Read Post

Splunk

Read more about Reinventing the Incident Responder's Day: Empowering Tier 2 SOC Analysts with Splunk's Agentic SOC Platform

The Grafana Cloud identity blueprint: balancing security and scale

Feb 24, 2026 By Sarah Constant In Grafana

If you've ever rolled out Grafana Cloud to a growing engineering organization, this pattern may sound familiar: Everything feels simple at first. You invite a few teammates, give them access, and dashboards start appearing. Then the team grows. Then the number of stacks grows. Over time, a model that once felt fast and empowering starts to feel risky, difficult to understand, and even harder to undo. This post is about avoiding that moment.

Read Post

Grafana

Read more about The Grafana Cloud identity blueprint: balancing security and scale

Operations | Monitoring | ITSM | DevOps | Cloud

Claude Code + OpenTelemetry: Per-Session Cost and Token Tracking

Digital Employee Experience Is Now Core to IT - Recognized by Analysts, Reinforced by Customers

Incident Report: Exercises, Cleanups, and Evacuations

Alerting Is a Socio-Technical System

Case Study - Troubleshooting Storage Failures in a VMware ESXi Infrastructure

Notes from the Field: XenServer falling back to file-based licensing when using LAS

Microsoft SCOM Tips & Tricks

API update: Service status page ratings now available

Reinventing the Incident Responder's Day: Empowering Tier 2 SOC Analysts with Splunk's Agentic SOC Platform

The Grafana Cloud identity blueprint: balancing security and scale

Monthly Archive

Follow Us