Test your AI model training reliability, too

Training is at the heart of every LLM, but it’s still an application running on infrastructure, which means it can fail. Our GPU test helps you check your training GPUs so you don’t lose that valuable work. TRANSCRIPT: One of the things we built recently was the GPU Gremlin. So if you are training a bunch of models and you're doing a bunch of GPU testing, we want to give you the tools to be able to go test that, to understand how training the model could fail.

Reduce alert noise with Site24x7's Event Correlation

Alert fatigue remains one of the most underestimated problems in IT operations. Srinivasa Raghavan, director of product management, explains how event correlation addresses it. Event correlation is the process of grouping related alerts from across your infrastructure into a single, contextual incident to reduce the volume of noise during an outage or service degradation. In this short clip, Srinivasa walks through how the feature functions and why high-volume alert environments make this kind of signal-to-noise reduction operationally significant.
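
To make the idea of grouping related alerts into one incident concrete, here is a minimal sketch in Python. It is not Site24x7's implementation: the `Alert` and `Incident` types, the choice of `service` as the correlation key, and the 300-second window are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Alert:
    source: str       # hypothetical: the host or monitor that raised the alert
    service: str      # hypothetical: the key used to decide which alerts are related
    timestamp: float  # seconds since some reference point


@dataclass
class Incident:
    service: str
    alerts: list = field(default_factory=list)


def correlate(alerts, window_seconds=300.0):
    """Group alerts for the same service into one incident when each alert
    arrives within window_seconds of the previous alert in that incident."""
    incidents = []
    open_by_service = {}
    for alert in sorted(alerts, key=lambda a: a.timestamp):
        incident = open_by_service.get(alert.service)
        gap_too_large = (
            incident is not None
            and alert.timestamp - incident.alerts[-1].timestamp > window_seconds
        )
        if incident is None or gap_too_large:
            # Start a new incident for this service.
            incident = Incident(service=alert.service)
            incidents.append(incident)
            open_by_service[alert.service] = incident
        incident.alerts.append(alert)
    return incidents


alerts = [
    Alert("db-1", "checkout", 0),
    Alert("cache-1", "search", 30),
    Alert("web-1", "checkout", 60),
    Alert("web-2", "checkout", 120),
    Alert("db-1", "checkout", 1000),
]
incidents = correlate(alerts)
print(len(alerts), "alerts grouped into", len(incidents), "incidents")
```

A real correlation engine would likely use richer signals than a single key and a fixed window, such as topology or dependency data, but the noise reduction comes from the same step: an operator sees a few incidents instead of every individual alert.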

GreenOps in Practice: What It Means and How to Get Ready for 2026

In this informational webinar, Freddie Booth, FinOps Consultant at Capgemini Invent, explains what GreenOps means in the context of modern cloud operations. You’ll learn why GreenOps is gaining attention across organizations using Azure, how it connects cost management with sustainability, and what steps teams can take today to start preparing for 2026. The session focuses on practical ways to improve cloud efficiency, reduce unnecessary Azure usage, and align FinOps practices with emerging sustainability goals.

Becoming an Azure Expert MSP

Recently, Wortell achieved the Microsoft Azure Expert MSP designation, a milestone that places them among a select group of managed service providers recognized for their Azure expertise and operational maturity. In this webinar, Alex Tilgenkamp (Azure Cloud Architect at Wortell) shares insights into what it takes to achieve this designation and what it means for organizations building and scaling their Azure managed services practice.

Claude Agent SDK Monitoring & Observability with OpenTelemetry and SigNoz

Learn how to implement monitoring and observability for the Claude Agent SDK using OpenTelemetry and SigNoz. In this video, we walk through instrumenting your Claude-based agents, capturing traces, metrics, and logs, and visualizing everything in SigNoz for real-time insights. You’ll learn how to debug agent behavior, identify latency bottlenecks, and monitor performance in production environments.

Why SSIS will never die - with Tim Mitchell

Steve is joined by business intelligence architect and author Tim Mitchell. They discuss why SSIS will never die, the general pros and cons of Integration Services, the evolution from XML to JSON, how AI can help with coding, and taekwondo – among other topics! Recorded on-site at PASS Data Community Summit 2025.