The Future of Incident Management: Your Blueprint for Operational Excellence

This is the first post in a series examining the requirements for achieving operational excellence. In today’s dynamic digital landscape, operational resilience is no longer optional; it’s essential. Organizations must proactively embrace solutions designed to meet tomorrow’s challenges, not just today’s demands. Everbridge xMatters emerges as the clear leader in this space, delivering unmatched automation, sophisticated intelligence, and exceptional adaptability.

CI/CD Observability with OpenTelemetry - A Step by Step Guide

In the fast-paced world of CI/CD, understanding the performance and behaviour of your pipelines is crucial. GitHub Actions has become a popular choice for automating builds and deployments, but anyone who's debugged a flaky workflow or long-running job knows how challenging it can be to get visibility into what's happening under the hood. We usually rely on build logs, timing data, or guesswork when something goes wrong.

Multi-Stage Malware Attack on PyPI: Malicious Package Threatens Chimera Sandbox Users

Open-source package repositories like the Python Package Index (PyPI) play a crucial role in software development. However, these platforms are also potential targets for malicious actors attempting to exploit application software vulnerabilities. The JFrog Security Research team regularly monitors open source software repositories using advanced automated tools, in order to detect malicious packages.

Built for Impact: What Happens When LogicMonitor Edwin AI Meets Infosys AIOps Insights

Today’s IT environments span legacy infrastructure, multiple cloud platforms, and edge systems—each producing fragmented data, inconsistent signals, and hidden points of failure. This scale brings opportunity, but also operational strain: fragmented visibility, overwhelming alert noise, and slower time to resolution. With good reason, public and private sector organizations alike are moving beyond basic visibility, demanding hybrid observability that’s context-aware and action-oriented.

(Full Episode) IT Horror Stories: Point of No Rollback Ep8 S1

When a huge cloud platform migration stalls out mid-flight with no hope of changing course, panic lunges to take the wheel. In this episode, Mahesh Guruswamy, CTO at Kickstarter and author of How to Deliver Bad News and Get Away with It, recounts the night he took down production and had no other choice but to push through. Tune in for the blow-by-blow, from anxiously watching the clock as queries time out to the tightrope walk of breaking bad news without breaking trust. You'll leave with the brutally simple playbook Mahesh now swears by to make sure your next migration doesn’t become an IT horror sequel.

The Mindset Shift: IT Operations to Security - SolarWinds TechPod 099

In this episode, hosts Sean Sebring and Chrystal Taylor engage with actual rock star Chris Greer, a Security Engineering Manager at SolarWinds, to explore the multifaceted world of cybersecurity. Chris shares his unconventional journey from being a musician to entering the IT field, emphasizing the importance of certifications and the mindset shift required when transitioning from IT operations to security.

DASH by Datadog 2025 Keynote

Join us at the 2025 DASH Keynote and be the first to experience Datadog's latest product innovations. This year, we're unveiling next-generation observability features, innovative ways to secure your AI workloads, and powerful agentic AI capabilities throughout the Datadog platform. Discover the new ways your teams can observe, secure, and act in the age of AI.

#045 - Beyond Cluster Creation: Mastering Multi-Cluster Kubernetes with Gianluca Mardente (Cisco)

Join Itiel as he chats with Gianluca Mardente, a Principal Engineer at Cisco Systems. Gianluca shares his path to tech and Kubernetes, including his work history and the inspiration behind his open-source project, Sveltos. They dive into the significant challenges of managing a large fleet of Kubernetes clusters – ensuring consistency, handling upgrades, and coordinating resources across different clusters.