Operations | Monitoring | ITSM | DevOps | Cloud

The riskiest thing you can do is not measure your risk

Hiring good engineers is important, but it’s not enough to prevent outages. You need to measure and track your risk to get real results. Full transcript:   My name's Jeff Nickoloff. I'm a principal engineer here at Gremlin.  What I hear non-technical functions talk about is really they are much happier to sort of lean on their great engineers. Oh, we've got a great engineering culture. "We don't have reliability issues because we hire the best people.".

AI-driven alert triage and root cause analysis (RCA) that proactively responds to production alerts

Watch AI transform alert management in real-time. This technical demonstration compares manual alert investigation with AI alert investigation. It shows how AI agents automatically investigate production alerts, correlate telemetry across distributed systems, and identify root cause, faster and with more insights than manual processes. Watch and learn how to shift your team from reactive firefighting to proactive system reliability management with agentic AI.

Navigating the Growth of Digital Infrastructure in Brazil with Carlos Eduardo Sedeh

What does it take to build a telecom network that actually listens? In this episode of Uplink, Carlos Eduardo Sedeh, CEO of SAMM (formerly Megatelecom), joins host Michael Reid to explore how a flat-fee dial-up service launched in 1999 laid the groundwork for a customer-first telecom strategy that continues to reshape Brazil’s enterprise connectivity landscape.

Getting closer to space with Canonical #ubuntu #space #shorts

@EuropeanSpaceAgency is scaling to support more missions than ever. Canonical makes it possible with open source infrastructure built for space. Watch the full video to see how we're helping ESA automate, scale, and future-proof its operations. Subscribe for more tech stories from space.