Operations | Monitoring | ITSM | DevOps | Cloud

Reliability is when customers aren't impacted

Ultimately, a system is reliable when customers and engineers can count on it. Full transcript:  When I get to hear stories like, "Hey, we just had our holiday sales event kick off and everything went smoothly and I didn't have to wake up in the middle of the night." That is really the true definition of reliability these people that are constantly hands-on keyboard in charge of making sure that people like myself and like you aren't impacted when we're going to, for example, buy a new pair of sneakers, or we're going to get some sort of limited edition release that's coming out, right?

Real-Time Analytics Made Simple with Kafka and Iceberg

AIVEN DATA PLATFORM The Aiven Platform is more than a collection of open source services for streaming, storing and analyzing data. The platform ensures that all services run reliably and securely in the clouds of your choice, are observable, and can easily be integrated with each other and with external 3rd party tools.

Cortex MCP set up

Learn how to set up the Cortex MCP in under 5 minutes. The MCP integrates directly into your IDE, giving instant access to Cortex data without leaving your coding environment. It reduces context switching by enabling natural questions about services and teams, and streamlines workflows with real-time data from Cortex, Jira, GitHub, and more.

Mike Long and DORA Community Discussion - Software Delivery Governance

Manual governance in regulated industries is like steering a ship with last year’s map. Approvals, ticket queues, and after-the-fact evidence collection slow delivery and increase risk. By the time an audit arrives, teams are scrambling to prove they followed the process. Watch Kosli’s Mike join Nathen Harvey at DORA to unpack why this happens — and what continuous, automated governance can do to fix it.

Sentry MCP server monitoring

We just launched MCP server monitoring in beta. You can instrument most server-side JavaScript SDK based MCP servers with one line of instrumentation code within your MCP SDK implementation using: wrapMcpServerWithSentry(McpServer) See details like protocol usage, client usage, traffic, tool usage, and performance across your MCP implementation so you you can get visibility into all the sharp edges that your MCP server has — who’s using it, how it’s working (or not), and get alerted when things break.

Fiber Paths and Failsafes: Why Your Network Design Matters

Redundancy isn’t just a buzzword – it’s the design principle keeping modern AI and cloud applications online. In this Uplink episode, Kevin Schlosser, Interconnection Product Manager at NTT Global Data Centers, explains how resilient infrastructure is engineered to expect failure but remain operational. We explore: Diverse entry points and fiber path management AI-driven bandwidth growth: 100G standard, 400G emerging Cooling innovations for intense compute workloads Why providers without their own fiber may offer the most resilient paths.