Operations | Monitoring | ITSM | DevOps | Cloud

Understanding Kafka with Speedscale #speedscale #kafka #visualization #engineering #production

In this video, we're breaking down the complex world of Apache Kafka and showing you how to gain deep visibility into your event streaming architecture using Speedscale. Kafka is the backbone of modern, cloud-native systems, but understanding what's happening in production—which topics are receiving traffic, where messages are going, and how services are interacting can be a real challenge. We'll cover how Speedscale makes Kafka visualization and debugging simple by.

Introducing Bits AI SRE, your AI on-call teammate

Bits AI SRE is your AI on-call teammate, built to autonomously investigate alerts and coordinate incident response. Integrated with Datadog, Slack, GitHub, Confluence, and more, Bits analyzes telemetry, reads documentation, and reviews recent deployments to determine the root cause of alerts—often before you’ve even opened your laptop. In fact, if you're using Datadog On-Call, you can view Bits’s findings right from your phone—so you’re always one step ahead, no matter where you are.

Essential Updates for Server 2008 and 2012 #shorts #patch

Servicing stack updates for Server 2008 and 2012 require users to remain current to avoid operational impacts. Development tools also receive necessary updates, prompting action from development and operations teams. Key updates for Azure Monitor and Microsoft Visual Studio emphasize the importance of team engagement. The content also addresses the end of life for certain services, highlighting the need for awareness and preparation.

Demo Roundups! Building Resilient On-Call Operations for the Holiday Season

The holidays are retailers' make-or-break moment - when every minute of downtime directly impacts revenue and customer experience. Join us for a retail-focused deep dive into building holiday-ready on-call operations that protect your peak season revenue. We'll demonstrate how PagerDuty's new scheduling experience and AI assistance ensure seamless coverage during your busiest - and most critical - time of year.

New Feature Friday: Understand & Improve Your DORA Performance with Cortex

This week on New Feature Friday, we’re highlighting two new releases that make it easier than ever to understand and improve your DORA performance: DORA Academy Course A guided learning experience that shows you how to use DORA Metrics and Cortex together to drive better engineering outcomes—without the data chaos. DORA Operational Readiness Scorecard An out-of-the-box template that benchmarks each service against DORA standards, giving teams an instant snapshot of where they stand and where to focus.

The next era of IT management with ManageEngine: What agentic AI will unlock

Agentic AI is generating a lot of buzz, but what does it actually do for IT teams? Join us as we showcase how the industry is evolving in the new era. What once took hours or days will soon take minutes—unlocking a new level of productivity and efficiency for IT operations. The foundation of this evolution? AI-driven contextual analytics. Agenda.

Data Observability: Build confidence in the data life cycle

Datadog Data Observability provides a complete solution with quality checks (e.g., volume, row changes, freshness), custom SQL-based monitors, anomaly detection, column-level lineage across systems like Snowflake and Tableau, full pipeline visibility, and targeted alerts when data issues arise.