Operations | Monitoring | ITSM | DevOps | Cloud

Though the crisis: AXA IM

Our customer base was impacted globally by COVID-19, and we were honored to be able to support them whenever and however we could. Now, in this new Through the Crisis series, they’ll be sharing some of their stories from the IT frontline. The first contribution comes from Christophe Verducci, Head of Proximity Services at leading financial services provider AXA IM.

Datadog on Kubernetes

When 2 years ago Datadog decided to move its infrastructure platform to Kubernetes we didn’t expect to find so many roadblocks, but ingesting trillions of datapoints per day in a reliable fashion requires pushing the limits of cloud computing. Creating and managing dozens of clusters, with thousands of nodes each and operating in several clouds was a challenging but rewarding learning experience. In this episode Ara Pulido, Developer Advocate, will chat with Laurent Bernaille, Staff Engineer at Datadog and part of the team that created Datadog’s Kubernetes platform. We’ll cover the challenges we found creating and scaling Datadog’s Kubernetes platform and how we overcame them.

KMC - Running a Multi-Cluster Service Mesh in Rancher

If you are or your organization is running Kubernetes, then it's likely that you are running more than one cluster. This model works well for most, but starts to break down when microservices and service mesh enter the discussion. However, with some additional configuration and tooling , a multicluster mesh can take advantage of distributed Kubernetes environments and the services therein. In this Master Class, recorded on June 20, 2020, Rancher Field Engineer Jason Skrzypek discusses and demos a multi-cluster service mesh running Istio.

Datadog on Kafka

As a company, Datadog ingests trillions of data points per day. Kafka is the messaging persistence layer underlying many of our high-traffic services. Consequently, our Kafka usage is quite high: double-digit gigabytes per second bandwidth and the need for petabytes of high performance storage, even for relatively short retention windows. In this episode, we’ll speak with two engineers responsible for scaling the Kafka infrastructure within Datadog, Balthazar Rouberol and Jamie Alquiza. They'll share their strategy in scaling Kafka, how it’s been deployed on Kubernetes, and introduce kafka-kit; our open source toolkit for scaling Kafka clusters. You'll leave with lessons learned while scaling persistent storage on modern orchestrated infrastructure, and actionable insights you can apply at your organization

Introduction to Service Request Automation

A brief introduction to Service Request Automation using the Kelverion Runbook Suite. The Kelverion Runbook Suite provides a cloud automation platform with a range of automation tools including; a rich graphical design experience, smart integrations, ready built solutions and the option of an easy to configure self-service automation portal.