
Datadog on Kafka

As a company, Datadog ingests trillions of data points per day. Kafka is the messaging persistence layer underlying many of our high-traffic services. Consequently, our Kafka usage is quite high: double-digit gigabytes per second of bandwidth and petabytes of high-performance storage, even for relatively short retention windows. In this episode, we’ll speak with two engineers responsible for scaling the Kafka infrastructure within Datadog, Balthazar Rouberol and Jamie Alquiza. They'll share their strategy for scaling Kafka, explain how it has been deployed on Kubernetes, and introduce kafka-kit, our open source toolkit for scaling Kafka clusters. You'll leave with lessons learned while scaling persistent storage on modern orchestrated infrastructure, and actionable insights you can apply at your organization.

How the new normal will change company culture for good

Last night I dreamt I was back in the office for the first time. Our long communal tables in the kitchen were gone. My desk was surrounded on all sides by plexiglass – including overhead, which, for a guy my height, means stooped shoulders and a future riddled with chiropractic appointments. Nobody talked to each other except over Slack. The smell of disinfectant was inescapable. I couldn’t wait to go back home. Or at least, wake up.

Introduction to Service Request Automation

A brief introduction to Service Request Automation using the Kelverion Runbook Suite. The Kelverion Runbook Suite provides a cloud automation platform with a range of automation tools, including a rich graphical design experience, smart integrations, ready-built solutions, and the option of an easy-to-configure self-service automation portal.

Scaling open source Puppet

In my Puppet travels over the last 10 or so years, one topic has continued to arise time and again, and that has been the ability to scale open source Puppet to thousands of nodes. While the best route is to use Puppet Enterprise for solid support and a team of talented engineers to help you in your configuration management journey, sometimes the right solution for your needs is open source Puppet.

Introduction to KUDO: Automate Day-2 Operations (II)

In a previous article, we discussed KUDO and its benefits when you want to create or manage Operators. In this article, we will focus on how to start working with KUDO: installing it, using a predefined Operator, and creating your own. Installing KUDO: the first step is to install the CLI plugin so that you can manage KUDO from the command line. Depending on your OS, you can use a package manager such as Brew or Krew; however, installing the binary directly is also a straightforward option.

Meet Flowmon Packet Investigator

Flowmon Packet Investigator (FPI) is an automated network traffic auditing tool that records and interprets full packet data. Where flow data is not sufficient and more detail is needed, the Investigator captures all the packets of traffic surrounding the event for in-depth troubleshooting. What sets the Investigator apart is its built-in expert knowledge. It not only provides extensive details but also automates the analysis, assessing the captured events, looking for error codes, and providing explanations and suggestions for a remedy.

Citrix MP -30% EOL Deal

Citrix announced EOL for their SBC/VDI SCOM MPs. We understand that a transition to an alternative solution takes some effort, especially during these extraordinary times. To support a smooth transition, we offer a 30% discount on a 1-year subscription to MetrixInsight for CVAD SCOM MP during June, July, and August 2020 for all Citrix CVA(D) Premium license customers.

Everything You Need to Know about Kubernetes Services Networking in Your Rancher Cluster

As a leading, open-source multi-cluster orchestration platform, Rancher lets operations teams deploy, manage and secure enterprise Kubernetes. Rancher also gives users a set of CNI options to choose from, including open-source Project Calico.